FreeQAs
GAQM Certification · Databricks-Certified-Data-Engineer-Associate Exam
GAQM.Databricks-Certified-Data-Engineer-Associate.v2024-09-16.q91 Dumps

Question 6

A data engineer is using the following code block as part of a batch ingestion pipeline to read from a composable table:

Which of the following changes needs to be made so this code block will work when the transactions table is a stream source?

Correct Answer: E
Explanation: https://docs.databricks.com/en/structured-streaming/delta-lake.html
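The code block referenced in the question is not reproduced on this page. Per the linked documentation, the change the correct answer describes is switching from the batch read API to the streaming read API. A minimal sketch, assuming a Delta table named `transactions` and an active PySpark/Databricks environment (it will not run outside one):

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

# Batch read: returns a static DataFrame that reads the table once.
batch_df = spark.read.table("transactions")

# Streaming read: spark.readStream treats the same Delta table as a
# stream source, picking up new rows as they are committed.
stream_df = spark.readStream.table("transactions")
```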

Question 7

A data engineer has a Job with multiple tasks that runs nightly. Each of the tasks runs slowly because the clusters take a long time to start.
Which of the following actions can the data engineer perform to improve the startup time for the clusters used for the Job?

Correct Answer: D
The best action the data engineer can perform is to use clusters drawn from a cluster pool. A cluster pool keeps a set of idle, pre-provisioned instances that new clusters attach to, so cluster creation skips instance provisioning and the tasks start sooner. Pools can also be shared by multiple users and jobs, improving cost and resource efficiency.
Option A is not relevant: endpoints in Databricks SQL serve SQL analytics workloads and have no effect on the startup time of Job clusters.
Option B is not correct: jobs clusters and all-purpose clusters have similar startup times. A jobs cluster is dedicated to a single job run and terminates when the job completes; an all-purpose cluster can serve interactive sessions, notebooks, or multiple jobs. Both types can draw from a cluster pool.
Option C is not advisable: a single-node cluster has no separate worker nodes (the driver runs all the work) and is typically used for testing or development. It would reduce the parallelism and performance of the tasks and is unsuitable for production jobs that need scalability and fault tolerance.
Option E is not helpful: autoscaling adjusts the number of worker nodes to the workload after the cluster is running. It optimizes resource utilization and cost, but it does not speed up cluster creation.
References:
* Cluster Pools
* Jobs
* Clusters
* Databricks Data Engineer Professional Exam Guide
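As a concrete illustration, a job can point its cluster at a pool through the cluster spec's `instance_pool_id` field. The sketch below shows only the shape of that spec (field names follow the Databricks Clusters API; the pool ID and Spark version are illustrative placeholders, and a pool must already exist):

```python
# Sketch of a Jobs API "new_cluster" spec that draws from a cluster pool.
# All values are illustrative placeholders, not real IDs.
new_cluster = {
    "spark_version": "13.3.x-scala2.12",
    "num_workers": 2,
    # Reuse idle, pre-provisioned instances from a pool to cut startup time.
    "instance_pool_id": "1234-567890-pool-abcdef",
}

print(new_cluster["instance_pool_id"])
```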

Question 8

Which of the following SQL keywords can be used to convert a table from a long format to a wide format?

Correct Answer: A
The SQL keyword that converts a table from a long format to a wide format is PIVOT. The PIVOT clause rotates the rows of a table into columns of a new table [1]. It aggregates the values of one column, grouped by the distinct values of another column, and uses those distinct values as the names of the new columns [1]. This reshapes data from a long format, where each row records a single attribute of an observation as a name-value pair, to a wide format, where each row holds one observation with its attributes spread across separate columns [2]. For example, PIVOT can convert a table with one sales row per product-region pair into a table with one row per region and a separate sales column per product [1].
The other options cannot reshape a table. CONVERT is a function that changes the data type of an expression [3]. WHERE is a clause that filters the rows of a table on a condition [4]. TRANSFORM is a keyword that applies a user-defined function to a group of rows [5]. SUM is a function that totals a numeric column.
References:
* 1: PIVOT | Databricks on AWS
* 2: Reshaping Data - Long vs Wide Format | Databricks on AWS
* 3: CONVERT | Databricks on AWS
* 4: WHERE | Databricks on AWS
* 5: TRANSFORM | Databricks on AWS
* SUM | Databricks on AWS
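To make the long-to-wide reshape concrete: in Databricks SQL it would look roughly like `SELECT * FROM sales PIVOT (SUM(amount) FOR product IN ('widget', 'gadget'))`. The runnable sketch below emulates the same pivot with conditional aggregation in SQLite (which has no PIVOT clause); the `sales` table and its columns are invented for the example.

```python
import sqlite3

# Long format: one row per (region, product) observation.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (region TEXT, product TEXT, amount INT)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?, ?)",
    [("east", "widget", 10), ("east", "gadget", 5), ("west", "widget", 7)],
)

# Wide format: one row per region, one column per product -- what PIVOT
# produces, emulated here with CASE-based conditional aggregation.
rows = conn.execute("""
    SELECT region,
           SUM(CASE WHEN product = 'widget' THEN amount ELSE 0 END) AS widget,
           SUM(CASE WHEN product = 'gadget' THEN amount ELSE 0 END) AS gadget
    FROM sales
    GROUP BY region
    ORDER BY region
""").fetchall()

print(rows)  # [('east', 10, 5), ('west', 7, 0)]
```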

Question 9

Which of the following Structured Streaming queries is performing a hop from a Silver table to a Gold table?

Correct Answer: D
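A hedged sketch of what a Silver-to-Gold hop typically looks like: the query reads a Silver table as a stream and writes an aggregated, business-level summary to a Gold table (the aggregation is the tell). Table names, columns, and the checkpoint path are invented for the example, and it requires a PySpark/Databricks environment:

```python
from pyspark.sql import SparkSession
import pyspark.sql.functions as F

spark = SparkSession.builder.getOrCreate()

# Silver -> Gold: stream from the cleaned Silver table and write an
# aggregated business-level summary to the Gold table.
(
    spark.readStream.table("sales_silver")
    .groupBy("region")
    .agg(F.sum("amount").alias("total_amount"))
    .writeStream
    .outputMode("complete")  # rewrite the full aggregate each trigger
    .option("checkpointLocation", "/tmp/_ckpt_sales_gold")  # illustrative
    .toTable("sales_gold")
)
```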

Question 10

An engineering manager wants to monitor the performance of a recent project using a Databricks SQL query.
For the first week following the project's release, the manager wants the query results to be updated every minute. However, the manager is concerned that the compute resources used for the query will be left running and cost the organization a lot of money beyond the first week of the project's release.
Which of the following approaches can the engineering team use to ensure the query does not cost the organization any money beyond the first week of the project's release?

Correct Answer: E
In Databricks SQL, scheduled query executions keep dashboards up to date and drive routine alerts. By default, a query has no schedule; the schedule pickers set the frequency, period, starting time, and time zone, and selecting the End date checkbox ends the schedule on a chosen calendar date. Setting the end date one week after the project's release ensures the query stops running, and stops incurring compute cost, beyond that week.
Option A is incorrect: limiting the number of DBUs does not stop the query from running. Option B is incorrect: there is no option to end a schedule after a set number of refreshes. Option C is incorrect: there is a way to cap the cost (the schedule's end date). Option D is incorrect: limiting who can manage the query's refresh schedule does not affect the query's execution or cost.
References:
* Schedule a query
* Schedule a query - Azure Databricks - Databricks SQL
©2025 FreeQAs

www.freeqas.com materials do not contain actual questions and answers from Cisco's certification exams.