Free Access to GAQM.Databricks-Certified-Data-Engineer-Associate.v2024-09-16.q91 with Valid Practice Test (Page 4)

Question 11

Which of the following must be specified when creating a new Delta Live Tables pipeline?

A.A key-value pair configuration
B.The preferred DBU/hour cost
C.A path to cloud storage location for the written data
D.A location of a target database for the written data
E.At least one notebook library to be executed

Question 12

Which of the following commands can be used to write data into a Delta table while avoiding the writing of duplicate records?

A.DROP
B.IGNORE
C.MERGE
D.APPEND
E.INSERT

Question 13

A Delta Live Table pipeline includes two datasets defined using STREAMING LIVE TABLE. Three datasets are defined against Delta Lake table sources using LIVE TABLE.
The table is configured to run in Production mode using the Continuous Pipeline Mode.
Assuming previously unprocessed data exists and all definitions are valid, what is the expected outcome after clicking Start to update the pipeline?

A.All datasets will be updated at set intervals until the pipeline is shut down. The compute resources will persist to allow for additional testing.
B.All datasets will be updated once and the pipeline will persist without any processing. The compute resources will persist but go unused.
C.All datasets will be updated at set intervals until the pipeline is shut down. The compute resources will be deployed for the update and terminated when the pipeline is stopped.
D.All datasets will be updated once and the pipeline will shut down. The compute resources will be terminated.
E.All datasets will be updated once and the pipeline will shut down. The compute resources will persist to allow for additional testing.

Question 14

A data engineer has developed a data pipeline to ingest data from a JSON source using Auto Loader, but the engineer has not provided any type inference or schema hints in their pipeline. Upon reviewing the data, the data engineer has noticed that all of the columns in the target table are of the string type despite some of the fields only including float or boolean values.
Which of the following describes why Auto Loader inferred all of the columns to be of the string type?

A.There was a type mismatch between the specific schema and the inferred schema
B.JSON data is a text-based format
C.Auto Loader only works with string data
D.All of the fields had at least one null value
E.Auto Loader cannot infer the schema of ingested data

Question 15

A data engineer is attempting to drop a Spark SQL table my_table and runs the following command:
DROP TABLE IF EXISTS my_table;
After running this command, the engineer notices that the data files and metadata files have been deleted from the file system.
Which of the following describes why all of these files were deleted?

A.The table was managed
B.The table's data was smaller than 10 GB
C.The table's data was larger than 10 GB
D.The table was external
E.The table did not have a location

Question 11

Question 12

Question 13

Question 14

Question 15

Download PDF File