Free Access to GAQM.Databricks-Certified-Data-Engineer-Associate.v2024-11-18.q107 with Valid Practice Test (Page 18)

Question 81

Which of the following benefits is provided by the array functions from Spark SQL?

A. An ability to work with data in a variety of types at once
B. An ability to work with data within certain partitions and windows
C. An ability to work with time-related data in specified intervals
D. An ability to work with complex, nested data ingested from JSON files
E. An ability to work with an array of tables for procedural automation

Question 82

Which file format is used for storing a Delta Lake table?

A. Parquet
B. Delta
C. CSV
D. JSON

Question 83

A single Job runs two notebooks as two separate tasks. A data engineer has noticed that one of the notebooks is running slowly in the Job's current run. The data engineer asks a tech lead for help in identifying why this might be the case.
Which of the following approaches can the tech lead use to identify why the notebook is running slowly as part of the Job?

A. They can navigate to the Tasks tab in the Jobs UI and click on the active run to review the processing notebook.
B. There is no way to determine why a Job task is running slowly.
C. They can navigate to the Tasks tab in the Jobs UI to immediately review the processing notebook.
D. They can navigate to the Runs tab in the Jobs UI to immediately review the processing notebook.
E. They can navigate to the Runs tab in the Jobs UI and click on the active run to review the processing notebook.

Question 84

A data engineer has realized that the data files associated with a Delta table are incredibly small. They want to compact the small files to form larger files to improve performance.
Which of the following keywords can be used to compact the small files?

A. REDUCE
B. OPTIMIZE
C. COMPACTION
D. REPARTITION
E. VACUUM

Question 85

Which of the following benefits of using the Databricks Lakehouse Platform is provided by Delta Lake?

A. The ability to manipulate the same data using a variety of languages
B. The ability to collaborate in real time on a single notebook
C. The ability to set up alerts for query failures
D. The ability to support batch and streaming workloads
E. The ability to distribute complex data operations
