Free Access to GAQM.Databricks-Certified-Data-Engineer-Associate.v2024-11-18.q107 with Valid Practice Test (Page 9)

Question 36

What is stored in a Databricks customer's cloud account?

A.Data
B.Cluster management metadata
C.Databricks web application
D.Notebooks

Question 37

A data engineer has configured a Structured Streaming job to read from a table, manipulate the data, and then perform a streaming write into a new table.
The code block used by the data engineer is below:

If the data engineer only wants the query to execute a micro-batch to process data every 5 seconds, which of the following lines of code should the data engineer use to fill in the blank?

A.trigger("5 seconds")
B.trigger()
C.trigger(once="5 seconds")
D.trigger(processingTime="5 seconds")
E.trigger(continuous="5 seconds")
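
To make the trigger options concrete, here is a minimal sketch of a fixed-interval streaming write in PySpark. It is not the data engineer's code block from the question. The function name, the table names, and the checkpoint path are hypothetical, and `spark` is assumed to be an existing SparkSession such as the one a Databricks notebook provides.

```python
def start_five_second_stream(spark, source_table, target_table, checkpoint_path):
    """Start a streaming copy that fires one micro-batch every 5 seconds.

    `spark` is assumed to be an existing SparkSession; all names passed in
    are placeholders chosen for illustration.
    """
    return (
        spark.readStream.table(source_table)            # streaming read from a table
        .writeStream
        .outputMode("append")
        .option("checkpointLocation", checkpoint_path)  # required for fault tolerance
        .trigger(processingTime="5 seconds")            # fixed-interval micro-batches
        .toTable(target_table)                          # starts the query
    )
```

In PySpark, `trigger` takes keyword arguments only, so the interval has to be passed by name. `processingTime` sets a recurring micro-batch interval, `once=True` processes the available data a single time, and `continuous` selects the experimental continuous processing mode, where the interval is a checkpoint interval rather than a batch interval.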

Question 38

A data analyst has developed a query that runs against a Delta table. They want help from the data engineering team to implement a series of tests to ensure the data returned by the query is clean. However, the data engineering team uses Python for its tests rather than SQL.
Which of the following operations could the data engineering team use to run the query and operate with the results in PySpark?

A.SELECT * FROM sales
B.spark.delta.table
C.spark.sql
D.There is no way to share data between PySpark and SQL.
E.spark.table
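
As an illustration of moving from a SQL query to Python-side checks, the sketch below passes a query string to `spark.sql`, which returns a PySpark DataFrame that ordinary Python tests can inspect. The helper names and the column name are hypothetical, and `spark` is assumed to be an existing SparkSession.

```python
def run_query(spark, query):
    """Run a SQL string through the SparkSession and return a DataFrame."""
    return spark.sql(query)


def count_null_rows(df, column):
    """Count rows where `column` is NULL, a typical cleanliness check."""
    return df.filter(df[column].isNull()).count()


# Example usage inside a test (the query and column name are placeholders):
# df = run_query(spark, "SELECT * FROM sales")
# assert count_null_rows(df, "amount") == 0
```

By contrast, `spark.table` loads a whole table by name and does not execute an arbitrary query, which is why it would not carry the analyst's query logic over to Python.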

Question 39

A data engineering team has two tables. The first table march_transactions is a collection of all retail transactions in the month of March. The second table april_transactions is a collection of all retail transactions in the month of April. There are no duplicate records between the tables.
Which of the following commands should be run to create a new table all_transactions that contains all records from march_transactions and april_transactions without duplicate records?

A.CREATE TABLE all_transactions AS
SELECT * FROM march_transactions
UNION SELECT * FROM april_transactions;
B.CREATE TABLE all_transactions AS
SELECT * FROM march_transactions
OUTER JOIN SELECT * FROM april_transactions;
C.CREATE TABLE all_transactions AS
SELECT * FROM march_transactions
INTERSECT SELECT * FROM april_transactions;
D.CREATE TABLE all_transactions AS
SELECT * FROM march_transactions
INNER JOIN SELECT * FROM april_transactions;
E.CREATE TABLE all_transactions AS
SELECT * FROM march_transactions
MERGE SELECT * FROM april_transactions;
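
The set-operator behavior can be checked on a small scale. The sketch below uses Python's built-in `sqlite3` module rather than Spark SQL, on the assumption that `UNION` and `INTERSECT` follow standard SQL semantics in both engines. The two-column schema and the sample rows are invented for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE march_transactions (id INTEGER, amount REAL);
    CREATE TABLE april_transactions (id INTEGER, amount REAL);
    INSERT INTO march_transactions VALUES (1, 10.0), (2, 20.0);
    INSERT INTO april_transactions VALUES (3, 30.0), (4, 40.0);

    -- UNION appends the rows of both tables and removes duplicates
    CREATE TABLE all_transactions AS
    SELECT * FROM march_transactions
    UNION
    SELECT * FROM april_transactions;
    """
)

# Every row from both months ends up in the combined table
union_rows = conn.execute(
    "SELECT id, amount FROM all_transactions ORDER BY id"
).fetchall()

# INTERSECT keeps only rows present in both tables; with no shared rows it is empty
intersect_rows = conn.execute(
    "SELECT * FROM march_transactions INTERSECT SELECT * FROM april_transactions"
).fetchall()

print(union_rows)
print(intersect_rows)
```

With no overlapping records, `UNION` yields all four sample rows while `INTERSECT` yields none. The `OUTER JOIN`, `INNER JOIN`, and `MERGE` variants in the options are not valid ways to combine two `SELECT` statements in this position.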

Question 40

A data engineer is working with two tables. Each of these tables is displayed below in its entirety.
The data engineer runs the following query to join these tables together:
Which of the following will be returned by the above query?

A.Option A
B.Option B
C.Option C
D.Option D
E.Option E
