Free Access to Google.Professional-Data-Engineer.v2024-01-19.q177 with Valid Practice Test (Page 18)

Question 81

You have a data pipeline with a Cloud Dataflow job that aggregates and writes time series metrics to Cloud Bigtable. This data feeds a dashboard used by thousands of users across the organization. You need to support additional concurrent users and reduce the amount of time required to write the dat
a. Which two actions should you take? (Choose two.)

A.Configure your Cloud Dataflow pipeline to use local execution
B.Increase the maximum number of Cloud Dataflow workers by setting maxNumWorkers in PipelineOptions
C.Increase the number of nodes in the Cloud Bigtable cluster
D.Modify your Cloud Dataflow pipeline to use the Flatten transform before writing to Cloud Bigtable
E.Modify your Cloud Dataflow pipeline to use the CoGroupByKey transform before writing to Cloud Bigtable

Question 82

A data scientist has created a BigQuery ML model and asks you to create an ML pipeline to serve predictions. You have a REST API application with the requirement to serve predictions for an individual user ID with latency under 100 milliseconds. You use the following query to generate predictions: SELECT predicted_label, user_id FROM ML.PREDICT (MODEL 'dataset.model', table user_features). How should you create the ML pipeline?

A.Add a WHERE clause to the query, and grant the BigQuery Data Viewer role to the application service account.
B.Create a Cloud Dataflow pipeline using BigQueryIO to read predictions for all users from the query. Write the results to Cloud Bigtable using BigtableIO. Grant the Bigtable Reader role to the application service account so that the application can read predictions for individual users from Cloud Bigtable.
C.Create an Authorized View with the provided query. Share the dataset that contains the view with the application service account.
D.Create a Cloud Dataflow pipeline using BigQueryIO to read results from the query. Grant the Dataflow Worker role to the application service account.

Question 83

You are building a data pipeline on Google Cloud. You need to prepare data using a casual method for a machine-learning process. You want to support a logistic regression model. You also need to monitor and adjust for null values, which must remain real-valued and cannot be removed. What should you do?

A.Use Cloud Dataprep to find null values in sample source data. Convert all nulls to `none' using a Cloud Dataproc job.
B.Use Cloud Dataflow to find null values in sample source data. Convert all nulls to `none' using a Cloud Dataprep job.
C.Use Cloud Dataflow to find null values in sample source data. Convert all nulls to using a custom script.
D.Use Cloud Dataprep to find null values in sample source data. Convert all nulls to 0 using a Cloud Dataprep job.

Question 84

You've migrated a Hadoop job from an on-prem cluster to dataproc and GCS. Your Spark job is a complicated analytical workload that consists of many shuffing operations and initial data are parquet files (on average 200-400 MB size each). You see some degradation in performance after the migration to Dataproc, so you'd like to optimize for it. You need to keep in mind that your organization is very cost- sensitive, so you'd like to continue using Dataproc on preemptibles (with 2 non-preemptible workers only) for this workload.
What should you do?

A.Switch from HDDs to SSDs, copy initial data from GCS to HDFS, run the Spark job and copy results back to GCS.
B.Switch from HDDs to SSDs, override the preemptible VMs configuration to increase the boot disk size.
C.Increase the size of your parquet files to ensure them to be 1 GB minimum.
D.Switch to TFRecords formats (appr. 200MB per file) instead of parquet files.

Question 85

Your neural network model is taking days to train. You want to increase the training speed. What can you do?

A.Subsample your test dataset.
B.Subsample your training dataset.
C.Increase the number of input features to your model.
D.Increase the number of layers in your neural network.

Question 81

Question 82

Question 83

Question 84

Question 85

Download PDF File