100% satisfaction guarantee Immediately available after payment Both online and in PDF No strings attached 4.6 TrustPilot
logo-home
Exam (elaborations)

GCP Professional Data Engineer Exam With Verified Solutions

Rating
-
Sold
-
Pages
15
Grade
A+
Uploaded on
01-11-2024
Written in
2024/2025

GCP Professional Data Engineer Exam With Verified Solutions...

Institution
GCP Professional Data Engineer
Module
GCP Professional Data Engineer

Content preview

GCP Professional Data Engineer Exam With
Verified Solutions


1. You are building an application that will only detect and label certain
business-to-business product logos within an image. You do not have an in-depth
background working with machine learning models, but you need to get your application
up and running. What is the current best method to accomplish this task? a. The current
best method would be to utilize the AutoML Vision service to train a custom model using
the Vision API.

i. The newly added AutoML services let you train custom image-and-among other
models-using the Google-pretrained API's as a base. Training a custom model will also
work on AI Platform, but this route requires less manual model overhead.



2. Your company streams telemetry data into BigQuery for long-term storage of 2 years
and analysis. Data comes in at a rate of close to 100 million records per day. They want
to be able to run queries against certain time periods of data without incurring the costs
of querying all available records. What is the preferred method to do this? a. Partition a
single table by day, and run queries against individual partitions.

i. Partitioning a single table based on date allows you to keep only one table while, at the
same time, perform queries on a small subset of it. Even though it is technically valid to
use many tables - one for every day - using wildcards, best practice is partitioning a
single table.



3. You are an administrator for a few organizations within the same company. Each
organization has data in their own BigQuery table within a single project. Because of
reasons related to application access, all of the tables must remain in the same project.
You believe each organization should have the ability to view and execute queries
against their own data without revealing data from organizations to unauthorized
viewers. What would you recommend? - Answer a. In that project, create one dataset
per organization. Put each organization's table into its own dataset. Bind access to the
dataset per organization they are in to that company. Now they can see their table but
nobody else's.

i. You can only assign roles at the dataset level. Putting tables into different datasets lets
you control access per dataset.

, 4. Your company is making the move to Google Cloud and has decided to go with a
managed database service to reduce overhead. Your current database supports a
product catalog that does real-time inventory tracking for a retailer. Your database is
500 GB in size. The data is semi-structured, but doesn't require full atomicity. You want
a truly no-ops/ serverless solution. Which of the following should you use for your
storage? a. Cloud Datastore

i. Datastore is ideal for semi-structured data less than 1TB in size. Product catalogs are
a recommended use case.



5. How would you configure your Dataproc environment to use BigQuery as an input and
output source? - Answer a. Install the BigQuery connector on your Dataproc cluster.

i. You can install the BigQuery connector to your cluster for direct programmatic
read/write access to BigQuery. Note that a Cloud Storage bucket is used between the
two services, but you'll interact directly with BigQuery from Dataproc.



6. In AI Platform, what does the CUSTOM tier allow you to configure? Choose the best
answer. - Answer a. Custom number of workers and parameter servers. Machine type of
master server

i. Correct. You can customize the number of workers and parameter servers, but
masters are set to one.



7. You are creating a data pipeline in Google Cloud. You need to preprocess source data
for a machinelearning model. In particular, you need to quickly remove duplicate rows
from three input tables, and you need to remove outliers from columns of data for which
you don't know the distribution of data. What do you do? - Answer a. The following
procedure uses Cloud Dataprep to review the range of values in sample source data
table columns and add the necessary transformations to the job. To do so, for each
column, click the column name, and click each appropriate suggested transformation,
then click Add to add each transformation to the Cloud Dataprep job.

i. Dataprep would be the correct choice since the requirements are to prepare/clean the
source data. For deduplication, using the suggestion transformation is easier and faster
than creating a recipe, which is more work than necessary.



8. You keep regular snapshots of the boot disks of running Compute Engine instances as
part of a backup and restore plan. You need to restore these snapshots for the fewest
number of steps with replacement instances. What do you do? - Answer a. Use the
snapshots to create replacement instances as needed.

Written for

Institution
GCP Professional Data Engineer
Module
GCP Professional Data Engineer

Document information

Uploaded on
November 1, 2024
Number of pages
15
Written in
2024/2025
Type
Exam (elaborations)
Contains
Questions & answers

Subjects

Get to know the seller

Seller avatar
Reputation scores are based on the amount of documents a seller has sold for a fee and the reviews they have received for those documents. There are three levels: Bronze, Silver and Gold. The better the reputation, the more your can rely on the quality of the sellers work.
Easton West Virgina University
Follow You need to be logged in order to follow users or courses
Sold
537
Member since
3 year
Number of followers
221
Documents
25955
Last sold
20 hours ago

3.9

115 reviews

5
54
4
21
3
23
2
7
1
10

Trending documents

Recently viewed by you

Why students choose Stuvia

Created by fellow students, verified by reviews

Quality you can trust: written by students who passed their exams and reviewed by others who've used these revision notes.

Didn't get what you expected? Choose another document

No problem! You can straightaway pick a different document that better suits what you're after.

Pay as you like, start learning straight away

No subscription, no commitments. Pay the way you're used to via credit card and download your PDF document instantly.

Student with book image

“Bought, downloaded, and smashed it. It really can be that simple.”

Alisha Student

Frequently asked questions