Exam (elaborations)

Databricks Certified Data Engineer Associate Practice Questions and Answers 2023 with complete solution

Rating: -
Sold: -
Pages: 14
Grade: A+
Uploaded on: 17-04-2023
Written in: 2022/2023



Document information

Uploaded on: 17 April 2023
Number of pages: 14
Written in: 2022/2023
Type: Exam (elaborations)
Contains: Questions and answers


Content preview

Databricks Certified Data Engineer Associate Practice
Questions and Answers 2023 with complete solution
Which of the following describes a benefit of a data lakehouse that is unavailable in a
traditional data warehouse?
A. A data lakehouse provides a relational system of data management.
B. A data lakehouse captures snapshots of data for version control purposes.
C. A data lakehouse couples storage and compute for complete control.
D. A data lakehouse utilizes proprietary storage formats for data.
E. A data lakehouse enables both batch and streaming analytics.
Answer: E. A data lakehouse enables both batch and streaming analytics.
Which of the following locations hosts the driver and worker nodes of a Databricks-
managed cluster?
A. Data plane
B. Control plane
C. Databricks Filesystem
D. JDBC data source
E. Databricks web application
Answer: A. Data plane
A data architect is designing a data model that works for both video-based machine
learning workloads and highly audited batch ETL/ELT workloads. Which of the following
describes how using a data lakehouse can help the data architect meet the needs of
both workloads?
A. A data lakehouse requires very little data modeling.
B. A data lakehouse combines compute and storage for simple governance.
C. A data lakehouse provides autoscaling for compute clusters.
D. A data lakehouse stores unstructured data and is ACID-compliant.
E. A data lakehouse fully exists in the cloud.
Answer: D. A data lakehouse stores unstructured data and is ACID-compliant.
Which of the following describes a scenario in which a data engineer will want to use a
Job cluster instead of an all-purpose cluster?
A. An ad-hoc analytics report needs to be developed while minimizing compute costs.
B. A data team needs to collaborate on the development of a machine learning model.
C. An automated workflow needs to be run every 30 minutes.
D. A Databricks SQL query needs to be scheduled for upward reporting.
E. A data engineer needs to manually investigate a production error.
Answer: C. An automated workflow needs to be run every 30 minutes.
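To make the job-cluster scenario concrete, here is a minimal sketch of what a scheduled job definition might look like as a Jobs API 2.1-style settings payload. The job name, notebook path, Spark version, and node type are placeholder assumptions, not values from the exam; the point is only that the task requests a `new_cluster` (a job cluster created for the run) and that the schedule is a Quartz cron expression firing every 30 minutes.

```python
import json


def build_job_settings(notebook_path: str, minutes: int = 30) -> dict:
    """Return a Jobs API 2.1-style settings payload (illustrative values only)."""
    if not 1 <= minutes <= 59 or 60 % minutes != 0:
        raise ValueError("minutes must be between 1 and 59 and divide 60 evenly")
    return {
        "name": "example-scheduled-workflow",  # hypothetical job name
        "schedule": {
            # Quartz cron fields: second minute hour day-of-month month day-of-week
            "quartz_cron_expression": f"0 0/{minutes} * * * ?",
            "timezone_id": "UTC",
        },
        "tasks": [
            {
                "task_key": "run_pipeline",
                "notebook_task": {"notebook_path": notebook_path},
                # new_cluster asks for a job cluster that exists only for the run,
                # as opposed to attaching to an all-purpose cluster by ID.
                "new_cluster": {
                    "spark_version": "13.3.x-scala2.12",  # placeholder
                    "node_type_id": "i3.xlarge",  # placeholder
                    "num_workers": 2,
                },
            }
        ],
    }


settings = build_job_settings("/Repos/example/pipeline")  # hypothetical path
print(json.dumps(settings, indent=2))
```

The payload is only built and printed here; submitting it to a workspace would need an authenticated request, which is left out of this sketch.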
A data engineer has created a Delta table as part of a data pipeline. Downstream data
analysts now need SELECT permission on the Delta table. Assuming the data engineer
is the Delta table owner, which part of the Databricks Lakehouse Platform can the data
engineer use to grant the data analysts the appropriate access?
A. Repos
B. Jobs
C. Data Explorer
D. Databricks Filesystem
E. Dashboards
Answer: C. Data Explorer
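Data Explorer is a point-and-click interface, but the same kind of grant can also be written as a SQL statement. The sketch below only builds the statement text; the table name `sales.orders` and the group name `data-analysts` are hypothetical, and `grant_select_statement` is an illustrative helper rather than part of any Databricks library.

```python
def grant_select_statement(table: str, principal: str) -> str:
    """Build a Databricks SQL GRANT statement for read-only access to a table."""
    # Backticks quote the principal so names containing '-' or '@' are accepted;
    # reject embedded backticks so the quoting cannot be broken out of.
    if "`" in table or "`" in principal:
        raise ValueError("backticks are not allowed in identifiers")
    return f"GRANT SELECT ON TABLE {table} TO `{principal}`"


statement = grant_select_statement("sales.orders", "data-analysts")
print(statement)
```

In a Databricks notebook the resulting string could be passed to `spark.sql(...)` by the table owner, assuming table access control is enabled in the workspace.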
Two junior data engineers are authoring separate parts of a single data pipeline
notebook. They are working on separate Git branches so they can pair program on the
same notebook simultaneously. A senior data engineer experienced in Databricks
suggests there is a better alternative for this type of collaboration. Which of the following
supports the senior data engineer's claim?
A. Databricks Notebooks support automatic change-tracking and versioning
B. Databricks Notebooks support real-time coauthoring on a single notebook
C. Databricks Notebooks support commenting and notification comments
D. Databricks Notebooks support the use of multiple languages in the same notebook
E. Databricks Notebooks support the creation of interactive data visualizations
Answer: B. Databricks Notebooks support real-time coauthoring on a single notebook
Which of the following describes how Databricks Repos can help facilitate CI/CD
workflows on the Databricks Lakehouse Platform?
A. Databricks Repos can facilitate the pull request, review, and approval process before
merging branches
B. Databricks Repos can merge changes from a secondary Git branch into a main Git
branch
C. Databricks Repos can be used to design, develop, and trigger Git automation
pipelines
D. Databricks Repos can store the single-source-of-truth Git repository
E. Databricks Repos can commit or push code changes to trigger a CI/CD process
Answer: E. Databricks Repos can commit or push code changes to trigger a CI/CD process
Which of the following statements describes Delta Lake?
A. Delta Lake is an open source analytics engine used for big data workloads.
B. Delta Lake is an open format storage layer that delivers reliability, security, and
performance.
C. Delta Lake is an open source platform to help manage the complete machine
learning lifecycle.
D. Delta Lake is an open source data storage format for distributed data.
E. Delta Lake is an open format storage layer that processes data.
Answer: B. Delta Lake is an open format storage layer that delivers reliability, security, and
performance.
A data architect has determined that a table of the following format is necessary:
-------------------------------
| id | birthDate  | avgRating |
-------------------------------
| a1 | 1990-01-06 | 5.5       |
-------------------------------
| a2 | 1974-11-21 | 7.1       |
-------------------------------
| .. | ..         | ..        |
-------------------------------
Which of the following code blocks uses SQL DDL commands to create an empty Delta
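The preview cuts off before the answer options are listed, so the following is not one of the exam's code blocks. It is only a rough sketch of how DDL for a table of the format shown could be put together. The table name `people` and the column types (`STRING`, `DATE`, `DOUBLE`) are assumptions inferred from the sample values, and `create_table_ddl` is a hypothetical helper.

```python
def create_table_ddl(table: str, columns: dict) -> str:
    """Build a CREATE TABLE statement that defines an empty Delta table."""
    # Column order follows the insertion order of the dict.
    cols = ", ".join(f"{name} {dtype}" for name, dtype in columns.items())
    return f"CREATE TABLE IF NOT EXISTS {table} ({cols}) USING DELTA"


ddl = create_table_ddl(
    "people",  # hypothetical table name
    {"id": "STRING", "birthDate": "DATE", "avgRating": "DOUBLE"},  # assumed types
)
print(ddl)
```

Because the statement defines columns without a `SELECT` clause, running it would create the table with no rows.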

Seller: magdamwikash23, Western Governors University