Databricks Lakehouse Fundamentals
2025
Delta Lake - answer open-source storage layer designed to run on top of an existing
data lake and improve its reliability, security, and performance.
Delta Lakes support (6) - answer1. ACID transactions
2. scalable metadata
3. unified streaming
4. batch data processing
5. Schema enforcement
6. semi/unstructured data allowed
Data Lake - answer system or repository of data stored in its natural/raw format, usually
object blobs or files.
Data Warehousing - answer the management of data storage and retrieval
Data Lakehouse - answer A data management system that combines the best of both
data warehousing and data lakes
downsides to data lakes (4) - answer No transactional support
Poor data reliability
Slow analysis performance
Data governance concerns
Downsides to data warehousing (3) - answer No support for semi or unstructured data
Struggle with 3Vs
Long processing times
delta lake uses - and - to process and query data at scale - answeradvanced caching
and indexing
data warehouse has what kind of data - answerstructured and clean
data warehouse has what kind of schemas - answerpredefined
data lakes have what kind of data - answerStructured, Semi, and unstructured data
data lakes support - answerstreaming
Benefits of data lakehouse - answer1. one security and governance
2025
Delta Lake - answer open-source storage layer designed to run on top of an existing
data lake and improve its reliability, security, and performance.
Delta Lakes support (6) - answer1. ACID transactions
2. scalable metadata
3. unified streaming
4. batch data processing
5. Schema enforcement
6. semi/unstructured data allowed
Data Lake - answer system or repository of data stored in its natural/raw format, usually
object blobs or files.
Data Warehousing - answer the management of data storage and retrieval
Data Lakehouse - answer A data management system that combines the best of both
data warehousing and data lakes
downsides to data lakes (4) - answer No transactional support
Poor data reliability
Slow analysis performance
Data governance concerns
Downsides to data warehousing (3) - answer No support for semi or unstructured data
Struggle with 3Vs
Long processing times
delta lake uses - and - to process and query data at scale - answeradvanced caching
and indexing
data warehouse has what kind of data - answerstructured and clean
data warehouse has what kind of schemas - answerpredefined
data lakes have what kind of data - answerStructured, Semi, and unstructured data
data lakes support - answerstreaming
Benefits of data lakehouse - answer1. one security and governance