TEST PRACTICE questions and
answers
What is batch data? - n
Ans✔
Batch data is any load of data that has a beginning and end, is not continuous.
Qs
What is Batch Data? - n
Ans✔
Batch data includes CSV, TSV, Json, XML, Parquet, Blob files, another database,
cache for offline viewing
Qs
Describe Streaming Data - n
Ans✔
Streaming data is continuous no start or stop. Examples would be IOT, logs, etc.
Other examples: sensor data, event or IOT hub, blob storage for logs, apache kafka,
netflix, youtube, course video.
Qs
Describe characteristics of relational data - n
Ans✔
Relational data is structured, has a schema, is rigid. Databases composed of tables
with rows and columns. Data integrity based on keys, datatypes, relations.
Components include:
,- tables
- views
- Primary: unique row
- Foreign Keys: child parent relationship
Schema - layout of the database including table names, column names and their
data types
Databases enforce integrity
Qs
What are the 5 types of Analytics? - n
Ans✔
Descriptive (What Happened), Diagnostic (Why it Happened), Predictive (What will
Happen), Prescriptive (What should I do), Cognitive (Machine learning predictions
based on model)
Qs
What are Descriptive Analytics? - n
Ans✔
Descriptive Analytics describe what happened. For example revenue is down 10%
year over year.
Qs
What are Diagnostic Analytics? - n
Ans✔
Describes why it happened. For example revenue is down 10% year over year due
to corona virus which resulted in less purchases industry wide.
Qs
What are Predictive Analytics? - n
,Ans✔
Describe what will happen. Next time we have a pandemic we can expect revenue
to drop 10% based on history.
Qs
What are Prescriptive Analytics? - n
Ans✔
Describe what to do to fix problem. We need to drop prices 10% to encourage
customers to make bigger purchases since less people are outside.
Qs
What are cognitive analytics? - n
Ans✔
ML recommendations based on a model. AI
Qs
What is ELT process? - n
Ans✔
(ELT) a form of data processing that stands for extract, load then transform.
Example of how this would be done
Extract data and load it to data lake then perform transformations with databricks
and move to synapse data warehouse
Data is available before transformations
Qs
ETL - n
Ans✔
, Extract, transform, load. Perform extract and transform logic before loading data to
be available.
Data is not available before transformations are performed
Qs
What are Microsoft Azure's 4 Relational Databases? - n
Ans✔
SQL Server in VM
SQL Managed Instance
Azure SQL Database
Azure SQL Database for MySQL, PostgreSQL or Maria DB
Qs
Why use SQL Server in a VM? - n
Ans✔
Guaranteed to be compatible to on premise sql server.
No data limitations (run above 4 TB)
Pay for server and licensing not per DB (could be pos/neg)
Qs
What are potential disadvantages to SQL Server in a VM? - n
Ans✔
You have to do all updates, pick install your sql server version. You manage
everything.
Qs
What is SQL Managed Instance - n