Data Analytics - Answers The process of analyzing and interpreting data that leads to actionable insights
Data Analysis - Answers A subset of data analytics that focuses on cleaning and analysing a prepared
dataset
Descriptive Analysis - Answers The set of analytical techniques to determine what has happened in the
past. Data Visualization is often used as a technique
Diagnostic Analysis - Answers The set of analytical techniques to determine why something has
happened. Correlation, Data Discovery and Drill-Down are often used as techniques
Predictive Analysis - Answers The set of analytical techniques to determine what might happen in the
future. Machine Learning and Forecasting are often used as a technique.
Prescriptive Analysis - Answers The set of analytical techniques to determine the best course of action to
achieve an outcome. Machine Learning and Simulations are often used as techniques
Machine Learning Model - Answers The output of a Machine Learning Algorithm run on data. It
represents what was learnt from the Machine Learning Algorithm and also contains logic to make
predictions based on new data
Machine Learning Algorithm - Answers A procedure that is run on a large amount of data to create a
model. The Machine Learning Algorithm builds a model, by minimizing the error
Artificial Intelligence - Answers A branch of Compute Science concerned with creating machines that can
perform tasks requiring human intelligence
Machine Learning - Answers A subset of Artificial Intelligence that makes predictions based on data
Advantages of Machine Learning for Analysis - Answers With Machine Learning huge volumes can be
analyzed faster and in an automated fashion.
Generative AI - Answers A subset of Artificial Intelligence that focuses on creating original content
(images, text, music, etc.) based on user prompts. Usually utilizing Deep Learning (which is a subset of
Machine Learning)
Amazon CodeWhisperer Advantages - Answers A service that analyses code and comments to complete
existing code and generate suggestions for entirely new blocks of code. It can help with speed, security
and quality of software
Three Levels of Machine Learning Services in AWS - Answers 1. AI application services (ready made AI)
2. Platform ML services (convenient ML tools)
3. ML frameworks and infrastructure (foundation to create own tools and workflows)
, Big Data - Answers Data that suffers from the 5V's (volume, variety, velocity, veracity, value)
Big Data Challenges (5V's) - Answers 1. volume (amount of data)
2. variety (data comes from different types of sources)
3. velocity (speed of data entering)
4. veracity (trustworthiness of data)
5. value (ability to extract insights from data)
Challenge of large data volumes (5V's) - Answers It's hard to capture, store, transform and analyze it
Three Types of Data (sorted by how easy it is to analyse) - Answers 1. structured -> any tabular data with
a strict schema
2. semi-structured -> Json, XML, etc.
3. unstructured -> Images, Text, etc.
Cloud Storage for Scalability (sorted from best to worst) - Answers 1. Object Storage
2. Document Storage
3. File Storage
Cloud Storage for File Management (sorted from best to worst) - Answers 1. File Storage
2. Object Storage
3. Document Storage
Cloud Storage for Metadata Management (sorted from best to worst) - Answers 1. Object Storage
2. File Storage
2. Document Storage
Cloud Storage for Performance (sorted from best to worst) - Answers 1. Document Storage
2. File Storage
3. Object Storage
Storage Solutions for Unstructured Data - Answers 1. Data Lakes
2. Document Database
Storage Solution for Structured Data - Answers Relational Database Management System (RDBMS)