Analysis – Complete Solved Assignment with Step-by-Step Answers and
Exam-Focused Study Guide (Updated 2025 Edition)
Question 1:
Which of the following methods is commonly used for data cleaning?
A) Data mining
B) Data profiling
C) Data visualization
D) Data encoding
CORRECT ANSWER: B) Data profiling
Rationale:
Data profiling examines a dataset's structure, content, and quality. It exposes the
problems, such as missing values, inconsistent formats, and duplicates, that data cleaning
must then fix, which makes it the usual first step of effective data cleaning.
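Worked example:
The profiling step described in the rationale can be sketched in a few lines of Python. This is a minimal illustration using only the standard library, not a full profiling tool. The `profile` function and the sample records are hypothetical and were invented for this example. For each column it reports how many values are missing, how many distinct values exist, and which data types appear.

```python
from collections import Counter


def profile(rows):
    """Summarize each column of a list of dict records (hypothetical helper)."""
    columns = sorted({key for row in rows for key in row})
    report = {}
    for col in columns:
        values = [row.get(col) for row in rows]
        # Treat None and empty strings as missing values.
        present = [v for v in values if v not in (None, "")]
        report[col] = {
            "missing": len(values) - len(present),
            "distinct": len(set(present)),
            "types": dict(Counter(type(v).__name__ for v in present)),
        }
    return report


# Sample records invented for illustration only.
rows = [
    {"id": 1, "age": 34, "city": "Leeds"},
    {"id": 2, "age": None, "city": "leeds"},
    {"id": 3, "age": "41", "city": ""},
]

report = profile(rows)
for column, stats in report.items():
    print(column, stats)
```

Here the profile flags three things a cleaning step would then address: a missing age, an age stored as text instead of a number, and a city name with inconsistent capitalization.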
Question 2:
What does ETL stand for in data management?
A) Extract, Transform, Load
B) Evaluate, Transfer, Launch
C) Extract, Transfer, Load
D) Evaluate, Transform, Load
CORRECT ANSWER: A) Extract, Transform, Load
Rationale:
ETL is a process used to extract data from various sources, transform it into a suitable
format, and load it into a database or data warehouse.
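Worked example:
The three ETL stages can be illustrated with a small Python sketch that uses only the standard library. It assumes a CSV string as the source and an in-memory SQLite database as the target. The data, the `payments` table, and the function names are hypothetical and chosen for illustration; real pipelines typically read from files, APIs, or other databases.

```python
import csv
import io
import sqlite3

# Hypothetical source data, invented for illustration.
RAW_CSV = """name,amount
 alice ,10.50
BOB,3
carol,not_a_number
"""


def extract(text):
    """Extract: read raw records from the CSV source."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(records):
    """Transform: normalize names and convert amounts to numbers."""
    cleaned = []
    for rec in records:
        try:
            amount = float(rec["amount"])
        except ValueError:
            continue  # skip rows whose amount cannot be parsed
        cleaned.append((rec["name"].strip().title(), amount))
    return cleaned


def load(rows, conn):
    """Load: write the cleaned rows into the target table."""
    conn.execute("CREATE TABLE IF NOT EXISTS payments (name TEXT, amount REAL)")
    conn.executemany("INSERT INTO payments VALUES (?, ?)", rows)
    conn.commit()


conn = sqlite3.connect(":memory:")
load(transform(extract(RAW_CSV)), conn)
loaded = conn.execute("SELECT name, amount FROM payments ORDER BY name").fetchall()
print(loaded)
```

Keeping each stage in its own function mirrors the order in the acronym: data is extracted first, transformed second, and loaded last.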
Question 3:
Which automation tool is widely used for web scraping?
A) Selenium
B) Tableau
C) Power BI
D) SQL Server
CORRECT ANSWER: A) Selenium
Rationale:
Selenium is a browser automation tool built primarily for web testing. It is also widely
used for scraping because it drives a real browser and simulates user interactions
(clicks, typing, scrolling) with web pages programmatically.
Question 4:
In data analysis, what does 'anomaly detection' mean?
A) Finding missing values in datasets
B) Identifying rare items, events, or observations that raise suspicions
C) Grouping similar data points together
D) Summarizing the main characteristics of data
CORRECT ANSWER: B) Identifying rare items, events, or observations that raise
suspicions
Rationale:
Anomaly detection is crucial in data analysis to identify unusual patterns that do not
conform to expected behavior, which can indicate fraud, errors, or significant insights.
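Worked example:
One simple way to flag values that do not conform to expected behavior is the z-score rule, which measures how many standard deviations a value lies from the mean. The sketch below is only one of many anomaly detection techniques. The `zscore_outliers` function, the threshold of 2.0, and the sample readings are assumptions made for illustration. Note that the mean and standard deviation are themselves affected by extreme values, so this rule works best on larger samples.

```python
from statistics import mean, stdev


def zscore_outliers(values, threshold=2.0):
    """Return values whose z-score exceeds the threshold (assumed to be 2.0)."""
    mu = mean(values)
    sigma = stdev(values)  # sample standard deviation
    if sigma == 0:
        return []  # all values are identical, so nothing stands out
    return [v for v in values if abs(v - mu) / sigma > threshold]


# Hypothetical sensor readings, invented for illustration.
readings = [10, 12, 11, 13, 12, 11, 10, 12, 95]
outliers = zscore_outliers(readings)
print(outliers)
```

In this sample only the reading of 95 is flagged. Whether it represents fraud, an error, or a genuine insight still has to be decided by investigating the data.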
Question 5:
Which of the following programming languages is primarily used for data analysis?
A) HTML
B) Java
C) Python
D) C++
CORRECT ANSWER: C) Python
Rationale:
Python is widely used in data analysis due to its simplicity, extensive libraries such as
Pandas and NumPy, and strong community support.
Question 1:
Which of the following methods is commonly used for data cleaning?
A) Data mining
B) Data profiling
C) Data visualization
D) Data encoding
CORRECT ANSWER: B) Data profiling
Rationale:
Data profiling involves examining data for quality and structure, enabling effective
cleaning.
Question 2:
What does ETL stand for in data management?
A) Extract, Transform, Load
B) Evaluate, Transfer, Launch
C) Extract, Transfer, Load
D) Evaluate, Transform, Load
CORRECT ANSWER: A) Extract, Transform, Load
Rationale:
ETL is a standard process for moving data from one system to another after
transforming it into a usable format.
Question 3:
Which automation tool is widely used for web scraping?
A) Selenium
B) Tableau
C) Power BI
D) SQL Server
CORRECT ANSWER: A) Selenium
Rationale:
Selenium automates web browsers, making it an excellent choice for scraping web data
directly from websites.
Question 4:
In data analysis, what does 'anomaly detection' mean?
A) Finding missing values in datasets
B) Identifying rare items, events, or observations
C) Grouping similar data points
D) Summarizing data characteristics
CORRECT ANSWER: B) Identifying rare items, events, or observations
Rationale:
Anomaly detection seeks to find data points that differ significantly from the norm,
revealing potential fraud or experimental errors.
Question 5:
Which of the following programming languages is primarily used for data analysis?
A) HTML
B) Java
C) Python
D) C++
CORRECT ANSWER: C) Python