Data Analysis – Solved Assignment with Step-by-Step Answers,
Key Concepts & Exam-Focused Study Guide (Updated 2025
Edition)
Question 1:
Which of the following is a common programming language used for data analysis?
• A) HTML
• B) Python
• C) CSS
• D) SQL
CORRECT ANSWER: B
Rationale: Python is widely used for data analysis due to its powerful libraries like
Pandas, NumPy, and Matplotlib.
Question 2:
What does ETL stand for in data processing?
• A) Extract, Transform, Load
• B) Evaluate, Test, Load
• C) Extract, Transfer, Log
• D) Evaluate, Transform, Load
CORRECT ANSWER: A
Rationale: ETL is a process in data warehousing that involves extracting data from
different sources, transforming it into a suitable format, and loading it into a target
database.
Question 3:
In automation, what does RPA stand for?
• A) Robotic Process Automation
• B) Reliable Process Automation
• C) Reactive Process Automation
• D) Random Process Automation
CORRECT ANSWER: A
Rationale: RPA stands for Robotic Process Automation, which refers to the use of
,software robots to automate highly repetitive and mundane tasks traditionally
performed by humans.
Question 4:
Which of the following is NOT typically considered a benefit of data visualization?
• A) Improved insights
• B) Increased complexity
• C) Enhanced communication
• D) Faster decision-making
CORRECT ANSWER: B
Rationale: Increased complexity is not a benefit of data visualization; rather, the goal is
to simplify and clarify data for better understanding.
Question 5:
What is the primary purpose of a data warehouse?
• A) To process transactions in real time
• B) To store data for long-term analysis
• C) To facilitate data entry
• D) To design user interfaces
CORRECT ANSWER: B
Rationale: A data warehouse is primarily used for storing data for long-term analysis
and reporting, allowing for complex queries and analysis.
Question 6:
Which of the following tools is widely used for data cleaning?
• A) Tableau
• B) Knime
• C) Google Docs
• D) PowerPoint
CORRECT ANSWER: B
Rationale: Knime is a popular open-source data analytics tool that includes features for
data cleaning.
, Question 7:
In statistical analysis, what does the term "p-value" represent?
• A) Probability value indicating statistical significance
• B) Positive value indicating the mean
• C) Population value
• D) Predictive value
CORRECT ANSWER: A
Rationale: The p-value helps determine the statistical significance of results, indicating
the probability that the observed data occurred by chance.
Question 8:
Which type of chart is best used for showing the distribution of a dataset?
• A) Bar chart
• B) Pie chart
• C) Histogram
• D) Line chart
CORRECT ANSWER: C
Rationale: A histogram is specifically designed to show the distribution of a dataset by
grouping data into bins.
Question 9:
What does machine learning primarily involve?
• A) Manual programming for every outcome
• B) Algorithms allowing computers to learn from data
• C) Data entry by humans
• D) Predicting future trends solely through intuition
CORRECT ANSWER: B
Rationale: Machine learning involves algorithms that allow computers to learn from
and make predictions or decisions based on data.
Question 10:
Which of the following is NOT a type of machine learning?
• A) Supervised learning