(AIN3701) – Fully Solved Assignment with Step-by-Step Answers,
Marking-Guideline Alignment & Exam-Focused Study Guide (Updated 2026
Edition)
,Question 1:
Which of the following is a primary benefit of automation in data analysis?
• A) Increased human error
• B) Enhanced speed and efficiency
• C) Higher costs
• D) Complexity in data management
Correct Option: B) Enhanced speed and efficiency
Rationale: Automation significantly reduces the time required for data processing and
analysis, leading to faster insights and decision-making.
Question 2:
What is the purpose of a data pipeline in automation?
• A) To store data securely
• B) To ensure data is analyzed only once
• C) To automate the movement and processing of data
• D) To visualize data
Correct Option: C) To automate the movement and processing of data
Rationale: A data pipeline automates the flow of data from ingestion through
processing to storage, allowing for efficient data analysis.
Question 3:
Which tool is commonly used for data visualization in Python?
• A) SQL
• B) TensorFlow
• C) Matplotlib
• D) Git
Correct Option: C) Matplotlib
Rationale: Matplotlib is a popular Python library used for creating static, animated, and
interactive visualizations in Python.
, Question 4:
What does ETL stand for in data processing?
• A) Extract, Transform, Load
• B) Edit, Test, Launch
• C) Evaluate, Transform, Load
• D) Extract, Test, Load
Correct Option: A) Extract, Transform, Load
Rationale: ETL is a process used to extract data from various sources, transform it into
a suitable format, and load it into a destination database.
Question 5:
In the context of machine learning, what does "overfitting" refer to?
• A) A model that performs well on training data but poorly on unseen data
• B) A model that performs well on unseen data
• C) A model that is too simple
• D) A model that requires less data
Correct Option: A) A model that performs well on training data but poorly on unseen
data
Rationale: Overfitting occurs when a model learns the training data too well, including
its noise and outliers, which negatively impacts its performance on new data.
Question 6:
Which of the following is a method for handling missing data in a dataset?
• A) Ignoring the missing values
• B) Filling with the mean
• C) Removing all rows with missing values
• D) All of the above
Correct Option: D) All of the above
Rationale: Each of these methods can be used to handle missing data, depending on
the context and needs of the analysis.
Question 7: