Questions and Answers 2026/2027 | 100
Practice Questions with Detailed
Explanations for University Students
Description:
Prepare for your 2026/2027 data analytics and machine learning exams with 100
comprehensive practice questions covering classification, regression, time series, clustering,
and model evaluation. Each question includes detailed answers and explanations to reinforce
understanding.
Download your complete exam preparation guide today and ace your university assessments!
, Machine Learning Exam Questions & Answers 2026/2027
SECTION A: DATA TYPES AND DATA PREPARATION
Question 1
What is the primary distinction between quantitative data and categorical data?
A. Quantitative data is always discrete while categorical data is always continuous
B. Quantitative data has meaningful numerical values where higher means more, while
categorical data consists of numbers without inherent meaning or non-numeric labels
C. Quantitative data can only be measured on an interval scale, while categorical data uses ratio
scales
D. Quantitative data requires larger sample sizes than categorical data for statistical significance
Answer: B
Explanation: Quantitative data possesses numerical values with meaningful magnitude
relationships—for example, age, sales figures, temperature, and income all have interpretable
numeric scales where higher values genuinely represent more of the measured attribute.
Categorical data includes numeric codes without inherent meaning, such as zip codes, non-
numeric attributes like hair color, or binary classifications including gender, yes/no responses,
and on/off states.
Question 2
Which of the following represents time-series data?
A. The median selling price of residential properties in the United States recorded annually from
1820 to the present
B. The height measurements of every professional basketball player in the NBA at the beginning
of the current season
C. The current population of each country in the United Nations
D. The favorite ice cream flavors of students enrolled in a university statistics course
Answer: A
,Explanation: Time-series data consists of observations recorded sequentially over time,
typically at consistent intervals. The annual property price recordings represent repeated
measurements of the same variable across time, which is the defining characteristic of time-
series data. The basketball player heights represent cross-sectional data collected at a single point
in time, while population figures and flavor preferences are also cross-sectional measurements.
Question 3
Which example illustrates structured data?
A. The complete text content of a person's Twitter feed
B. The current balance in a person's bank account
C. Video footage from a security camera
D. A collection of customer service email correspondence
Answer: B
Explanation: Structured data follows a predefined format that can be easily organized, stored,
and queried using traditional database systems. A bank account balance is a numeric value that
fits neatly into structured storage alongside other account attributes like account holder name,
transaction history, and interest rate. Unstructured data, such as Twitter feeds, video footage, and
email correspondence, lacks a predefined format and requires more sophisticated processing
techniques for meaningful analysis.
Question 4
A researcher conducts a survey of 50 households, recording each household's size and primary
vehicle type. Which option constitutes a single data point?
A. The household size measurement for the 17th participant
B. The vehicle type classification for each participant
C. The complete set of measurements for the 17th participant, including both household size and
vehicle type
D. The average household size across all 50 participants
Answer: C
, Explanation: A data point represents all information collected about a single observational unit.
In this survey, each household represents one observation, and the complete record for that
household—including both household size and vehicle type—constitutes a single data point.
Recording only one attribute or summarizing across participants creates aggregates or partial
observations rather than individual data points.
Question 5
Which statement accurately describes a contextual outlier?
A. A value that falls outside the interquartile range of the entire dataset
B. A value that is not significantly different from the overall dataset but deviates substantially
from nearby temporal or spatial observations
C. An observation that is missing from an otherwise complete dataset
D. A value that exceeds three standard deviations from the mean
Answer: B
Explanation: Contextual outliers are observations that appear reasonable within the global
dataset distribution but become anomalous when considering their local context, particularly in
time-series or spatial data. For instance, a temperature reading of 25°C might be typical for a
location overall but would represent a contextual outlier if recorded during January in a region
where nearby readings cluster around -5°C. This differs from global outliers, which are extreme
relative to the entire dataset distribution.
Question 6
What visualization technique is most effective for identifying potential outliers in a dataset?
A. Scatter plot matrix
B. Box-and-whisker plot
C. Heat map
D. Parallel coordinates plot
Answer: B