Data Science Questions and Verified Answers
Missing Data Correct Answer: Observations where one or more values are not
recorded.
Imputation Correct Answer: Replacing missing values with estimated values such as
the mean or median.
Outliers Correct Answer: Data points that significantly differ from the rest of the
dataset.
Normalization Correct Answer: Scaling data to a fixed range, usually [0,1] [0,1].
Standardization Correct Answer: Scaling data to have a mean of 0 and standard
deviation of 1.
Categorical Encoding Correct Answer: Converting non-numeric variables into
numeric form.
One-Hot Encoding Correct Answer: Representing categories as binary indicator
variables.
Feature Engineering Correct Answer: Creating new input variables from existing
data.
Data Transformation Correct Answer: Applying mathematical operations to change
data distribution.
Train-Test Split Correct Answer: Dividing data into training and testing sets.
Descriptive Statistics Correct Answer: Numerical summaries describing a dataset.
Mean Correct Answer: The arithmetic average of values.
Median Correct Answer: The middle value in an ordered dataset.
Missing Data Correct Answer: Observations where one or more values are not
recorded.
Imputation Correct Answer: Replacing missing values with estimated values such as
the mean or median.
Outliers Correct Answer: Data points that significantly differ from the rest of the
dataset.
Normalization Correct Answer: Scaling data to a fixed range, usually [0,1] [0,1].
Standardization Correct Answer: Scaling data to have a mean of 0 and standard
deviation of 1.
Categorical Encoding Correct Answer: Converting non-numeric variables into
numeric form.
One-Hot Encoding Correct Answer: Representing categories as binary indicator
variables.
Feature Engineering Correct Answer: Creating new input variables from existing
data.
Data Transformation Correct Answer: Applying mathematical operations to change
data distribution.
Train-Test Split Correct Answer: Dividing data into training and testing sets.
Descriptive Statistics Correct Answer: Numerical summaries describing a dataset.
Mean Correct Answer: The arithmetic average of values.
Median Correct Answer: The middle value in an ordered dataset.