Exam Questions & Verified Correct
Answers | A+ Graded | Latest Edition
1. We should always use mean squared error to determine the best
value of lambda in lasso regression.
a. True
b. False
Sol: False. The criterion used is a choice we make.
Explanation:
While Mean Squared Error (MSE) is commonly used to evaluate
model performance and help choose the best λ (lambda) in lasso
regression, it is not the only option, nor is it always the best.
Other criteria or methods may also be used depending on the
context:
• Cross-validation (often k-fold CV) to minimize validation error
(e.g., using mean absolute error (MAE) or deviance depending
on the type of regression).
• Information criteria, such as AIC or BIC, particularly in
generalized linear models.
• Domain-specific performance metrics may be more appropriate
in certain applications (e.g., classification accuracy, ROC-AUC for
classification tasks).
, 2. Standard linear regression is an example of a generalized linear
model where the response is normally distributed and the link is
the identity function.
a. True
b. False
Sol: True. See Unit 4.4.1.
Explanation:
Generalized Linear Models (GLMs) extend linear regression by
allowing:
• Different distributions for the response variable (not just normal).
• Different link functions connecting predictors to the expected
response.
However, standard linear regression is actually a special case of
a GLM:
• Response distribution: Normal (Gaussian)
• Link function: Identity (i.e., the mean of the response is directly
modeled as a linear combination of predictors)
This fits the definition of a GLM.
, 3. Goodness-of-fit assessment for logistic regression involves
checking for the independence, constant variance, and normality of
the deviance residuals.
a. True
b. False
Sol: False. We don’t have constant variance in binomial
regression.
Explanation:
In logistic regression, the standard assumptions for assessing
model goodness-of-fit differ from those in linear regression.
Specifically:
• The response is binary, not continuous.
• Variance is not constant—it depends on the mean (since it's a
binomial distribution).
• Normality of residuals is not assumed.
Instead of checking for independence, constant variance, and
normality (as in linear regression), we typically assess logistic
regression fit using:
• Deviance and deviance residuals
• Hosmer–Lemeshow test
• Area under the ROC curve (AUC)
• Classification accuracy or confusion matrices