ISYE 6501 FINAL EXAM | QUESTIONS AND ANSWERS | VERIFIED
ANSWERS GRADED A+ | LATEST EXAM
what is k-fold cross validation? - CORRECT ANSWERS -S-split the
training/validation data into k- parts; we train on k-1 parts and validate on the
remaining part.
What metric do you use for k-fold cross validation when comparing models? -
CORRECT ANSWERS -S-The average of all k evaluations.
What do we use when important data only appears in the validation or test sets?
-
ANSWERS-cross-validation
What do we do after we've performed crossvalidation? - CORRECT
ANSWERS -S-We train the model again using all the data.
what are the benefits of k-fold cross validation? - CORRECT ANSWERS -S-
better use of data, better estimate of model quality, and chooses model more
effectively
What can clustering be used for? - CORRECT ANSWERS -Sgrouping data
points (e.g., market segmentation) and discovering groups in data points (e.g.,
personalized medicine
,Which should we use most of the data for: training, validation, or test? -
CORRECT ANSWERS -S- training
In k-fold cross-validation, how many times is each part of the data used for
training, and for validation? - CORRECT ANSWERS -S-k-1 times for
training, and 1 time for validation
what is rectangular distance useful for? - CORRECT ANSWERS -S-
calculating driving distance when the city is mapped in a grid
what is the value of p for euclidean distance -
ANSWERS-2
what is the general equation for p-norm distance - CORRECT ANSWERS -S-
What do descriptive questions ask? - CORRECT ANSWERS -SWhat
happened? (e.g., which customers are most alike)
4|Page
What do predictive questions ask? - CORRECT ANSWERS -SWhat will
happen? (e.g., what will Google's stock price be?)
What do prescriptive questions ask? - CORRECT ANSWERS -S- What
action(s) would be best? (e.g., where to put traffic lights)
What is a model? - CORRECT ANSWERS -S-Real-life situation expressed as
math.
,What do classifiers help you do? - CORRECT ANSWERS -Sdifferentiate
What is a soft classifier and when is it used? -
ANSWERS-In some cases, there won't
be a line that separates all of the labeled examples. So we use a classifier that
minimizes the number of mistakes.
What does it mean when the classifier/decision boundary is almost parallel to
the vertical x-axis? -
5|Page
ANSWERS-The horizontal attribute is all that is needed.
What does it mean when the classifier/decision boundary is almost parallel to
the horizontal y- axis? - CORRECT ANSWERS -S-The vertical attribute is all
that is needed.
What is time-series data? - CORRECT ANSWERS -S-The same data recorded
over time often recorded at equal intervals
What is quantitative data? - CORRECT ANSWERS -S-Number with a
meaning: higher means more,
lower means less (e.g., age, sales, temperature, income)
What is categorical data? - CORRECT ANSWERS -S-Numbers w/o meaning
(e.g., zip codes), non- numeric (e.g., hair color), binary data (e.g., male/female,
yes/no, on/off)
6|Page
, Which of these is time series data?
A. The average cost of a house in the United States every year since 1820
B. The height of each professional basketball player in the NBA at the start
of the season - CORRECT ANSWERS -S-A
Which of these is structured data?
A. The contents of a person's Twitter feed B. The amount of money in a
person's bank account - CORRECT ANSWERS -S-B
What is structured data? - CORRECT ANSWERS -S-Data that can be stores in
a structured way
What is unstructured data? - CORRECT ANSWERS -S-Data that is not easily
described and stored
(e.g., written text)
A survey of 25 people recorded each person's family size and type of car. Which
of these is a data point?
7|Page
A. The 14th person's family size and car type B. The 14th person's family size
C.The car type of each person - CORRECT ANSWERS -S-A. A data point is
all the information about one observation
The farther the wrongly classified point is from the line - CORRECT
ANSWERS -S-The bigger the mistake we've made
The term including the margin gets larger so the importance of a large margin
out weights avoiding mistakes and classifying known data samples. -
CORRECT ANSWERS -S-As lambda gets larger
ANSWERS GRADED A+ | LATEST EXAM
what is k-fold cross validation? - CORRECT ANSWERS -S-split the
training/validation data into k- parts; we train on k-1 parts and validate on the
remaining part.
What metric do you use for k-fold cross validation when comparing models? -
CORRECT ANSWERS -S-The average of all k evaluations.
What do we use when important data only appears in the validation or test sets?
-
ANSWERS-cross-validation
What do we do after we've performed crossvalidation? - CORRECT
ANSWERS -S-We train the model again using all the data.
what are the benefits of k-fold cross validation? - CORRECT ANSWERS -S-
better use of data, better estimate of model quality, and chooses model more
effectively
What can clustering be used for? - CORRECT ANSWERS -Sgrouping data
points (e.g., market segmentation) and discovering groups in data points (e.g.,
personalized medicine
,Which should we use most of the data for: training, validation, or test? -
CORRECT ANSWERS -S- training
In k-fold cross-validation, how many times is each part of the data used for
training, and for validation? - CORRECT ANSWERS -S-k-1 times for
training, and 1 time for validation
what is rectangular distance useful for? - CORRECT ANSWERS -S-
calculating driving distance when the city is mapped in a grid
what is the value of p for euclidean distance -
ANSWERS-2
what is the general equation for p-norm distance - CORRECT ANSWERS -S-
What do descriptive questions ask? - CORRECT ANSWERS -SWhat
happened? (e.g., which customers are most alike)
4|Page
What do predictive questions ask? - CORRECT ANSWERS -SWhat will
happen? (e.g., what will Google's stock price be?)
What do prescriptive questions ask? - CORRECT ANSWERS -S- What
action(s) would be best? (e.g., where to put traffic lights)
What is a model? - CORRECT ANSWERS -S-Real-life situation expressed as
math.
,What do classifiers help you do? - CORRECT ANSWERS -Sdifferentiate
What is a soft classifier and when is it used? -
ANSWERS-In some cases, there won't
be a line that separates all of the labeled examples. So we use a classifier that
minimizes the number of mistakes.
What does it mean when the classifier/decision boundary is almost parallel to
the vertical x-axis? -
5|Page
ANSWERS-The horizontal attribute is all that is needed.
What does it mean when the classifier/decision boundary is almost parallel to
the horizontal y- axis? - CORRECT ANSWERS -S-The vertical attribute is all
that is needed.
What is time-series data? - CORRECT ANSWERS -S-The same data recorded
over time often recorded at equal intervals
What is quantitative data? - CORRECT ANSWERS -S-Number with a
meaning: higher means more,
lower means less (e.g., age, sales, temperature, income)
What is categorical data? - CORRECT ANSWERS -S-Numbers w/o meaning
(e.g., zip codes), non- numeric (e.g., hair color), binary data (e.g., male/female,
yes/no, on/off)
6|Page
, Which of these is time series data?
A. The average cost of a house in the United States every year since 1820
B. The height of each professional basketball player in the NBA at the start
of the season - CORRECT ANSWERS -S-A
Which of these is structured data?
A. The contents of a person's Twitter feed B. The amount of money in a
person's bank account - CORRECT ANSWERS -S-B
What is structured data? - CORRECT ANSWERS -S-Data that can be stores in
a structured way
What is unstructured data? - CORRECT ANSWERS -S-Data that is not easily
described and stored
(e.g., written text)
A survey of 25 people recorded each person's family size and type of car. Which
of these is a data point?
7|Page
A. The 14th person's family size and car type B. The 14th person's family size
C.The car type of each person - CORRECT ANSWERS -S-A. A data point is
all the information about one observation
The farther the wrongly classified point is from the line - CORRECT
ANSWERS -S-The bigger the mistake we've made
The term including the margin gets larger so the importance of a large margin
out weights avoiding mistakes and classifying known data samples. -
CORRECT ANSWERS -S-As lambda gets larger