QUESTIONS WITH SOLUTIONS GRADED A+
● Identify a true statement about an unsupervised model. Answer: has
no target variable
● Which of the following is true of AutoML? Answer: The AutoML
platform is typically capable of analytical discovery of relationships
actually present in the dataset.
● Identify a true statement about AutoML. Answer: AutoML facilitates
accurate decision making for users with limited coding and modeling
experience.
● Which of the four key steps in the AutoML process involves handling
missing data, outliers, variable selection, data standardization, and data
transformation to maintain a common format? Answer: preparing data
● The common adage that people use when referring to ________ data is
"garbage in, garbage out." Answer: invalid and unreliable
● The step of creating ensemble models in the AutoML process allows
us to Answer: reduce the generalization error of the prediction.
,● The boosting process in the creating ensemble models step in the
AutoML process serves the purpose of Answer: reducing error in the
model
● Identify the correct sequence of the four key steps in the AutoML
process. Answer: preparing data, building models, creating ensemble
models, recommending models
● The purpose of building models in the AutoML process is to Answer:
extract insights from data
● In cluster analysis, a market is segmented using ________. Answer:
shared traits
● Which of the following statements is true of the clustering process?
Answer: It enables marketers to identify hidden structures in data.
● In which of the following functions is the distance measured
equivalent to the true straight line distance between two points?
Answer: the euclidean distance
● In hierarchical clustering, approaches such as ________ are most often
used when numerical variables are analyzed. Answer: the Euclidean
distance or the Manhattan distance
, ● In the Manhattan distance approach of measuring similarity between
observations Answer: the distance between two points is a path with
right turns as if one is walking a grid in a city.
● The first step in the k-means clustering algorithm is ________.
Answer: determining the initial k clusters
● The Matching coefficient approach of measuring similarity between
observations Answer: measures the similarity between two observations
with values that represent the minimum differences between two points.
● In the k-means clustering analysis, the silhouette score is calculated
________. Answer: after the cluster algorithm has assigned each
observation to a cluster
● In the k-means clustering algorithm, what happens after observations
are randomly assigned to a cluster? Answer: Cluster centroids are
determined.
● In which of the following methods of linking individual observations
both within and between clusters is similarity defined as the shortest
distance from an object in a cluster to an object from another cluster?
Answer: single linkage