Data Mining Final Exam Review
Questions with Complete Answers
Midterm two - Answer-6-9
How many data sets do you need in order to implement linear regression?
a. one
b. two
c. three
d. four - Answer-two
In the linear regression method, one needs to designate the attribute to be predicted
as the 'label'. Which of the following RapidMiner operators can be used to do
that?
a. set label
b. set role
c. apply model
d. apply label - Answer-set role
In the linear regression method, the ranges for all attributes in the scoring data must
be within the ranges for the corresponding attributes in the training data. Which of the
following RapidMiner operators can be used to match the ranges?
a. filter examples
b. filter range
c. set examples
d. set ranges - Answer-filter examples
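To make the range-matching idea concrete, here is a minimal Python sketch of what filtering scoring rows against training ranges amounts to. It is not RapidMiner code; the function names and the sample attributes (age, income) are hypothetical and chosen only for illustration.

```python
def attribute_ranges(training_rows):
    """Return {attribute: (min, max)} observed in the training rows."""
    names = training_rows[0].keys()
    return {
        n: (min(r[n] for r in training_rows), max(r[n] for r in training_rows))
        for n in names
    }


def filter_to_training_ranges(scoring_rows, ranges):
    """Keep only scoring rows whose every attribute lies inside the training range."""
    return [
        r for r in scoring_rows
        if all(lo <= r[n] <= hi for n, (lo, hi) in ranges.items())
    ]


# Hypothetical sample data (not from the course material).
training = [{"age": 20, "income": 30000}, {"age": 60, "income": 90000}]
scoring = [{"age": 25, "income": 50000}, {"age": 70, "income": 40000}]

ranges = attribute_ranges(training)
kept = filter_to_training_ranges(scoring, ranges)
print(kept)
```

The second scoring row is dropped because its age falls outside the training range, which is the same effect a filter on examples is meant to achieve before the model is applied.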
Any value that lies more than ____ standard deviations below (or above) the
mean is considered inconsistent.
a. one
b. two
c. three
d. four - Answer-two
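The two-standard-deviation rule can be sketched in a few lines of Python. This is an illustrative sketch only: the function name and sample values are hypothetical, and using the sample standard deviation (rather than the population one) is an assumption.

```python
from statistics import mean, stdev


def flag_inconsistent(values, k=2.0):
    """Return the values lying more than k standard deviations from the mean."""
    m = mean(values)
    s = stdev(values)  # sample standard deviation (an assumption)
    return [v for v in values if abs(v - m) > k * s]


# Hypothetical sample data with one obvious outlier.
values = [10, 11, 9, 10, 12, 10, 11, 9, 10, 50]
flagged = flag_inconsistent(values)
print(flagged)
```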
What is the mathematical formula for multiple linear regression? - Answer-y =
m1x1 + m2x2 + ... + mnxn + b
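As a quick illustration of the formula, the sketch below evaluates a multiple linear regression prediction for one observation. The coefficient values and the function name are made up for the example.

```python
def predict(coefficients, intercept, x):
    """Evaluate y = m1*x1 + m2*x2 + ... + mn*xn + b for one observation."""
    if len(coefficients) != len(x):
        raise ValueError("need exactly one coefficient per attribute")
    return sum(m * xi for m, xi in zip(coefficients, x)) + intercept


# Hypothetical coefficients m1..m3, intercept b, and attribute values x1..x3.
y = predict([2.0, -1.0, 0.5], 4.0, [3.0, 2.0, 10.0])
print(y)  # 2*3 - 1*2 + 0.5*10 + 4 = 13.0
```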
What does the letter 'k' in k-means clustering stand for?
a. number of groups
b. number of attributes
c. number of correlations
d. number of observations - Answer-number of groups
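To show how k fixes the number of groups, here is a minimal one-dimensional k-means sketch in Python. It is a simplified illustration, not how any particular tool implements it: the starting centroids are passed in by hand (real implementations usually pick them randomly), and all names and data are hypothetical.

```python
def kmeans_1d(points, centroids, iterations=10):
    """Group 1-D points around len(centroids) centers, i.e. k groups."""
    centroids = list(centroids)
    groups = [[] for _ in centroids]
    for _ in range(iterations):
        # Assignment step: each point joins its nearest centroid's group.
        groups = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)), key=lambda i: abs(p - centroids[i]))
            groups[nearest].append(p)
        # Update step: each centroid moves to the mean of its group.
        centroids = [
            sum(g) / len(g) if g else c
            for g, c in zip(groups, centroids)
        ]
    return centroids, groups


# Hypothetical data with k = 2 starting centroids.
points = [1.0, 2.0, 3.0, 10.0, 11.0, 12.0]
centroids, groups = kmeans_1d(points, [1.0, 12.0])
print(centroids, groups)
```

Passing two starting centroids yields two groups; passing three would yield three, which is all that k controls.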
K-means clustering is a ____ model
a. Prediction
b. Regression
c. differentiation
d. classification - Answer-classification
What is the mathematical formula for simple linear regression?
a. y = x + b
b. x = y + b
c. a = mx + b
d. y = mx + b - Answer-y = mx + b
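For reference, the slope m and constant b in y = mx + b can be estimated from data with ordinary least squares. The sketch below is one standard way to do that; the function name and sample points are hypothetical.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = m*x + b; returns (m, b)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    m = sxy / sxx
    b = mean_y - m * mean_x
    return m, b


# Hypothetical points that lie exactly on y = 2x + 1.
m, b = fit_line([1, 2, 3, 4], [3, 5, 7, 9])
print(m, b)
```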
In the discriminant analysis method, what can be the data type of the attribute that is
to be predicted?
a. numeric
b. binomial
c. binominal
d. all the above - Answer-all the above
In the logistic regression method, which of the following RapidMiner operators is
used to connect the training data set stream with the scoring data set stream?
a. apply model
b. connect model
c. connect stream
d. logistic regression - Answer-apply model
In the discriminant analysis method, which values are calculated by using the training
data set?
a. predicted values
b. confidence intervals
c. probabilities
d. coefficients - Answer-probabilities
In the logistic regression method, the ranges for all attributes in the scoring data
must be within the ranges for the corresponding attributes in the training data. Which
of the following RapidMiner operators can be used to match the ranges?
a. set ranges
b. filter ranges
c. filter examples
d. set examples - Answer-filter examples
What is the name of the constant in the mathematical formula for simple linear
regression?