ADDA REVIEW TEST QUESTIONS AND
100% CORRECT ANSWERS!!
What is cluster analysis?
• A simple approach to forming groups of variables or cases
• Individuals or variables that are "similar" to one another are grouped into the same cluster.
• Essentially an exploratory technique
What are the three types of cluster analysis?
1. Hierarchical cluster analysis
2. k‐means cluster analysis
3. Two step cluster analysis (SPSS)
In which type of cluster analysis does the researcher not determine how many clusters will
be produced in the final model?
The researcher does not determine how many clusters will be produced in two step cluster
analysis, which has inferential techniques to assist with decisions on the number of clusters.
What measure of similarity between the variables do we use in hierarchical cluster
analysis?
Distance scores
- Correlations are not used because they assess similarity in variation, not similarity in scores.
What are the distance measures used in hierarchical cluster analysis?
1. Euclidean Distance
2. Block
3. Minkowski‐r
4. Squared Euclidean Distance
5. Power
What is the calculation for working out distance in hierarchical clustering using the block
metric?
Takes the absolute difference between each pair of scores and sums these differences.
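The block calculation can be illustrated with a minimal sketch, assuming each variable is held as a list of scores of equal length. The function name `block_distance` and the sample scores are hypothetical, chosen only for illustration.

```python
def block_distance(x, y):
    """Block (city-block) metric: sum of absolute differences between paired scores."""
    if len(x) != len(y):
        raise ValueError("both score lists must have the same length")
    return sum(abs(a - b) for a, b in zip(x, y))


# Hypothetical scores for two variables across four cases.
var_a = [2, 4, 6, 8]
var_b = [1, 7, 6, 5]

print(block_distance(var_a, var_b))  # 1 + 3 + 0 + 3 = 7
```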
What is the calculation for working out distance in hierarchical clustering using Squared
Euclidean Distance?
Takes the difference between each pair of scores, squares each difference to get rid of negative
values, and sums the squared differences.
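A minimal sketch of the Squared Euclidean Distance calculation, again assuming each variable is a list of scores of equal length. The name `squared_euclidean_distance` and the sample scores are hypothetical.

```python
def squared_euclidean_distance(x, y):
    """Sum of squared differences between paired scores."""
    if len(x) != len(y):
        raise ValueError("both score lists must have the same length")
    # Squaring each difference removes the sign before summing.
    return sum((a - b) ** 2 for a, b in zip(x, y))


# Hypothetical scores for two variables across four cases.
var_a = [2, 4, 6, 8]
var_b = [1, 7, 6, 5]

print(squared_euclidean_distance(var_a, var_b))  # 1 + 9 + 0 + 9 = 19
```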
What is the calculation for working out distance in hierarchical clustering using Euclidean
Distance?
Takes the square root of the sum of the squared differences between each pair of scores.
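A minimal sketch of the Euclidean Distance calculation, assuming each variable is a list of scores of equal length. The name `euclidean_distance` and the sample scores are hypothetical.

```python
import math


def euclidean_distance(x, y):
    """Square root of the sum of squared differences between paired scores."""
    if len(x) != len(y):
        raise ValueError("both score lists must have the same length")
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))


# Hypothetical scores for two variables across three cases.
var_a = [1, 2, 3]
var_b = [4, 6, 3]

print(euclidean_distance(var_a, var_b))  # sqrt(9 + 16 + 0) = 5.0
```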
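In a proximity matrix (comprised of distance scores), what do the rows and columns
represent, and what do the cells represent?
The rows and columns represent the variables (or cases) being clustered, and each cell holds
the distance score between the row variable and the column variable.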
In a proximity matrix:
A. Numbers further away from zero indicate variables are closer together and therefore
more likely to be part of the same cluster.
B. Numbers further away from zero indicate variables are further apart and therefore more
likely to be part of the same cluster.
C. Numbers closer to zero indicate variables are further apart, and therefore more likely to
be part of the same cluster.
D. Numbers closer to zero indicate variables are closer together, and therefore more likely
to be part of the same cluster.
D. Numbers closer to zero indicate variables are closer together, and therefore more likely to be
part of the same cluster.
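To see how a proximity matrix is read, here is a minimal sketch that builds one from Euclidean distances and picks out the pair of variables with the smallest distance score. The variable names (`v1`, `v2`, `v3`), their scores, and the helper names are all hypothetical; statistical packages build this matrix for you.

```python
import math
from itertools import combinations


def euclidean(x, y):
    """Euclidean distance between two equal-length lists of scores."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))


def proximity_matrix(variables):
    """Nested dict: rows and columns are variable names, cells are distances."""
    names = list(variables)
    return {r: {c: euclidean(variables[r], variables[c]) for c in names} for r in names}


# Hypothetical scores for three variables across three cases.
scores = {
    "v1": [1, 2, 3],
    "v2": [1, 2, 4],
    "v3": [9, 8, 7],
}

matrix = proximity_matrix(scores)

# The pair whose cell is closest to zero is the most similar pair.
closest = min(combinations(scores, 2), key=lambda pair: matrix[pair[0]][pair[1]])
print(closest)  # ('v1', 'v2')
```

The diagonal cells are zero because each variable has no distance from itself, so only the off-diagonal cells are compared.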
In hierarchical clustering analysis, what is the method for combining clusters whereby the
distance between cluster A and B is defined as the smallest distance between any element
(variable) of A and any element (variable) of B?
Nearest neighbor rule (also called single link)
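The nearest neighbor rule can be sketched as follows, assuming each cluster is a list of elements and each element is a list of scores. The function names, the use of the block metric as the distance measure, and the sample clusters are hypothetical choices for illustration.

```python
def block(x, y):
    """Block metric: sum of absolute differences between paired scores."""
    return sum(abs(a - b) for a, b in zip(x, y))


def single_link_distance(cluster_a, cluster_b, distance):
    """Smallest distance between any element of A and any element of B."""
    return min(distance(a, b) for a in cluster_a for b in cluster_b)


# Hypothetical clusters, each holding two elements with two scores.
cluster_a = [[1, 1], [2, 2]]
cluster_b = [[5, 5], [3, 4]]

# Pairwise block distances are 8, 5, 6 and 3; the rule keeps the smallest.
print(single_link_distance(cluster_a, cluster_b, block))  # 3
```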
What are the possible rules for distance between clusters used in hierarchical clustering
analysis?