100% satisfaction guarantee Immediately available after payment Both online and in PDF No strings attached 4.2 TrustPilot
logo-home
Exam (elaborations)

DSCI 4520 EXAM 1 QUESTIONS AND CORRECT ANSWERS

Rating
-
Sold
-
Pages
10
Grade
A+
Uploaded on
28-02-2025
Written in
2024/2025

DSCI 4520 EXAM 1 QUESTIONS AND CORRECT ANSWERS We run two k-means clustering models on the same data with k=3 and k=5. The model with k=3 is necessarily better than the other one because a smaller value of k is always better for clustering. CORRECT ANSW-false The following chart shows the within-cluster sum of square errors versus the number of clusters in a k-means clustering model. Based on the Elbow method, what value of k is optimum for clustering? CORRECT ANSW-(answer is 4) How to answer a question like this is you choose the value on the x-axis where the elbow would be on the arm. essentially where the chart "slows" down In the k-means clustering technique, the desired number of clusters (k) is a number that is determined in the middle of the algorithm by calculating the model error. CORRECT ANSW false

Show more Read less
Institution
DSCI 4520
Course
DSCI 4520









Whoops! We can’t load your doc right now. Try again or contact support.

Written for

Institution
DSCI 4520
Course
DSCI 4520

Document information

Uploaded on
February 28, 2025
Number of pages
10
Written in
2024/2025
Type
Exam (elaborations)
Contains
Questions & answers

Subjects

Content preview

DSCI 4520 EXAM 1 QUESTIONS AND
CORRECT ANSWERS
We run two k-means clustering models on the same data with k=3 and k=5. The model with k=3 is
necessarily better than the other one because a smaller value of k is always better for clustering.
✅✅CORRECT ANSW-false



The following chart shows the within-cluster sum of square errors versus the number of clusters in a
k-means clustering model. Based on the Elbow method, what value of k is optimum for clustering?
✅✅CORRECT ANSW-(answer is 4)

How to answer a question like this is you choose the value on the x-axis where the elbow would be
on the arm. essentially where the chart "slows" down



In the k-means clustering technique, the desired number of clusters (k) is a number that is
determined in the middle of the algorithm by calculating the model error. ✅✅CORRECT ANSW-
false



Both numerical and categorical variables can be used in the Euclidian distance function in the k-
means clustering algorithm. ✅✅CORRECT ANSW-false



With the k-NN model for a numerical target, after we determined the k nearest neighbors of a new
data record, how the target value is predicted? ✅✅CORRECT ANSW-Average of the neighbors

Which statement is INCORRECT about the k-means clustering algorithm? ✅✅CORRECT ANSW-The
algorithm starts with initial centroids that are determined by distance function



What is the Euclidean distance between the following two records WITHOUT normalization? Round
your answer to 1 decimal.

Euclidean distance formula: ✅✅CORRECT ANSW-Age1 = 25, Age2 = 30 Credit Score1 = 550, Credit
Score2 = 540 Children1 = 3, Children2 = 2 Savings1 = 4.6, Savings2 = 7.2

Euclidean Distance = sqrt((25 - 30)^2 + (550 - 540)^2 + (3 - 2)^2 + (4.6 - 7.2)^2)

Euclidean Distance = sqrt((-5)^2 + (10)^2 + (1)^2 + (-2.6)^2)

Euclidean Distance = sqrt(25 + 100 + 1 + 6.76)

Euclidean Distance = sqrt(132.76)

, [11.5]



The k-means clustering algorithm can easily handle noisy data with outliers as well as non-convex
data patterns. ✅✅CORRECT ANSW-false



Which statement is INCORRECT about clustering? ✅✅CORRECT ANSW-Clustering is useful for
predicting association rules



Before computing the distance between two data records, we should normalize the numerical
variables to prevent variables with large scales from having an undue effect. ✅✅CORRECT ANSW-
true



Which statement is INCORRECT about choosing the number of clusters in the k-means clustering
method? ✅✅CORRECT ANSW-Maximizing the within-cluster sums of squared errors (WSS) is the
goal when selecting k



Which statement is INCORRECT about k-NN predictive models? ✅✅CORRECT ANSW-Larger values
of k increase the risk of over-fitting



k-nearest neighbor (k-NN) is a supervised method that can be used for predicting categorical or
numerical targets. ✅✅CORRECT ANSW-true



What statement is correct about the k-nearest neighbor (k-NN) method? ✅✅CORRECT ANSW-The
value of k can control model over and underfitting



In the k-nearest neighbor models, increasing the value of k leads to overfitting. ✅✅CORRECT
ANSW-false



You are requested to use a large data set of customers to predict how many days after their first
purchase they will make the second purchase. You can do this by developing a classification model.
✅✅CORRECT ANSW-false



Categorical variables can NOT be used as predictors in the linear regression model. ✅✅CORRECT
ANSW-false

Get to know the seller

Seller avatar
Reputation scores are based on the amount of documents a seller has sold for a fee and the reviews they have received for those documents. There are three levels: Bronze, Silver and Gold. The better the reputation, the more your can rely on the quality of the sellers work.
PEAKGRADES Chamberlain College Of Nursing
View profile
Follow You need to be logged in order to follow users or courses
Sold
26
Member since
1 year
Number of followers
6
Documents
4006
Last sold
3 weeks ago
PEAK GRADES

Hello everyone...Explore a wide range of Nursing Exams, Test Banks, Study Guides, and other valuable study materials on this page. If you need any additional resources, simply reach out to us, and we’ll deliver them promptly! Please remember to leave a review after your purchase to help us improve customer satisfaction. Thank you

4.3

4 reviews

5
2
4
1
3
1
2
0
1
0

Recently viewed by you

Why students choose Stuvia

Created by fellow students, verified by reviews

Quality you can trust: written by students who passed their tests and reviewed by others who've used these notes.

Didn't get what you expected? Choose another document

No worries! You can instantly pick a different document that better fits what you're looking for.

Pay as you like, start learning right away

No subscription, no commitments. Pay the way you're used to via credit card and download your PDF document instantly.

Student with book image

“Bought, downloaded, and aced it. It really can be that simple.”

Alisha Student

Frequently asked questions