CORRECT ANSWERS
We run two k-means clustering models on the same data with k=3 and k=5. The model with k=3 is
necessarily better than the other one because a smaller value of k is always better for clustering.
✅✅CORRECT ANSWER: False
The following chart shows the within-cluster sum of square errors versus the number of clusters in a
k-means clustering model. Based on the Elbow method, what value of k is optimum for clustering?
✅✅CORRECT ANSWER: 4
To answer a question like this, pick the value on the x-axis where the "elbow" of the curve sits, i.e., the point after which the decrease in the within-cluster sum of squared errors (WSS) slows down noticeably.
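For reference, this is roughly how the WSS-versus-k curve behind such a chart can be produced. A minimal sketch assuming scikit-learn and a placeholder feature matrix X; the random data and the 1-10 range of k are made up for illustration:
```python
# Minimal elbow-method sketch (assumes scikit-learn and a numeric feature matrix X).
import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 2))  # placeholder data; replace with your own features

wss = []
for k in range(1, 11):
    model = KMeans(n_clusters=k, n_init=10, random_state=0).fit(X)
    wss.append(model.inertia_)  # inertia_ is the within-cluster sum of squared errors (WSS)

# Plot k versus WSS and pick the k where the curve bends (the "elbow"), e.g. k = 4 in the chart above.
for k, err in zip(range(1, 11), wss):
    print(k, round(err, 1))
```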
In the k-means clustering technique, the desired number of clusters (k) is a number that is
determined in the middle of the algorithm by calculating the model error. ✅✅CORRECT ANSWER: False
Both numerical and categorical variables can be used in the Euclidean distance function in the k-means clustering algorithm. ✅✅CORRECT ANSWER: False
With the k-NN model for a numerical target, after we have determined the k nearest neighbors of a new data record, how is the target value predicted? ✅✅CORRECT ANSWER: Average of the neighbors
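A minimal sketch of that prediction rule, assuming already-normalized numeric features; the toy arrays X_train, y_train, and x_new are made up for illustration:
```python
# Sketch: predicting a numerical target with k-NN by averaging the k nearest neighbors.
import numpy as np

X_train = np.array([[0.2, 0.4], [0.3, 0.5], [0.9, 0.8], [0.1, 0.3]])
y_train = np.array([10.0, 12.0, 30.0, 9.0])
x_new = np.array([0.25, 0.45])
k = 3

dists = np.sqrt(((X_train - x_new) ** 2).sum(axis=1))  # Euclidean distance to every training record
nearest = np.argsort(dists)[:k]                        # indices of the k closest records
prediction = y_train[nearest].mean()                   # numerical target = average of the neighbors
print(prediction)
```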
Which statement is INCORRECT about the k-means clustering algorithm? ✅✅CORRECT ANSWER: The algorithm starts with initial centroids that are determined by a distance function
What is the Euclidean distance between the following two records WITHOUT normalization? Round your answer to 1 decimal.
Record 1: Age = 25, Credit Score = 550, Children = 3, Savings = 4.6
Record 2: Age = 30, Credit Score = 540, Children = 2, Savings = 7.2
✅✅CORRECT ANSWER: 11.5
Euclidean distance = sqrt((25 - 30)^2 + (550 - 540)^2 + (3 - 2)^2 + (4.6 - 7.2)^2)
= sqrt((-5)^2 + (10)^2 + (1)^2 + (-2.6)^2)
= sqrt(25 + 100 + 1 + 6.76)
= sqrt(132.76)
≈ 11.5
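The same computation as a small sketch in plain Python; the field names are just labels for the values above:
```python
# Sketch of the distance computation above (no normalization).
import math

record1 = {"Age": 25, "CreditScore": 550, "Children": 3, "Savings": 4.6}
record2 = {"Age": 30, "CreditScore": 540, "Children": 2, "Savings": 7.2}

dist = math.sqrt(sum((record1[f] - record2[f]) ** 2 for f in record1))
print(round(dist, 1))  # 11.5
```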
The k-means clustering algorithm can easily handle noisy data with outliers as well as non-convex
data patterns. ✅✅CORRECT ANSWER: False
Which statement is INCORRECT about clustering? ✅✅CORRECT ANSWER: Clustering is useful for predicting association rules
Before computing the distance between two data records, we should normalize the numerical
variables to prevent variables with large scales from having an undue effect. ✅✅CORRECT ANSWER: True
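A minimal sketch of min-max normalization before a distance calculation, assuming scikit-learn; the three example records are made up for illustration:
```python
# Sketch: min-max normalization before computing distances (assumes scikit-learn).
import numpy as np
from sklearn.preprocessing import MinMaxScaler

X = np.array([[25, 550, 3, 4.6],
              [30, 540, 2, 7.2],
              [45, 700, 0, 1.3]])   # Age, Credit Score, Children, Savings

X_scaled = MinMaxScaler().fit_transform(X)  # each column rescaled to [0, 1]
# After scaling, Credit Score no longer dominates the Euclidean distance just because
# it is measured on a much larger scale than Children or Savings.
print(np.round(X_scaled, 2))
```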
Which statement is INCORRECT about choosing the number of clusters in the k-means clustering
method? ✅✅CORRECT ANSWER: Maximizing the within-cluster sum of squared errors (WSS) is the goal when selecting k
Which statement is INCORRECT about k-NN predictive models? ✅✅CORRECT ANSWER: Larger values of k increase the risk of overfitting
k-nearest neighbor (k-NN) is a supervised method that can be used for predicting categorical or
numerical targets. ✅✅CORRECT ANSWER: True
What statement is correct about the k-nearest neighbor (k-NN) method? ✅✅CORRECT ANSWER: The value of k can control model overfitting and underfitting
In k-nearest neighbor models, increasing the value of k leads to overfitting. ✅✅CORRECT ANSWER: False
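A minimal sketch of how k controls over- and underfitting, assuming scikit-learn; the synthetic dataset and the tested k values are arbitrary choices for illustration:
```python
# Sketch: comparing training and test accuracy of k-NN for different k.
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

X, y = make_classification(n_samples=500, n_features=5, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

for k in (1, 5, 25, 100):
    model = KNeighborsClassifier(n_neighbors=k).fit(X_tr, y_tr)
    # Small k: near-perfect training accuracy but weaker test accuracy (overfitting).
    # Very large k: both scores drop as the model becomes too smooth (underfitting).
    print(k, round(model.score(X_tr, y_tr), 2), round(model.score(X_te, y_te), 2))
```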
You are asked to use a large data set of customers to predict how many days after their first purchase they will make their second purchase. You can do this by developing a classification model.
✅✅CORRECT ANSWER: False (the target is numerical, so this calls for a regression model, not classification)
Categorical variables can NOT be used as predictors in the linear regression model. ✅✅CORRECT ANSWER: False
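A minimal sketch of using a categorical predictor in linear regression via dummy (one-hot) variables, assuming pandas and scikit-learn; the column names and values are made up for illustration:
```python
# Sketch: a categorical predictor in linear regression via dummy (one-hot) variables.
import pandas as pd
from sklearn.linear_model import LinearRegression

df = pd.DataFrame({
    "income": [40, 55, 62, 48],
    "region": ["east", "west", "west", "east"],  # categorical predictor
    "spend":  [12, 20, 22, 15],                  # numerical target
})

X = pd.get_dummies(df[["income", "region"]], drop_first=True)  # region becomes a 0/1 dummy column
model = LinearRegression().fit(X, df["spend"])
print(model.coef_, model.intercept_)
```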