Exam (worked solutions)

DSCI 4520 EXAM 1 SECTION 2 QUESTIONS WITH COMPLETE SOLUTIONS

Uploaded on

9 March 2025

Written in

2024/2025

Institution

WSS

Course

WSS

Document information

Uploaded on: 9 March 2025
Number of pages: 8
Written in: 2024/2025
Type: Exam (worked solutions)
Contains: Unknown

DSCI 4520 EXAM 1 SECTION 2
QUESTIONS WITH COMPLETE
SOLUTIONS
Which statement is INCORRECT about choosing the number of clusters in the k-means
clustering method?
A. Maximizing the within-cluster sums of squared errors (WSS) is the goal when
selecting k
B. Sometimes business considerations impose constraints on the value of k
C. Ability to do a useful profiling based on the cluster centroids helps us select an
appropriate value of k
D. Similar analyses can be used to inform our decision about an appropriate value of k -
Answer-Maximizing the within-cluster sums of squared errors (WSS) is the goal when
selecting k
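
The point behind this answer can be sketched in a few lines: WSS is something we want small, and it generally keeps shrinking as k grows, so we look for the k where the improvement levels off rather than for a maximum. The toy 1-D data, the helper names `assign`, `kmeans_1d` and `wss`, and the deterministic seeding are assumptions made for illustration only; they are not taken from the course material.

```python
def assign(points, centroids):
    """Group each point with its nearest centroid (squared distance)."""
    clusters = [[] for _ in centroids]
    for p in points:
        nearest = min(range(len(centroids)), key=lambda j: (p - centroids[j]) ** 2)
        clusters[nearest].append(p)
    return clusters


def kmeans_1d(points, k, max_iters=100):
    """Lloyd's algorithm on 1-D data with a deterministic start."""
    pts = sorted(points)
    n = len(pts)
    # Assumption: seed centroids at evenly spaced positions in the sorted data.
    centroids = [pts[(2 * i + 1) * n // (2 * k)] for i in range(k)]
    for _ in range(max_iters):
        clusters = assign(pts, centroids)
        # An empty cluster keeps its previous centroid.
        updated = [sum(c) / len(c) if c else centroids[j]
                   for j, c in enumerate(clusters)]
        if updated == centroids:
            break
        centroids = updated
    return centroids, assign(pts, centroids)


def wss(centroids, clusters):
    """Within-cluster sum of squared errors."""
    return sum((p - c) ** 2 for c, cl in zip(centroids, clusters) for p in cl)


# Toy data with three visible groups (illustrative values).
data = [1.0, 1.2, 0.8, 5.0, 5.3, 4.7, 9.0, 9.2, 8.8]

wss_by_k = {}
for k in range(1, 5):
    cents, cls = kmeans_1d(data, k)
    wss_by_k[k] = wss(cents, cls)

for k, value in wss_by_k.items():
    print(f"k={k}: WSS={value:.3f}")
```

On this data the WSS drops sharply up to k=3 and only marginally afterwards, which is the "elbow" pattern used to pick k.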

k-nearest neighbor (k-NN) is a supervised method that can be used for predicting
categorical or numerical targets.
True
False - Answer-True

In the k-nearest neighbor models, increasing the value of k leads to overfitting.
True
False - Answer-False

With the k-NN model for a numerical target, after we have determined the k nearest
neighbors of a new data record, how is the target value predicted?
A. Majority vote determines the predicted class
B. Average of the neighbors
C. Through a logistic regression between the neighbors
D. Through a linear combination of neighbors - Answer-Average of the neighbors
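
As a minimal sketch of the two prediction rules mentioned in these questions: for a numerical target the k neighbors' values are averaged, and for a categorical target the majority class wins. The data and the function names `k_nearest_targets`, `knn_regress` and `knn_classify` are hypothetical and chosen only for this example.

```python
import math
from collections import Counter


def k_nearest_targets(train_X, train_y, query, k):
    """Return the target values of the k training records closest to query."""
    ranked = sorted(zip(train_X, train_y),
                    key=lambda pair: math.dist(pair[0], query))
    return [y for _, y in ranked[:k]]


def knn_regress(train_X, train_y, query, k):
    """Numerical target: predict the average of the k neighbors."""
    targets = k_nearest_targets(train_X, train_y, query, k)
    return sum(targets) / len(targets)


def knn_classify(train_X, train_y, query, k):
    """Categorical target: predict the majority class among the k neighbors."""
    targets = k_nearest_targets(train_X, train_y, query, k)
    return Counter(targets).most_common(1)[0][0]


# Illustrative one-feature data set.
X = [(1.0,), (2.0,), (3.0,), (10.0,)]
prices = [10.0, 20.0, 30.0, 100.0]
labels = ["low", "low", "high", "high"]

predicted_price = knn_regress(X, prices, (2.2,), k=3)
predicted_label = knn_classify(X, labels, (2.2,), k=3)
print(predicted_price, predicted_label)
```

With k=3 the neighbors of 2.2 are the records at 2.0, 3.0 and 1.0, so the numerical prediction is the mean of 20, 30 and 10.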

Which statement is correct about the k-nearest neighbor (k-NN) method?
A. Underfitted k-NN models can be fixed by adding a dummy variable for accuracy
B. Logistic regression is a special case of k-NN
C. The value of k can control model overfitting and underfitting
D. Overfitted k-NN models can be fixed by decreasing k - Answer-The value of k can
control model overfitting and underfitting

Which statement is INCORRECT about k-NN predictive models?
A. Larger values of k increase the risk of over-fitting
B. When k=n (number of data records) the k-NN and the universal average methods are
the same
C. k-NN is sensitive to irrelevant features
D. Finding the optimum value of k can be computationally expensive - Answer-Larger
values of k increase the risk of over-fitting
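
The two extremes of k can be checked directly. In the sketch below (illustrative data, hypothetical helper `knn_average`), k=1 reproduces every training target exactly, which is the overfitting end, while k=n returns the overall average for any query, which is the underfitting end.

```python
def knn_average(train, query_x, k):
    """Predict by averaging the targets of the k nearest (x, y) records."""
    nearest = sorted(train, key=lambda row: abs(row[0] - query_x))[:k]
    return sum(y for _, y in nearest) / k


# Illustrative (x, y) training records.
train = [(1.0, 12.0), (2.0, 18.0), (3.0, 33.0), (4.0, 37.0), (5.0, 55.0)]
n = len(train)
global_mean = sum(y for _, y in train) / n

# k = 1: each training record is its own nearest neighbor, so training error is zero.
train_error_k1 = sum(abs(knn_average(train, x, 1) - y) for x, y in train)

# k = n: every query gets the same prediction, the average of all targets.
preds_kn = [knn_average(train, q, n) for q in (0.0, 2.5, 100.0)]

print(train_error_k1, preds_kn, global_mean)
```

A zero training error at k=1 says nothing about new data, which is why small k, not large k, carries the overfitting risk.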

When we are building a linear regression model, against what model do we compare it
to evaluate its significance?
Naïve (average) model
Logistic model
Classification model
Random model - Answer-Naïve (average) model

In a linear regression model, the t-Test for each predictor's coefficient indicates if the
estimated value is significantly different from zero.
True
False - Answer-True

In the development of a linear regression model, what is the naïve (baseline) model
against which we compare the performance of the linear model?
Simple linear model
Average model
Multiple linear model
Random guess - Answer-Average model
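
The comparison against the average model is what R-squared measures. The following sketch fits a simple least-squares line by hand and compares its squared error with that of the naïve model that always predicts the mean; the data values and variable names are assumptions for illustration.

```python
# Illustrative data: one predictor, one numerical target.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.1, 3.9, 6.2, 7.8, 10.1]
n = len(xs)

x_bar = sum(xs) / n
y_bar = sum(ys) / n

# Ordinary least squares for a single predictor.
slope = (sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
         / sum((x - x_bar) ** 2 for x in xs))
intercept = y_bar - slope * x_bar

# Squared error of the naive model (always predict the average of y).
sse_naive = sum((y - y_bar) ** 2 for y in ys)

# Squared error of the fitted line.
sse_linear = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))

# R-squared: share of the naive model's error that the line removes.
r_squared = 1 - sse_linear / sse_naive

print(f"slope={slope:.2f}, intercept={intercept:.2f}, R^2={r_squared:.4f}")
```

An R-squared near 1 means the line removes almost all of the error left by the average model; an R-squared near 0 means it does no better than the average.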

In the following scatter plot matrix, Price is the target variable. What predictor shows the
strongest negative correlation with Price?
CC
HP
Age_08_04
Weight - Answer-Age_08_04

The following report shows Excel output for a linear regression model. What can the
p-value of the F-statistic tell us?
A. If this p-value is less than our significance level then the coefficients are significant
B. If this p-value is larger than our significance level then the coefficients are significant
C. If this p-value is larger than our significance level then the model as a whole is
significant
D. If this p-value is less than our significance level then the model as a whole is
significant - Answer-If this p-value is less than our significance level then the model as a
whole is significant
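
A rough sketch of the overall F-test behind this answer, under stated assumptions: the data are made up, there is a single predictor, and instead of computing a p-value the code compares F with the tabulated 5% critical value for 1 and 3 degrees of freedom (about 10.13). Exceeding the critical value is equivalent to the p-value falling below the 0.05 significance level.

```python
# Illustrative data: one predictor (p = 1), five records.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.1, 3.9, 6.2, 7.8, 10.1]
n = len(xs)
p = 1

x_bar = sum(xs) / n
y_bar = sum(ys) / n
slope = (sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
         / sum((x - x_bar) ** 2 for x in xs))
intercept = y_bar - slope * x_bar

sst = sum((y - y_bar) ** 2 for y in ys)                     # total variation
sse = sum((y - (intercept + slope * x)) ** 2
          for x, y in zip(xs, ys))                          # unexplained variation
ssr = sst - sse                                             # explained variation

# F = (explained variation per predictor) / (unexplained variation per residual df).
f_stat = (ssr / p) / (sse / (n - p - 1))

# Assumption: tabulated critical value of F at alpha = 0.05 with (1, 3) df.
F_CRIT_05 = 10.13
model_is_significant = f_stat > F_CRIT_05

print(f"F={f_stat:.1f}, significant at 5%: {model_is_significant}")
```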

We have developed two different linear regression models on the same data set. Which
model shows a better goodness-of-fit?
Not enough information
Models are the same
Model B
Model A - Answer-Model A
