DSCI 4520 EXAM 1 SECTION 2 QUESTIONS WITH COMPLETE SOLUTIONS

Pages
8
Uploaded on
March 9, 2025
Written in
2024/2025

Institution
WSS
Course
WSS
Which statement is INCORRECT about choosing the number of clusters in the k-means
clustering method?
A. Maximizing the within-cluster sums of squared errors (WSS) is the goal when
selecting k
B. Sometimes business considerations impose constraints on the value of k
C. The ability to do useful profiling based on the cluster centroids helps us select an
appropriate value of k
D. Similar analyses can be used to inform our decision about an appropriate value of k -
Answer-Maximizing the within-cluster sums of squared errors (WSS) is the goal when
selecting k
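
A minimal sketch of why WSS is something to keep small rather than maximize: a plain one-dimensional k-means run for several values of k. The data points, the helper names (`wss`, `kmeans_1d`), and the evenly spread initialization are assumptions made for illustration and do not come from the exam.

```python
def wss(points, centroids):
    """Within-cluster sum of squared errors: squared distance to the nearest centroid."""
    return sum(min((x - c) ** 2 for c in centroids) for x in points)


def kmeans_1d(points, k, iters=50):
    """Lloyd's algorithm on 1-D data with a simple deterministic initialization."""
    pts = sorted(points)
    # Spread the starting centroids evenly across the sorted data (an assumption).
    centroids = [pts[(2 * j + 1) * len(pts) // (2 * k)] for j in range(k)]
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for x in pts:
            nearest = min(range(k), key=lambda j: (x - centroids[j]) ** 2)
            clusters[nearest].append(x)
        # Recompute each centroid as its cluster mean; keep the old one if the cluster is empty.
        centroids = [
            sum(c) / len(c) if c else centroids[j] for j, c in enumerate(clusters)
        ]
    return centroids


# Hypothetical data with three visible groups.
data = [0.8, 1.0, 1.2, 4.9, 5.0, 5.3, 8.7, 9.0, 9.2]

wss_by_k = {k: wss(data, kmeans_1d(data, k)) for k in range(1, 5)}
for k, value in wss_by_k.items():
    print(f"k={k}  WSS={value:.3f}")
```

On this toy data WSS keeps shrinking as k grows, with a sharp drop up to k=3 and only a small gain afterwards. That "elbow" is what informs the choice of k, alongside business constraints and how well the centroids can be profiled.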

k-nearest neighbor (k-NN) is a supervised method that can be used for predicting
categorical or numerical targets.
True
False - Answer-True

In k-nearest neighbor models, increasing the value of k leads to overfitting.
True
False - Answer-False

With the k-NN model for a numerical target, after we have determined the k nearest
neighbors of a new data record, how is the target value predicted?
A. Majority vote determines the predicted class
B. Average of the neighbors
C. Through a logistic regression between the neighbors
D. Through a linear combination of neighbors - Answer-Average of the neighbors
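
The averaging step for a numerical target can be sketched as follows. The function name `knn_predict`, the use of Euclidean distance, and the training records are assumptions chosen for illustration.

```python
def knn_predict(train_x, train_y, query, k):
    """Predict a numerical target as the mean target of the k nearest training records."""
    # Pair each training record's Euclidean distance to the query with its target value.
    distances = sorted(
        (sum((a - b) ** 2 for a, b in zip(x, query)) ** 0.5, y)
        for x, y in zip(train_x, train_y)
    )
    neighbor_targets = [y for _, y in distances[:k]]
    return sum(neighbor_targets) / k


# Hypothetical training data: one predictor, one numerical target.
train_x = [(1.0,), (2.0,), (3.0,), (10.0,)]
train_y = [10.0, 20.0, 30.0, 100.0]

prediction = knn_predict(train_x, train_y, query=(2.1,), k=3)
print(prediction)
```

With k=3 the three closest records are those at 2.0, 3.0, and 1.0, so the prediction is the average of their targets. For a categorical target the last step would be a majority vote instead.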

Which statement is correct about the k-nearest neighbor (k-NN) method?
A. Underfitted k-NN models can be fixed by adding a dummy variable for accuracy
B. Logistic regression is a special case of k-NN
C. The value of k can control model over- and underfitting
D. Overfitted k-NN models can be fixed by decreasing k - Answer-The value of k can
control model over- and underfitting

Which statement is INCORRECT about k-NN predictive models?
A. Larger values of k increase the risk of over-fitting
B. When k=n (number of data records) the k-NN and the universal average methods are
the same
C. k-NN is sensitive to irrelevant features
D. Finding the optimum value of k can be computationally expensive - Answer-Larger
values of k increase the risk of over-fitting
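
The two extremes of k can be checked with a small sketch. The data and the helper name `knn_mean` are assumptions for illustration: with k=1 every training record is its own nearest neighbor, so the model reproduces the training targets exactly (the overfitting end), while with k=n every prediction collapses to the overall average (the underfitting end).

```python
def knn_mean(xs, ys, query, k):
    """1-D k-NN regression: average the targets of the k closest training records."""
    nearest = sorted(zip(xs, ys), key=lambda pair: abs(pair[0] - query))[:k]
    return sum(y for _, y in nearest) / k


# Hypothetical training data with distinct x values.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [3.0, 1.0, 4.0, 1.0, 6.0]
n = len(xs)
overall_mean = sum(ys) / n

# k = 1: predictions on the training records match the training targets.
preds_k1 = [knn_mean(xs, ys, x, 1) for x in xs]

# k = n: the prediction is the same for every query, including unseen ones.
preds_kn = [knn_mean(xs, ys, q, n) for q in (0.0, 2.5, 100.0)]

print(preds_k1)
print(preds_kn, overall_mean)
```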

When we are building a linear regression model, against what model do we compare it
to evaluate its significance?
Naïve (average) model
Logistic model
Classification model
Random model - Answer-Naïve (average) model

In a linear regression model, the t-Test for each predictor's coefficient indicates if the
estimated value is significantly different from zero.
True
False - Answer-True
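
As a rough illustration of the coefficient t-test, the sketch below fits a one-predictor regression by least squares and computes the t statistic for the slope as the estimate divided by its standard error. The data values and variable names are hypothetical. Turning the statistic into a p-value needs the t distribution with n - 2 degrees of freedom, which is left to a statistics package.

```python
import math

# Hypothetical data, invented for illustration only.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1]

n = len(x)
x_mean = sum(x) / n
y_mean = sum(y) / n

# Least-squares estimates for y = intercept + slope * x.
s_xx = sum((xi - x_mean) ** 2 for xi in x)
s_xy = sum((xi - x_mean) * (yi - y_mean) for xi, yi in zip(x, y))
slope = s_xy / s_xx
intercept = y_mean - slope * x_mean

# Residual variance uses n - 2 degrees of freedom (two estimated coefficients).
sse = sum((yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y))
residual_variance = sse / (n - 2)

# Standard error of the slope and the t statistic for H0: slope = 0.
slope_se = math.sqrt(residual_variance / s_xx)
t_stat = slope / slope_se

print(f"slope={slope:.3f}  SE={slope_se:.4f}  t={t_stat:.1f}")
```

A t statistic far from zero, relative to the critical value for n - 2 degrees of freedom, indicates that the coefficient is significantly different from zero.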

In the development of a linear regression model, what is the naive (baseline) model
against which we compare the performance of the linear model?
Simple linear model
Average model
Multiple linear model
Random guess - Answer-Average model
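
One way to see the comparison against the average model is through R-squared, which measures how much of the naive model's squared error the regression removes. The sketch below uses hypothetical data and variable names; it is an illustration under those assumptions, not the course's own worked example.

```python
# Hypothetical data, invented for illustration only.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1]

n = len(x)
x_mean = sum(x) / n
y_mean = sum(y) / n

# Naive (average) model: predict the mean of y for every record.
sse_naive = sum((yi - y_mean) ** 2 for yi in y)

# Simple linear regression fitted by least squares.
slope = sum((xi - x_mean) * (yi - y_mean) for xi, yi in zip(x, y)) / sum(
    (xi - x_mean) ** 2 for xi in x
)
intercept = y_mean - slope * x_mean
sse_model = sum((yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y))

# R-squared: share of the naive model's squared error that the regression explains.
r_squared = 1 - sse_model / sse_naive

print(f"SSE naive={sse_naive:.3f}  SSE model={sse_model:.3f}  R^2={r_squared:.4f}")
```

An R-squared near 0 means the regression does little better than always predicting the average; a value near 1 means it removes almost all of the naive model's error.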

In the following scatter plot matrix, Price is the target variable. What predictor shows the
strongest negative correlation with Price?
CC
HP
Age_08_04
Weight - Answer-Age_08_04

The following report shows Excel output for a linear regression model. What can the
p-value of the F-statistic tell us?
A. If this p-value is less than our significance level then the coefficients are significant
B. If this p-value is larger than our significance level then the coefficients are significant
C. If this p-value is larger than our significance level then the model as a whole is
significant
D. If this p-value is less than our significance level then the model as a whole is
significant - Answer-If this p-value is less than our significance level then the model as a
whole is significant
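
For reference, the overall F-test can be written as below. The symbols are assumptions introduced here: n is the number of records, p the number of predictors, SSR the regression sum of squares, and SSE the residual sum of squares.

```latex
H_0:\ \beta_1 = \beta_2 = \dots = \beta_p = 0
\qquad
F = \frac{\mathrm{SSR}/p}{\mathrm{SSE}/(n - p - 1)}
```

Under the null hypothesis F follows an F distribution with p and n - p - 1 degrees of freedom. A p-value below the chosen significance level rejects the hypothesis that all slope coefficients are zero, which is what "the model as a whole is significant" means. Excel's regression output labels this p-value "Significance F".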

We have developed two different linear regression models on the same data set. Which
model shows a better goodness-of-fit?
Not enough information
Models are the same
Model B
Model A - Answer-Model A