Data Mining Final Exam 2024 Questions with 100% correct Answers

Rating: -
Sold: -
Pages: 17
Grade: A+
Uploaded on: 16-08-2024
Written in: 2024/2025

Institution: Data Mining
Course: Data Mining











Document information

Uploaded on: August 16, 2024
Number of pages: 17
Written in: 2024/2025
Type: Exam
Contains: Questions and answers


Content preview



In a data set with 22 variables, if 13% of the values, randomly spread across observations, are missing
(blank), what is the probable percent of complete and usable observations?

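4.67%

An observation is complete only if all 22 of its values are present, so the probability is $(1 - 0.13)^{22} \approx 0.0467$, or 4.67%.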




In a data set with 20 variables, if 8% of the values, randomly spread across observations, are missing
(blank), what is the probable percent of complete and usable observations?

$(1 - 0.08)^{20} \approx 0.1887$, or 18.87%.
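The two answers above follow from one formula. As a minimal sketch in R, assuming each value is missing independently of the others (the function name `complete_case_share` is illustrative, not from the exam):

```r
# Probability that an observation with p variables has no missing values,
# assuming each value is missing independently with probability miss_rate.
complete_case_share <- function(miss_rate, p) {
  (1 - miss_rate)^p
}

# Percent of complete observations for the two questions above.
round(100 * complete_case_share(0.13, 22), 2)  # 22 variables, 13% missing -> 4.67
round(100 * complete_case_share(0.08, 20), 2)  # 20 variables,  8% missing -> 18.87
```

The share of usable rows drops quickly as the number of variables grows, which is why a modest per-value missing rate can leave very few complete observations.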




When performing an analysis, one technique is called RFM. Which of the following is not reflective of
RFM?

Relevancy

RFM stands for recency, frequency, and monetary value.




Mark wants to have a better understanding of his client base at the credit union. To do so, he is
running a report to show loan amount approval with corresponding credit scores. He realized the data
set is quite large and wants to create categories by grouping. To do this, he needs to do all the
following except

Remove 20% of the data to create a training set

Binning takes the entire data set, identifies the variable to be grouped, divides its values into smaller,
non-overlapping groups (bins), and labels each bin accordingly.

In R, Mary wants to understand the number of days between rain events in Chicago, IL. What function
is used to find the number of days between today and January 1, 2026?

difftime
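A short sketch of how base R's `difftime()` could be used here. The start date is a fixed, hypothetical stand-in for "today" so that the result is reproducible; in practice `Sys.Date()` would take its place.

```r
# Hypothetical stand-in for "today"; replace with Sys.Date() for the live value.
start_date <- as.Date("2025-01-01")
end_date   <- as.Date("2026-01-01")

# difftime(time1, time2) returns time1 - time2 in the requested units.
days_between <- difftime(end_date, start_date, units = "days")
as.numeric(days_between)  # 365, since 2025 is not a leap year
```

Note that `difftime()` measures elapsed time between two dates; counting rain events in that window would be a separate step on the event data.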




Using R, what is the formula that will allow for the weekday function to display the day of the week
for November 15, 2020?

> weekdays(as.Date("2020-11-15"))




Using R, what function is used to evaluate the categories in the variable to identify the dummy
variables?

ifelse
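For illustration, a minimal sketch of building a dummy variable with `ifelse()`. The variable `pay_type` and its values are made up for this example.

```r
# Hypothetical categorical variable; the values are illustrative only.
pay_type <- c("Hourly", "Salary", "Hourly", "Salary", "Salary")

# 1 when the category is "Salary", 0 otherwise ("Hourly" is the reference category).
salary_dummy <- ifelse(pay_type == "Salary", 1, 0)
salary_dummy
```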




Michael is examining a data set and trying to determine which category he can transform into a
dummy variable. Of the four variables, Employee Number, Pay Rate, Hire Date, and Sex, which is the
best fit to use a dummy variable?

Sex




Marcus wants to include the month of the year in the analysis as categories. How many dummy
variables will be needed?

11

With k = 12 categories, k − 1 = 12 − 1 = 11 dummy variables are needed; the omitted category serves as the reference.
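The k − 1 rule can be checked in R. This sketch assumes R's default treatment contrasts and uses the built-in `month.name` vector; `model.matrix()` is one way to see the dummy columns, not necessarily the approach the course uses.

```r
# Month as a factor with 12 levels, using R's built-in month.name vector.
month <- factor(month.name, levels = month.name)

# model.matrix() produces an intercept column plus k - 1 indicator columns;
# the first level (January) is the omitted reference category.
design <- model.matrix(~ month)
ncol(design) - 1  # dummy columns, excluding the intercept: 11
```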




Kara is reviewing categories where a series of numbers represent the type of loan. She would prefer
the actual name of the loan be retained when running her analysis. Using Microsoft Excel, what
function will allow Kara to retain the category name instead of recording them in numbers?

IF function

An IF function allows statements to be crafted that transform numbers into category names.




What data preparation technique is Maeve using when she extracts a payroll data set into two
separate files, one for hourly employees and one for salary employees?

Subsetting
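As a rough sketch of subsetting in R, the payroll split could look like the following. The data frame and its column names are hypothetical.

```r
# Hypothetical payroll data; column names and values are illustrative only.
payroll <- data.frame(
  employee_id = 1:4,
  pay_type    = c("Hourly", "Salary", "Hourly", "Salary")
)

# Keep only the rows that match each condition.
hourly   <- subset(payroll, pay_type == "Hourly")
salaried <- subset(payroll, pay_type == "Salary")

nrow(hourly)
nrow(salaried)
```

Each subset could then be written to its own file, for example with `write.csv()`.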




Regression analysis captures the relationship between only two distinct variables.

False

Regression analysis captures the relationship between two or more variables.




The response variable is the outcome variable, whereas the predictor variables are the inputs.

True




$R^2$ in linear regression is the correlation coefficient.

False

$R^2$ in linear regression is the coefficient of determination, which is the proportion of the sample
variation in the response variable that is explained by the sample regression equation. The correlation
coefficient measures the strength and direction of the linear relationship between two variables.
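The distinction can be seen numerically. This sketch uses R's built-in `cars` data set as an arbitrary example; in a simple linear regression with one predictor, the coefficient of determination equals the square of the correlation coefficient, but the two are different quantities.

```r
# Built-in cars data: stopping distance (dist) regressed on speed.
fit <- lm(dist ~ speed, data = cars)

r_squared <- summary(fit)$r.squared   # coefficient of determination
r         <- cor(cars$speed, cars$dist)  # correlation coefficient

# With a single predictor, r_squared matches r^2 up to rounding error.
c(r = r, r_squared = r_squared, r_times_r = r^2)
```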




$R^2$, also known as the coefficient of determination, quantifies the proportion of the sample variation
in the predictor variables ($x_i$) that is explained by the sample regression equation.

False

$R^2$ quantifies the proportion of the sample variation in the response variable $y$ that is explained by
the sample regression equation, not the variation in the predictor variables.
Seller: EDWARDLEON, Walden University