Summary Week 12
Using the code in section 4.3 to scale the age variable. ANSWER THE FOLLOWING QUESTIONS: 1. Explain what the mutate function is doing in this line of code: mutate(scaled_age = (age - !!scale_values$mean-age) / !!scale_values$sd_age) 2. What do the two exclamation marks next to each other do? Use the code in the book to create a histogram of Scaled Age ANSWER THESE QUESTION: 3. Approximately how many profiles in the training set fall in the 0 bin? Using the code in section 4.3, aggregate the profiles in the training set by ethnicity. ANSWER THE FOLLOWING QUESTION: 4. What does it mean to have a 'combination of ethnicities'? Use the code to create dummy variables for each race/ethnicity. ANSWER THE FOLLOWING QUESTIONS: 5. What are the dummy variables? 6. Show how okc_train has been transformed. Using the code from section 4.3, add a column called essay_length to okc_train. ANSWER THE FOLLOWING QUESTIONS: 7. Explain what is happening in this code: essay_length = char_length(paste(!!!syms(paste0("essay", 0:9)))) Use the code in section 4.3 to create a histrogram of essay length. ANSWER THE FOLLOWING QUESTION: 8. What does bins = 100 mean in the code? Using the code in section 4.3, save the training file as a Parquet file.
Libro relacionado
Escuela, estudio y materia
- Institución
- Big Data Tools & Architecture
- Grado
- Big Data Tools & Architecture
Información del documento
- ¿Un libro?
- No
- ¿Qué capítulos están resumidos?
- Desconocido
- Subido en
- 15 de julio de 2023
- Archivo actualizado en
- 22 de febrero de 2024
- Número de páginas
- 4
- Escrito en
- 2022/2023
- Tipo
- Resumen