100% de satisfacción garantizada Inmediatamente disponible después del pago Tanto en línea como en PDF No estas atado a nada 4.2 TrustPilot
logo-home
Resumen

Samenvatting Advanced Analytics in a Big Data World (D0S06B)

Puntuación
-
Vendido
2
Páginas
91
Subido en
12-03-2025
Escrito en
2023/2024

Samenvatting van de volledige cursus op basis van de notities en slides voor het vak Advanced Analytics in a Big Data World (D0S06B) HIR(B) 2e master. Geslaagd eerste zit.

Institución
Grado











Ups! No podemos cargar tu documento ahora. Inténtalo de nuevo o contacta con soporte.

Escuela, estudio y materia

Institución
Estudio
Grado

Información del documento

Subido en
12 de marzo de 2025
Número de páginas
91
Escrito en
2023/2024
Tipo
Resumen

Temas

Vista previa del contenido

ADVANCED ANALYTICS
Prof. Seppe vanden Broucke




KU Leuven

,TABLE OF CONTENTS
Table of Contents...................................................................................................................................1
1 Introduction........................................................................................................................................4
1.1 Setting the Scene.........................................................................................................................4
1.2 Components of Data Science.......................................................................................................4
1.3 Process, People, and Problems....................................................................................................5
2 Preprocessing and Feature Engineering..............................................................................................7
2.1 Preprocessing Steps.....................................................................................................................7
2.2 Feature Engineering...................................................................................................................10
2.3 Conclusion.................................................................................................................................10
3 Supervised Learning..........................................................................................................................12
3.1 (Logistic) Regression..................................................................................................................12
3.2 Decision and Regression Trees...................................................................................................13
3.3 K-NN...........................................................................................................................................15
4 Model Evaluation..............................................................................................................................16
4.1 Introduction...............................................................................................................................16
4.2 Classification Performance.........................................................................................................16
4.3 Regression Performance............................................................................................................19
4.4 Cross-Validation and Tuning......................................................................................................19
4.5 Additional Notes........................................................................................................................20
4.6 Monitoring and Maintenance....................................................................................................21
5 Ensemble Modelling: Bagging and Boosting.....................................................................................23
5.1 Introduction...............................................................................................................................23
5.2 Bagging......................................................................................................................................23
5.3 Boosting.....................................................................................................................................24
5.4 Comparing Bagging and Boosting..............................................................................................25
6 Interpretability..................................................................................................................................26
6.1 Introduction...............................................................................................................................26
6.2 Feature importance...................................................................................................................26
6.3 Partial Dependence Plots...........................................................................................................27
6.4 Individual Conditional Expectation plots....................................................................................27
6.5 LIME...........................................................................................................................................27
6.6 Shapley values...........................................................................................................................28
6.7 Conclusion.................................................................................................................................28


1

,7 Deep Learning Part 1: Foundations and Images................................................................................29
7.1 Introduction...............................................................................................................................29
7.2 Foundations of artificial neural networks..................................................................................30
7.3 Delving deeper into Artificial Neural Networks..........................................................................31
7.4 The convolutional architecture..................................................................................................33
7.5 Interpretation of convolutional neural networks.......................................................................35
7.6 Generative models for images...................................................................................................37
8 Unsupervised Learning.....................................................................................................................45
8.1 Frequent itemset and association rule mining...........................................................................45
8.2 Clustering...................................................................................................................................47
8.3 Dimensionality reduction...........................................................................................................50
8.4 Anomaly detection.....................................................................................................................51
9 Data Science Tools............................................................................................................................53
9.1 In-memory analytics..................................................................................................................53
9.2 Python and R..............................................................................................................................53
9.3 Visualization...............................................................................................................................53
9.4 The road to big data...................................................................................................................54
9.5 Notebooks and development environments.............................................................................54
9.6 Labeling......................................................................................................................................55
9.7 File formats................................................................................................................................55
9.8 Packaging and versioning systems.............................................................................................57
9.9 Model deployment....................................................................................................................58
10 Hadoop, Spark, and Streaming Analytics........................................................................................61
10.1 Introduction.............................................................................................................................61
10.2 Hadoop: HDFS and MapReduce...............................................................................................61
10.3 Spark: SparkSQL and MLlib......................................................................................................64
10.4 Streaming analytics and other trends......................................................................................67
11 Deep Learning Part 2: Text, Representation Learning and Recurrence...........................................69
11.1 Traditional approaches............................................................................................................69
11.2 Word embeddings and representational learning...................................................................70
11.3 Recurrent neural networks (RNN)............................................................................................73
11.4 From RNNs to Transformers....................................................................................................75
11.5 Conclusion...............................................................................................................................77
12 Graph Analytics...............................................................................................................................78
12.1 Graph construction.................................................................................................................78
12.2 Graph metrics..........................................................................................................................78

2

, 12.3 Community mining...................................................................................................................79
12.4 Making predictions: Relational learners..................................................................................80
12.5 Making predictions: Featurization...........................................................................................82
12.6 Example...................................................................................................................................82
12.7 A word on validation................................................................................................................82
12.8 Node2vec and deep learning...................................................................................................83
12.9 Tooling.....................................................................................................................................86
12.10 NoSQL....................................................................................................................................86
12.11 Graph databases....................................................................................................................87
13 Wrap Up..........................................................................................................................................89
13.1 Key pitfalls................................................................................................................................89
13.2 Closing......................................................................................................................................90




3
$12.65
Accede al documento completo:

100% de satisfacción garantizada
Inmediatamente disponible después del pago
Tanto en línea como en PDF
No estas atado a nada

Conoce al vendedor

Seller avatar
Los indicadores de reputación están sujetos a la cantidad de artículos vendidos por una tarifa y las reseñas que ha recibido por esos documentos. Hay tres niveles: Bronce, Plata y Oro. Cuanto mayor reputación, más podrás confiar en la calidad del trabajo del vendedor.
rikteugels Katholieke Universiteit Leuven
Seguir Necesitas iniciar sesión para seguir a otros usuarios o asignaturas
Vendido
54
Miembro desde
2 año
Número de seguidores
8
Documentos
6
Última venta
1 mes hace

4.5

2 reseñas

5
1
4
1
3
0
2
0
1
0

Recientemente visto por ti

Por qué los estudiantes eligen Stuvia

Creado por compañeros estudiantes, verificado por reseñas

Calidad en la que puedes confiar: escrito por estudiantes que aprobaron y evaluado por otros que han usado estos resúmenes.

¿No estás satisfecho? Elige otro documento

¡No te preocupes! Puedes elegir directamente otro documento que se ajuste mejor a lo que buscas.

Paga como quieras, empieza a estudiar al instante

Sin suscripción, sin compromisos. Paga como estés acostumbrado con tarjeta de crédito y descarga tu documento PDF inmediatamente.

Student with book image

“Comprado, descargado y aprobado. Así de fácil puede ser.”

Alisha Student

Preguntas frecuentes