Correct- University of Alberta
ECE 447
Midterm
Thursday 13th of February, 2025, 11:00 AM
Duration: 50min
Name:
Student ID:
Questions Worth Mark Subject
1-7 25 ML and Data Analysis
8-12 23 Evaluation
13 10 Confusion Matrix
14 10 Clamp Transformation
Total 68 --
, ECE447 – Data Analysis and Machine Learning for Engineers
1. Explain the importance of data quality in machine learning. What are the potential
consequences of using poor quality data? (4 pts)
Data quality is crucial in machine learning for the following reasons:
Garbage in, garbage out principle - poor quality data leads to poor model performance
Models learn patterns from data, so errors/inconsistencies in data will be reflected in
predictions
Poor quality data can lead to biased models and unreliable results
Clean, high-quality data reduces the need for complex model architectures
2. What is 'overfitting' in machine learning and why is it a concern? (3 pts)
a) When a model performs well on both training and test data
b) When a model performs well on training data but poorly on test data
c) When a model performs poorly on training data but well on test data
d) When a model performs poorly on both training and test data