BIDA 630 DATA ANALYTICS
QUESTIONS AND CORRECT ANSWERS |
LATEST UPDATE
Identify whether the task required is supervised or unsupervised learning: Deciding whether
to issue a loan to an applicant based on demographic and financial data (with reference to a
database of similar data on prior customers).
- Supervised
- Unsupervised
✓ -:- Supervised
This is supervised learning, because the database includes whether the loan was approved or
not.
Identify whether the task required is supervised or unsupervised learning: Printing of custom
discount coupons at the conclusion of a grocery store checkout based on what you just
bought and what others have bought previously.
1|P a g e | G r a d e A + | 2 0 2 0 2 5
,2024 /2025 | © copyright | This work may not be copied for profit gain | Excel!
- Supervised
- Unsupervised
✓ -:- Unsupervised
This is unsupervised learning, if we assume that we do not know what will be purchased in
the future.
The test data are used to build models, or to further tweak the model or improve its fit.
- True
- False
✓ -:- False
The test data are not used to build models, or to further tweak the model or improve its fit.
(If the test data were used for these purposes, they would play a role in building or selecting
the best model, and would no longer provide an unbiased assessment of the chosen model's
performance with completely new data.)
2|P a g e | G r a d e A + | 2 0 2 0 2 5
, 2024 /2025 | © copyright | This work may not be copied for profit gain | Excel!
_____________ of data is used to assess the performance of each supervised learning
model so that we can compare models and pick the best one.
- The test partition
- The validation partition
✓ -:- Validation
The validation partition is used to assess the performance of each supervised learning model
so that we can compare models and pick the best one. In some algorithms (e.g.,
classification and regression trees, k-nearest neighbors) the validation partition may be used
in automated fashion to tune and improve the model. This means that the validation data
are actually used to help build the model.
When a model is fit to training data, zero error with those data is not necessarily good. This
special case is called ______.
- Overestimating
- Good fit
3|P a g e | G r a d e A + | 2 0 2 0 2 5