QMB3302 UF Fall '22 Final Questions and Answers
NLP
Ans: natural language processing
Tokenization
Ans: splitting text into smaller units (tokens) such as words or
characters, which are then turned into something a computer can read and
work with, like numbers
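A minimal sketch of tokenization in plain Python, assuming a simple word-level tokenizer. The function names `tokenize` and `build_vocab` and the sample sentence are hypothetical; real NLP libraries use more elaborate rules.

```python
import re

def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())

def build_vocab(tokens):
    """Give each distinct token an integer id, in order of first appearance."""
    vocab = {}
    for tok in tokens:
        if tok not in vocab:
            vocab[tok] = len(vocab)
    return vocab

text = "The cat sat on the mat"          # hypothetical sample sentence
tokens = tokenize(text)                   # list of word strings
vocab = build_vocab(tokens)               # token -> integer id
ids = [vocab[tok] for tok in tokens]      # the numeric form of the sentence

print(tokens)
print(ids)
```

Repeated words share one id, so the numeric sequence keeps the word order while using a fixed vocabulary.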
Two of the most common recommenders, often used together
Ans: user-based, item-based (both are forms of collaborative filtering)
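A rough sketch of the difference between the two recommender views, assuming cosine similarity as the measure. The ratings table and the helper names are made up for illustration; a real system would also handle missing ratings more carefully.

```python
import math

# Hypothetical ratings: rows are users, columns are items, 0 means "not rated".
ratings = {
    "ann": [5, 4, 0, 1],
    "bob": [4, 5, 1, 0],
    "cat": [1, 0, 5, 4],
}

def cosine(a, b):
    """Cosine similarity between two equal-length rating vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def user_similarity(u, v):
    """User-based view: compare two users by the rows of ratings they gave."""
    return cosine(ratings[u], ratings[v])

def item_similarity(i, j):
    """Item-based view: compare two items by the columns of ratings they received."""
    col_i = [row[i] for row in ratings.values()]
    col_j = [row[j] for row in ratings.values()]
    return cosine(col_i, col_j)

print(user_similarity("ann", "bob"), user_similarity("ann", "cat"))
print(item_similarity(0, 1), item_similarity(0, 2))
```

A user-based recommender would suggest to ann what her most similar user liked; an item-based one would suggest items similar to those she already rated highly.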
Imagine you have a dataset with 2 columns, both filled with
continuous numbers. You believe the first column is a predictor of
the second column. Which of the model approaches below could
work?
1. random forest
2. running .describe and .info on the data
3. regression
4. decision trees
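Ans: regression (the obvious choice); random forest and decision trees could
also work but are not the best choice. Running .describe and .info only
summarizes the data and does not fit a model.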
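A minimal sketch of the regression option for a two-column dataset, using the closed-form least-squares formulas in plain Python. The column values and the names `fit_line` and `predict` are hypothetical.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Hypothetical data: the second column is roughly 2 * first column + 1.
col1 = [1.0, 2.0, 3.0, 4.0, 5.0]
col2 = [3.1, 4.9, 7.2, 8.8, 11.0]

slope, intercept = fit_line(col1, col2)

def predict(x):
    """Predict a second-column value from a first-column value."""
    return intercept + slope * x

print(slope, intercept, predict(6.0))
```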
Most talked-about problem with decision trees
Ans: overfitting (a deep tree can memorize the training data and then
generalize poorly)
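The LinearRegression estimator is only capable of simple straight-line
fits: true or false?
Ans: false (it can also fit multidimensional linear models and, with
transformed inputs such as polynomial features, curves)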
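One way to see why the answer is false: a linear model is linear in its coefficients, not necessarily in the raw input. The sketch below is a plain-Python stand-in, not scikit-learn itself; it fits the same least-squares line twice, once on x and once on the transformed feature x squared. In scikit-learn the same idea is usually expressed by passing transformed features (for example from PolynomialFeatures) to LinearRegression. The data and helper names are hypothetical.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def r_squared(ys, preds):
    """Share of the variance in ys explained by the predictions."""
    mean_y = sum(ys) / len(ys)
    ss_res = sum((y - p) ** 2 for y, p in zip(ys, preds))
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    return 1 - ss_res / ss_tot

# Hypothetical curved data: y = 2 + 3 * x^2.
xs = [-3, -2, -1, 0, 1, 2, 3]
ys = [2 + 3 * x ** 2 for x in xs]

# Straight-line fit on the raw input.
s1, i1 = fit_line(xs, ys)
r2_line = r_squared(ys, [i1 + s1 * x for x in xs])

# The same linear fit on the transformed feature z = x^2 follows the curve.
zs = [x ** 2 for x in xs]
s2, i2 = fit_line(zs, ys)
r2_curve = r_squared(ys, [i2 + s2 * z for z in zs])

print(r2_line, r2_curve)
```

The fitting procedure is identical in both cases; only the input feature changes, which is what lets a linear estimator describe a curve.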
5 steps to building a machine learning model