ANSWERS
The variable CHAS (whether bounds Charles River) in Boston Housing dataset is a categorical
variable. Which of the following graph is appropriate to explore its frequency counts.
✅✅CORRECT ANSW-Bar Chart
Which of the following is the feature of data warehouses? ✅✅CORRECT ANSW-All of the above.
The data is clean.
The data is well organized in standard format.
They are used for analysis
Which of the statement about measurement level is true? ✅✅CORRECT ANSW-Interval variables
can be measured as ordinal.
A variable can also be referred as an attribute, a feature, or a dimension in data mining.
✅✅CORRECT ANSW-True
We always discard observations that are statistical outliers (e.g. >3rd quartile + 1.5*Interquartile
range). ✅✅CORRECT ANSW-False
To add "CHAS" variable (whether the tract bounds Charles River, 1-Yes, 0-No) in Boston Housing data
to the scatterplot of "MEDV" and "NOX", we can use: ✅✅CORRECT ANSW-aes()
The ultimate goal of business analytics is to develop more complicated methods and technologies.
✅✅CORRECT ANSW-False
The course has two exams. Both are open book and the second one is held during the week of
university finals. ✅✅CORRECT ANSW-False
Which of the following is (are) step(s) in the data mining process? ✅✅CORRECT ANSW-All of the
Above
Interpretation/Reporting
Data preparation