|\ |\ |\ |\ |\ |\ |\
with answers |\
Observation: - CORRECT ANSWERS ✔✔set of recorded values of |\ |\ |\ |\ |\ |\ |\ |\ |\
variables associated with a single entity.
|\ |\ |\ |\ |\
"Data-mining approaches can be separated into two categories. - |\ |\ |\ |\ |\ |\ |\ |\ |\
CORRECT ANSWERS ✔✔" Supervised learning , Unsupervised
|\ |\ |\ |\ |\ |\ |\
learning
Supervised learning - - CORRECT ANSWERS ✔✔For prediction and
|\ |\ |\ |\ |\ |\ |\ |\
classification.
|\
Unsupervised learning - - CORRECT ANSWERS ✔✔To detect |\ |\ |\ |\ |\ |\ |\ |\
patterns and relationships in the data.
|\ |\ |\ |\ |\
Data Sampling: - CORRECT ANSWERS ✔✔Extract a sample of
|\ |\ |\ |\ |\ |\ |\ |\ |\
data that is relevant to the business problem under
|\ |\ |\ |\ |\ |\ |\ |\ |\
consideration.
Data Preparation: - CORRECT ANSWERS ✔✔Manipulate the data
|\ |\ |\ |\ |\ |\ |\ |\
to put it in a form suitable for formal modeling.
|\ |\ |\ |\ |\ |\ |\ |\ |\
Model Construction: - CORRECT ANSWERS ✔✔Apply the
|\ |\ |\ |\ |\ |\ |\
appropriate data-mining technique (regression, classification
|\ |\ |\ |\ |\
trees, k-means) to accomplish the desired data-mining task
|\ |\ |\ |\ |\ |\ |\ |\
(prediction, classification, clustering, etc.). |\ |\ |\
, Model Assessment: - CORRECT ANSWERS ✔✔Evaluate models by
|\ |\ |\ |\ |\ |\ |\ |\
comparing performance on appropriate data sets.
|\ |\ |\ |\ |\
When dealing with large volumes of data, - CORRECT ANSWERS
|\ |\ |\ |\ |\ |\ |\ |\ |\ |\
✔✔it is best practice to extract a representative sample for
|\ |\ |\ |\ |\ |\ |\ |\ |\ |\
analysis.
A sample is representative, - CORRECT ANSWERS ✔✔if the
|\ |\ |\ |\ |\ |\ |\ |\ |\
analyst can make the same conclusions from it as from the entire
|\ |\ |\ |\ |\ |\ |\ |\ |\ |\ |\
population of data.
|\ |\ |\
Data-mining algorithms typically are more effective - CORRECT
|\ |\ |\ |\ |\ |\ |\ |\
ANSWERS ✔✔given more data. |\ |\ |\
When obtaining a representative sample, - CORRECT ANSWERS
|\ |\ |\ |\ |\ |\ |\ |\
✔✔it is generally best to include as many variables as possible in
|\ |\ |\ |\ |\ |\ |\ |\ |\ |\ |\
the sample.
|\ |\
After exploring the data with descriptive statistics and
|\ |\ |\ |\ |\ |\ |\ |\
visualization, - CORRECT ANSWERS ✔✔the analyst can eliminate |\ |\ |\ |\ |\ |\ |\ |\
variables that are not of interest.
|\ |\ |\ |\ |\
The data in a data set are often said to be ________before they
|\ |\ |\ |\ |\ |\ |\ |\ |\ |\ |\ |\ |\
have been preprocessed. - CORRECT ANSWERS ✔✔"dirty" and
|\ |\ |\ |\ |\ |\ |\
"raw"
|\