100% satisfaction guarantee Immediately available after payment Both online and in PDF No strings attached 4.6 TrustPilot
logo-home
Exam (elaborations)

DSCI 4520 Final Exam Questions with Correct Answers Latest Update 2025/2026

Rating
-
Sold
-
Pages
10
Grade
A+
Uploaded on
28-11-2025
Written in
2025/2026

DSCI 4520 Final Exam Questions with Correct Answers Latest Update 2025/2026 Which statement about the data mining process is INCORRECT? - Answers Data cleaning and pre-processing is usually a trivial step in the process According to the data-driven decision-making technology pyramid shown in the following figure, which statement is FALSE? - Answers The process only moves in one direction (upward) and higher layers never give feedback to the lower layers Which statement is FALSE about the data-driven decision-making approach? - Answers It is loaded with assumptions and theories Which statement about business intelligence workflow is CORRECT? - Answers Data in the operational database is transformed to analytical data in the data warehouse Which of the following is a core idea/task in data mining? - Answers All of the others "Estimating the repair time required for an aircraft based on a trouble ticket." Performing this task in data mining requires an unsupervised learning approach - Answers False ANOVA is an analysis under which of the following data mining task categories? - Answers Data exploration "Learn from the observed records to predict numerical values of unseen records." In data mining this is called.... - Answers Regression Data exploration includes summary statistics, univariate and bivariate analysis, basic statistical test (t-test, correlation), ANOVA, and outlier detection. - Answers True "Identifying segments of similar customers." Performing this task in data mining requires a supervised learning approach. - Answers False Which of the following tasks is an unsupervised learning task? - Answers Grouping customers based on the similarity in their online behavior "Learn from the observed records to predict the class value of unseen records." In data mining, this called... - Answers Classification "Identifying a network data packet as dangerous (virus, hacker attack) based on comparison to other packets whose threat status is known." performing this task in data mining requires a supervised learning approach. - Answers Trure "Automated sorting of mail by zip code scanning." Performing this task in data mining requires an unsupervised learning approach. - Answers True What is the first phase in the CRISP-DM approach for data mining tasks? - Answers business understanding What is the essential element in the machine learning algorithms that distinguish supervised from unsupervised learning? - Answers In the supervised learning models target variable is used in the model, but in the unsupervised learning models there is no target to predict Which of the following tasks is a supervised learning task? - Answers Predicting air pollution "Predicting whether a company will go bankrupt based on comparing its financial data to those of similar bankrupt and non bankrupt firms." Performing this tasks in data mining requires an unsupervised learning approach. - Answers False Which of the following statements is INCORRECT about imputing missing numerical values? - Answers Random generator function is one of the best methods of imputing Which is NOT one of the primary reasons for discretizing numerical variables? - Answers Higher accuracy When data is not uniformly distributed and includes outliers, linear normalization is better than the z-score standardization method. - Answers False Transforming numerical variables means performing mathematical functions on them

Show more Read less
Institution
DSCI 4520
Course
DSCI 4520









Whoops! We can’t load your doc right now. Try again or contact support.

Written for

Institution
DSCI 4520
Course
DSCI 4520

Document information

Uploaded on
November 28, 2025
Number of pages
10
Written in
2025/2026
Type
Exam (elaborations)
Contains
Questions & answers

Subjects

Content preview

DSCI 4520 Final Exam Questions with Correct Answers Latest Update 2025/2026

Which statement about the data mining process is INCORRECT? - Answers Data cleaning and
pre-processing is usually a trivial step in the process

According to the data-driven decision-making technology pyramid shown in the following figure,
which statement is FALSE? - Answers The process only moves in one direction (upward) and
higher layers never give feedback to the lower layers

Which statement is FALSE about the data-driven decision-making approach? - Answers It is
loaded with assumptions and theories

Which statement about business intelligence workflow is CORRECT? - Answers Data in the
operational database is transformed to analytical data in the data warehouse

Which of the following is a core idea/task in data mining? - Answers All of the others

"Estimating the repair time required for an aircraft based on a trouble ticket."

Performing this task in data mining requires an unsupervised learning approach - Answers False

ANOVA is an analysis under which of the following data mining task categories? - Answers Data
exploration

"Learn from the observed records to predict numerical values of unseen records."

In data mining this is called.... - Answers Regression

Data exploration includes summary statistics, univariate and bivariate analysis, basic statistical
test (t-test, correlation), ANOVA, and outlier detection. - Answers True

"Identifying segments of similar customers."

Performing this task in data mining requires a supervised learning approach. - Answers False

Which of the following tasks is an unsupervised learning task? - Answers Grouping customers
based on the similarity in their online behavior

"Learn from the observed records to predict the class value of unseen records."

In data mining, this called... - Answers Classification

"Identifying a network data packet as dangerous (virus, hacker attack) based on comparison to
other packets whose threat status is known."

performing this task in data mining requires a supervised learning approach. - Answers Trure

"Automated sorting of mail by zip code scanning."

, Performing this task in data mining requires an unsupervised learning approach. - Answers True

What is the first phase in the CRISP-DM approach for data mining tasks? - Answers business
understanding

What is the essential element in the machine learning algorithms that distinguish supervised
from unsupervised learning? - Answers In the supervised learning models target variable is used
in the model, but in the unsupervised learning models there is no target to predict

Which of the following tasks is a supervised learning task? - Answers Predicting air pollution

"Predicting whether a company will go bankrupt based on comparing its financial data to those
of similar bankrupt and non bankrupt firms."

Performing this tasks in data mining requires an unsupervised learning approach. - Answers
False

Which of the following statements is INCORRECT about imputing missing numerical values? -
Answers Random generator function is one of the best methods of imputing

Which is NOT one of the primary reasons for discretizing numerical variables? - Answers Higher
accuracy

When data is not uniformly distributed and includes outliers, linear normalization is better than
the z-score standardization method. - Answers False

Transforming numerical variables means performing mathematical functions on them and
creating new variables that are better suited for our data mining model. - Answers True

IN practice, data preprocessing takes a significant portion of data mining projects. - Answers
True

Which of the following is NOT a step in data pre-processing? - Answers Data modeling

Which of the following statements is INCORRECT about the missing values in a data set? -
Answers The best strategy is always to drop records with any missing values

Which of the following tasks is NOT included in the data preprocessing phase? - Answers
Performance Evaluation

The data dictionary is meta-data, which is data about data. - Answers True

In the data preparation step, normalizing numerical data is a popular method to transform
variables into a more suitable scale for modeling. - Answers True

In statistics and data mining, "a statistical measure of the strength of the relationship between
the relative changes of 2 variables" is called... - Answers Correlation Coefficient

Get to know the seller

Seller avatar
Reputation scores are based on the amount of documents a seller has sold for a fee and the reviews they have received for those documents. There are three levels: Bronze, Silver and Gold. The better the reputation, the more your can rely on the quality of the sellers work.
joshuawesonga22 Liberty University
View profile
Follow You need to be logged in order to follow users or courses
Sold
36
Member since
8 months
Number of followers
1
Documents
11104
Last sold
22 hours ago
Tutor Wes

Hi there! I'm Tutor Wes, a dedicated tutor with a passion for sharing knowledge and helping others succeed academically. All my notes are carefully organized, detailed, and easy to understand. Whether you're preparing for exams, catching up on lectures, or looking for clear summaries, you'll find useful study materials here. Let’s succeed together!

3.3

3 reviews

5
1
4
0
3
1
2
1
1
0

Why students choose Stuvia

Created by fellow students, verified by reviews

Quality you can trust: written by students who passed their tests and reviewed by others who've used these notes.

Didn't get what you expected? Choose another document

No worries! You can instantly pick a different document that better fits what you're looking for.

Pay as you like, start learning right away

No subscription, no commitments. Pay the way you're used to via credit card and download your PDF document instantly.

Student with book image

“Bought, downloaded, and aced it. It really can be that simple.”

Alisha Student

Frequently asked questions