2023 Verified
Which of the following describes the standard deviation?
It is the square root of the variance.
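As a quick check of this relationship, here is a minimal sketch using Python's standard `statistics` module. The data values are made up for illustration, and the population (not sample) formulas are assumed.

```python
import math
import statistics

# Made-up illustrative data; not taken from the quiz.
data = [2, 4, 4, 4, 5, 5, 7, 9]

variance = statistics.pvariance(data)  # population variance
std_dev = statistics.pstdev(data)      # population standard deviation

# The standard deviation should equal the square root of the variance.
print(variance, std_dev, math.sqrt(variance))
```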
When two variables are highly positively correlated, the correlation coefficient
will be _______.
close to 1
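To see what "close to 1" looks like in practice, the sketch below computes the Pearson correlation coefficient by hand. The function name `pearson_r` and the data are hypothetical, chosen so that y rises almost linearly with x.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-products and sums of squared deviations.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

# Made-up data with a strong positive linear relationship.
x = [1, 2, 3, 4, 5]
y = [2.1, 3.9, 6.2, 8.0, 9.8]

r = pearson_r(x, y)
print(round(r, 4))
```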
In statistical notation, what does ∑ stand for?
the summation operator
The ________ is the observation that occurs most frequently.
mode
The difference between the first and third quartiles is referred to as the
____________.
interquartile range
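A small sketch of the calculation, assuming Python 3.8+ and the default "exclusive" method of `statistics.quantiles`. Other quartile conventions can give slightly different cut points, and the data are made up.

```python
import statistics

# Made-up, already sorted data.
data = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11]

# n=4 returns the three quartile cut points Q1, Q2 (the median), and Q3.
q1, q2, q3 = statistics.quantiles(data, n=4)

iqr = q3 - q1  # interquartile range: spread of the middle 50% of the data
print(q1, q3, iqr)
```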
Which of the following is an example of a measure of dispersion?
variance
Which of the following describes a positively skewed histogram?
a histogram that tails off toward the right
Which of the following is true for a median?
For an even number of observations, the median is the mean of the two middle
numbers
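For instance, with four made-up observations the two middle values after sorting are 2 and 3, so the median is their mean. A minimal sketch with the standard library:

```python
import statistics

# Made-up data with an even number of observations.
observations = [3, 1, 4, 2]

# Sorted: [1, 2, 3, 4]; the two middle numbers are 2 and 3.
middle_pair = sorted(observations)[1:3]
median_value = statistics.median(observations)

print(middle_pair, median_value)
```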
Which of the following is an example of a sample?
The number of IT employees out of all employees working in an office of Google
For a normal distribution, the mean is _______ to the median.
equal
When the sample size increases, the ________.
confidence interval narrows (its width decreases)
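The effect of sample size can be sketched with the usual z-based interval for a mean, whose half-width is z * sigma / sqrt(n). The helper name `ci_half_width`, the value sigma = 12, and the 95% critical value z = 1.96 are assumptions for illustration only.

```python
import math

def ci_half_width(sigma, n, z=1.96):
    """Half-width of a z-based confidence interval for the mean."""
    return z * sigma / math.sqrt(n)

sigma = 12.0  # assumed population standard deviation

# Full interval width for increasing sample sizes.
widths = {n: 2 * ci_half_width(sigma, n) for n in (16, 64, 256)}
for n, width in widths.items():
    print(n, round(width, 2))
```

Because the width scales with 1 / sqrt(n), quadrupling the sample size halves the interval width.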
Which of the following propositions describes an existing theory or belief?
Null hypothesis
Which of the following is a Type-I error?
The null hypothesis is actually true, but the hypothesis test incorrectly rejects it.
In order to reject the null hypothesis, the p-value must be less than the
________.
alpha (the significance level)
What is the confidence level when the level of significance is 0.07?
0.93
The WPC Sports Company has noted that the size of an individual customer order
is normally distributed with a mean of $100 and a standard deviation of $12. If a
soccer team of 16 players were to make the next batch of orders, what would be
the standard error of the mean?
3.00
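The answer follows from the formula for the standard error of the mean, sigma / sqrt(n). A minimal sketch using the numbers given in the question (the variable names are my own):

```python
import math

sigma = 12.0  # population standard deviation of an order, in dollars
n = 16        # number of orders in the batch (one per player)

# Standard error of the mean: sigma divided by the square root of n.
standard_error = sigma / math.sqrt(n)
print(standard_error)
```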
You are collecting data via an online survey to improve educational standards at
ASU. Which of the following methods will not result in data collection bias?
Anonymous data collection that hides the ASU brand in the survey questions.
The central limit theorem states that if the population is normally distributed, then
the
Sampling distribution of the mean will also be normal for any sample size
Which of the following is a continuous random variable?
The time to complete a specific task
Which of the following is a difference between the t-distribution and the standard
normal (z) distribution?
The t-distribution has a larger variance than the standard normal distribution.
In classification analysis, we typically split the data into two mutually exclusive
sets, known as ________, to investigate the strength of the developed model.
Training and validation/testing
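One way to produce two mutually exclusive sets is to shuffle the rows and cut the list once. The sketch below uses only the standard library. The function name `split_rows`, the 30% holdout fraction, and the seed are assumptions for illustration, not a prescribed procedure.

```python
import random

def split_rows(rows, holdout_fraction=0.3, seed=0):
    """Shuffle rows and split them into disjoint training and holdout sets."""
    shuffled = list(rows)                  # copy so the input is not modified
    random.Random(seed).shuffle(shuffled)  # seeded for reproducibility
    n_holdout = int(round(len(shuffled) * holdout_fraction))
    return shuffled[n_holdout:], shuffled[:n_holdout]

rows = list(range(10))  # stand-in for ten observations
train, holdout = split_rows(rows)
print(len(train), len(holdout))
```

The model is fitted on `train` only, and `holdout` is kept aside to estimate how well it performs on unseen data.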
In logistic regression analysis, instead of Y as a dependent variable, we use a
function of Y called ________.
Logit
In classification problems, the primary source for accuracy estimation of the
model is ________.
Confusion matrix
The ________ is often used to describe the performance of a classification model
applied to a set of test data for which the true outcomes are known.
Confusion matrix
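For a binary classifier, the confusion matrix is just a count of the four possible (actual, predicted) outcomes. The sketch below tallies them for made-up labels. The function name `confusion_counts` and the label coding (1 = positive, 0 = negative) are assumptions.

```python
from collections import Counter

def confusion_counts(actual, predicted, positive=1):
    """Count true/false positives and negatives for binary labels."""
    counts = Counter(TP=0, FP=0, FN=0, TN=0)
    for a, p in zip(actual, predicted):
        if p == positive:
            counts["TP" if a == positive else "FP"] += 1
        else:
            counts["FN" if a == positive else "TN"] += 1
    return counts

# Made-up test-set labels and model predictions.
actual = [1, 0, 1, 1, 0, 0, 1, 0]
predicted = [1, 0, 0, 1, 0, 1, 1, 0]

cm = confusion_counts(actual, predicted)
accuracy = (cm["TP"] + cm["TN"]) / len(actual)
print(dict(cm), accuracy)
```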
In classification analysis, we are determining the probability of an observation
________.
To be part of a certain class or not
In logistic regression, the dependent variable y is defined as:
log(p/(1-p))
The odds (sometimes loosely called the odds ratio) are defined as ________, where p is the probability of success.
p/(1-p)
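The odds and the logit can be sketched in a few lines. The parentheses around 1 - p matter: written without them, `p / 1 - p` is parsed as `(p / 1) - p`, which is always 0. The function names and the value p = 0.8 are illustrative assumptions.

```python
import math

def odds(p):
    """Odds of success for a probability p with 0 < p < 1."""
    return p / (1 - p)

def logit(p):
    """Log-odds, the function of Y modeled by logistic regression."""
    return math.log(odds(p))

p = 0.8
print(odds(p), logit(p))

# Precedence pitfall: without parentheses the expression collapses to zero.
wrong = p / 1 - p
print(wrong)
```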
Logistic regression is a specialized type of regression analysis that is designed
to predict ________ variables.
a binary categorical
If you want to find out if body weight, calorie intake, fat intake and age have an
influence on the probability of having a heart attack (yes or no), which of the
following kinds of analysis will help determine the answer?
Multiple logistic regression
A loan officer wants to know if the next customer is likely to default or not on a
loan. How can she assess the risk of extending the loan to that customer?
By utilizing a multiple logistic regression model developed by an in-house analyst
Which of the following is true about interpretation?
We only need to interpret our finding from a regression analysis.
We don't need to revisit our original question once we have performed an interpretation.
Interpretation only needs to be done when we have finished all our analytics tasks.
None of the answers are correct.
Which of the following statements is false with regard to interpretation?