WITH COMPLETE SOLUTIONS
Which of the following describes the standard deviation? - Answer-It is the square root
of the variance.
When two variables are highly positively correlated, the correlation coefficient will be
_______. - Answer-close to 1
According to statistical notation, what does ∑ stand for? - Answer-to act as a summation
operator
The ________ is the observation that occurs most frequently. - Answer-mode
The difference between the first and third quartiles is referred to as the ____________. -
Answer-interquartile range
Which of the following is an example of a measure of dispersion? - Answer-variance
Which of the following describes a positively skewed histogram? - Answer-a histogram
that tails off toward the right
Which of the following is true for a median? - Answer-For an even number of
observations, the median is the mean of the two middle numbers
Which of the following is an example of a sample? - Answer-The number of IT
employees out of all employees working in an office of Google
For a normal distribution mean is _______ to median. - Answer-equal
When sample size increases
Correct - Answer-Confidence interval decreases
Which of the following proposition describes an existing theory or belief? - Answer-Null
hypothesis
Which of the following is a Type-I error? - Answer-The null hypothesis is actually true,
but the hypothesis test incorrectly rejects it.
In order to reject the null hypothesis, the p-value must be less than the - Answer-Alpha
What is the confidence interval when the level of significance is 0.07? - Answer-0.930
, The WPC Sports Company has noted that the size of individual "customer order" is
normally distributed with a mean of $100 and standard deviation of $12. If a soccer
team of 16 players were to make the next batch of orders, what would be the standard
error of the mean? - Answer-3.00
You are collecting data via an online survey to improve education standard at ASU.
Which of the following methods will not result in data collection bias? - Answer-
Anonymously data collection by hiding ASU brand in the survey question.
The central limit theorem states that if the population is normally distributed, then the -
Answer-Sampling distribution of the mean will also be normal for any sample size
Which of the following is a continuous random variable? - Answer-The time to complete
a specific task
Which of the following is a difference between the t-distribution and the standard normal
(z) distribution? - Answer-The t-distribution has a larger variance than the standard
normal distribution.
In classification analysis, we typically split the data into two mutually exclusive sets,
known as ________, to investigate the strength of the developed model. - Answer-
Training and validation/testing
In logistic regression analysis, instead of Y as a dependent variable, we use a function
of Y called ________. - Answer-Logit
In classification problems, the primary source for accuracy estimation of the model is
________. - Answer-Confusion matrix
The ________ is often used to describe the performance of a classification model
applied to a set of test data for which the true outcomes are known. - Answer-Confusion
matrix
In classification analysis, we are determining the probability of an observation
________. - Answer-To be part of a certain class or not
In logistic regression, the dependent variable y is defined as: - Answer-Log (p/1-p)
Odds ratio is defined as ________, where p is the probability of success. - Answer-p/1-p
Logistic regression is a specialized type of regression analysis that is designed to
predict ________ variables. - Answer-a binary categorical
If you want to find out if body weight, calorie intake, fat intake and age have an influence
on the probability of having a heart attack (yes or no), which of the following kind of
analysis will help determine the answer? - Answer-Multiple logistic regression