ANALYTICS M12 CORRECT 100%
The Assertion-Evidence Approach - ANSWER: An analytic presentation approach built
around messages rather than topics and around supporting visual evidence rather than bullets
analysis set - ANSWER: A sample of data used in the development of a predictive model;
also referred to as the training dataset
validation set - ANSWER: A sample of data held back from the training dataset that is
used to evaluate the skill of the model while tuning hyperparameters.
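A minimal sketch of how an analysis set and a validation set might be separated, assuming a simple random split. The function name `split_analysis_validation`, the 30% hold-back, and the seed are illustrative assumptions, not values from these notes.

```python
import random

def split_analysis_validation(rows, validation_fraction=0.3, seed=42):
    """Shuffle the rows and hold back a fraction as the validation set.

    The 0.3 fraction and the fixed seed are assumptions for illustration.
    """
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    n_validation = int(round(len(shuffled) * validation_fraction))
    validation_set = shuffled[:n_validation]   # held back for evaluation
    analysis_set = shuffled[n_validation:]     # used to develop the model
    return analysis_set, validation_set

rows = list(range(10))
analysis_set, validation_set = split_analysis_validation(rows)
print(len(analysis_set), len(validation_set))
```

The two sets never share a row, so the validation set gives a view of model skill on data the model was not developed on.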
cumulative gains chart - ANSWER: A measure of the effectiveness of a predictive model
showing the cumulative "yes" responses versus the cumulative number of cases.
lift chart - ANSWER: A measure of the effectiveness of a predictive model calculated as
the ratio between the results obtained with and without the predictive model. Derived
from the gains chart.
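The two charts above can be sketched numerically. This is an illustrative calculation under stated assumptions: cases are ranked by a model score from highest to lowest, responses are coded 1 for "yes" and 0 for "no", and the sample scores and responses are made up for the example.

```python
def cumulative_gains(scores, responses):
    """Share of all "yes" responses captured after each case,
    with cases sorted from highest to lowest model score."""
    ranked = sorted(zip(scores, responses), key=lambda pair: pair[0], reverse=True)
    total_yes = sum(responses)
    gains = []
    running_yes = 0
    for _, response in ranked:
        running_yes += response
        gains.append(running_yes / total_yes)
    return gains

def lift(gains):
    """Lift at each depth: gain with the model divided by the gain
    expected without it (the share of cases contacted so far)."""
    n = len(gains)
    return [gain / ((i + 1) / n) for i, gain in enumerate(gains)]

# Hypothetical data: 10 cases, 4 of which responded "yes".
scores = [0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2, 0.1, 0.05]
responses = [1, 1, 0, 1, 0, 0, 1, 0, 0, 0]

gains = cumulative_gains(scores, responses)
lifts = lift(gains)
print(gains[1], lifts[1])
```

In this made-up sample, the top 20% of cases capture half of the "yes" responses, so lift at that depth is 2.5; lift always falls to 1 once every case is included.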
response rate - ANSWER: The number of observations meeting a defined criterion divided
by the number of observations in the sample.
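As a small worked example of the response rate, assuming the criterion is supplied as a function. The name `response_rate` and the sample data are hypothetical.

```python
def response_rate(observations, meets_criterion):
    """Observations meeting the criterion divided by all observations."""
    matching = sum(1 for obs in observations if meets_criterion(obs))
    return matching / len(observations)

# Hypothetical sample: did each contacted customer respond?
responded = [True, False, False, True, False, False, False, True, False, False]
rate = response_rate(responded, lambda obs: obs)
print(rate)
```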
data table - ANSWER: An arrangement of data in which each row represents a case and
each column represents a variable
relational database - ANSWER: A group of database tables that is connected or linked by
a defined relationship that ties the information together.
Structured Query Language (SQL) - ANSWER: A language used to create and
manipulate relational databases
frequency - ANSWER: The total number of purchases over a given time period
monetary - ANSWER: The total amount of currency used for purchases over a given time
period
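A sketch of how frequency and monetary values might be tallied per customer, assuming the purchases have already been filtered to the time period of interest. The customer names and amounts are invented for illustration.

```python
from collections import defaultdict

# Hypothetical purchases within one time period: (customer, amount)
purchases = [
    ("alice", 20.0),
    ("bob", 5.0),
    ("alice", 35.0),
    ("alice", 10.0),
    ("bob", 15.0),
]

frequency = defaultdict(int)    # number of purchases per customer
monetary = defaultdict(float)   # total amount spent per customer
for customer, amount in purchases:
    frequency[customer] += 1
    monetary[customer] += amount

print(dict(frequency), dict(monetary))
```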
index score - ANSWER: A composite statistic that aggregates multiple indicators. A
measure of changes in a representative group of individual data points.
z-score - ANSWER: The number of standard deviations a data point lies from the mean.
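The z-score can be written as z = (x - mean) / standard deviation. A minimal sketch using Python's `statistics` module follows; using the population standard deviation (`pstdev`) rather than the sample version is an assumption made for this example.

```python
import statistics

def z_score(value, data):
    """Number of standard deviations `value` lies from the mean of `data`."""
    mean = statistics.mean(data)
    std_dev = statistics.pstdev(data)  # population std dev (an assumption)
    return (value - mean) / std_dev

data = [2, 4, 4, 4, 5, 5, 7, 9]   # mean 5, population std dev 2
z = z_score(9, data)
print(z)
```

Here the value 9 sits two standard deviations above the mean, so its z-score is 2.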
CHAID - ANSWER: (Chi-square Automatic Interaction Detector) A tool used to discover
the relationship between variables and help determine how best to merge them to
explain a dependent variable outcome.
clustering - ANSWER: The grouping of a particular set of objects based on their
characteristics, aggregating them according to their similarities
r-squared - ANSWER: A statistical measure of how close the data are to the fitted
regression line.
adjusted r-squared - ANSWER: A modified version of R-squared that is adjusted for the
number of predictors in the model.
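Both measures can be sketched with the standard formulas R² = 1 - SS_residual / SS_total and adjusted R² = 1 - (1 - R²)(n - 1) / (n - k - 1), where n is the number of observations and k the number of predictors. The data below are made up for illustration.

```python
def r_squared(actual, predicted):
    """1 minus the ratio of residual to total sum of squares."""
    mean_actual = sum(actual) / len(actual)
    ss_residual = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_total = sum((a - mean_actual) ** 2 for a in actual)
    return 1 - ss_residual / ss_total

def adjusted_r_squared(r2, n_observations, n_predictors):
    """Penalize R-squared for the number of predictors in the model."""
    return 1 - (1 - r2) * (n_observations - 1) / (n_observations - n_predictors - 1)

# Hypothetical observed values and model predictions
actual = [1.0, 2.0, 3.0, 4.0, 5.0]
predicted = [1.1, 1.9, 3.2, 3.8, 5.0]

r2 = r_squared(actual, predicted)
adj_r2 = adjusted_r_squared(r2, len(actual), n_predictors=1)
print(round(r2, 4), round(adj_r2, 4))
```

Adjusted R-squared is never larger than R-squared, and the gap widens as predictors are added without a matching improvement in fit.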
upper limit - ANSWER: The upper bound of the confidence interval for a coefficient estimate in a regression model
lower limit - ANSWER: The lower bound of the confidence interval for a coefficient estimate in a regression model
p-value - ANSWER: The probability level that forms the basis for deciding whether results are
statistically significant (unlikely to be due to chance alone).
dependent variable - ANSWER: A variable (often denoted by y) whose value depends on
that of another.
independent variable - ANSWER: A variable (often denoted by x) whose variation does
not depend on that of another.
slope - ANSWER: A number that describes both the direction and the steepness of a
line
regression coefficient - ANSWER: The slope of the regression line: the change expected
in the dependent variable if the independent variable changes by one unit.
intercept - ANSWER: The distance from the origin to the point at which a curve or line
intersects an axis, generally the Y-axis
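For a simple regression line y = a + b·x, the slope b and the intercept a can be estimated by ordinary least squares. The sketch below assumes a single independent variable; `fit_line` is an illustrative name and the data are invented. In this form the intercept is the predicted value of y when x is 0.

```python
def fit_line(x_values, y_values):
    """Ordinary least squares for one independent variable.

    Returns (slope, intercept) of the fitted line y = intercept + slope * x.
    """
    n = len(x_values)
    mean_x = sum(x_values) / n
    mean_y = sum(y_values) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(x_values, y_values))
    variance_x = sum((x - mean_x) ** 2 for x in x_values)
    slope = covariance / variance_x          # change in y per one-unit change in x
    intercept = mean_y - slope * mean_x      # predicted y when x is 0
    return slope, intercept

# Hypothetical data lying exactly on y = 1 + 2x
x_values = [1, 2, 3, 4, 5]
y_values = [3, 5, 7, 9, 11]
slope, intercept = fit_line(x_values, y_values)
print(slope, intercept)
```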
association rules - ANSWER: If-then statements that help to show the probability of
relationships between data items within large data sets
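An association rule such as "if bread, then milk" is commonly scored by its support and confidence. Those two metrics are not named in these notes, so treat them as an assumption introduced for illustration; the transactions are made up.

```python
# Hypothetical market-basket transactions
transactions = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
]

def support(itemset):
    """Share of transactions that contain every item in `itemset`."""
    return sum(1 for t in transactions if itemset <= t) / len(transactions)

def confidence(antecedent, consequent):
    """Estimated probability of the consequent given the antecedent."""
    return support(antecedent | consequent) / support(antecedent)

rule_support = support({"bread", "milk"})
rule_confidence = confidence({"bread"}, {"milk"})
print(rule_support, round(rule_confidence, 3))
```

In this sample, bread appears in three of four baskets and bread with milk in two, so the rule "if bread, then milk" holds in two out of three bread baskets.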
correlation - ANSWER: A statistical measure of the relationship between two or
more variables
strong relationship - ANSWER: A statistical result indicating that two variables potentially
influence each other
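One common way to quantify correlation between two variables is the Pearson coefficient, which ranges from -1 to +1; values near either extreme indicate a strong linear relationship. Choosing Pearson here is an assumption, since the notes do not name a specific measure, and the data are invented.

```python
import math

def pearson_correlation(x_values, y_values):
    """Pearson correlation coefficient between two equal-length samples."""
    n = len(x_values)
    mean_x = sum(x_values) / n
    mean_y = sum(y_values) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(x_values, y_values))
    variance_x = sum((x - mean_x) ** 2 for x in x_values)
    variance_y = sum((y - mean_y) ** 2 for y in y_values)
    return covariance / math.sqrt(variance_x * variance_y)

# Hypothetical paired observations
x_values = [1, 2, 3, 4, 5]
y_values = [2, 4, 5, 4, 5]
r = pearson_correlation(x_values, y_values)
print(round(r, 4))
```

A high coefficient shows that two variables move together; on its own it does not establish that one causes the other.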
SELECT clause - ANSWER: The clause that specifies the columns to retrieve in the SQL
query
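To see a SELECT clause at work against a small relational database, the sketch below uses Python's built-in `sqlite3` module with an in-memory database. The table names, column names, and rows are hypothetical; the two tables are linked through `customer_id`.

```python
import sqlite3

connection = sqlite3.connect(":memory:")

# Two tables tied together by a defined relationship (customer_id -> id).
connection.execute("CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT)")
connection.execute(
    "CREATE TABLE purchases (customer_id INTEGER REFERENCES customers(id), amount REAL)"
)
connection.executemany(
    "INSERT INTO customers VALUES (?, ?)", [(1, "alice"), (2, "bob")]
)
connection.executemany(
    "INSERT INTO purchases VALUES (?, ?)", [(1, 20.0), (1, 35.0), (2, 5.0)]
)

# The SELECT clause lists the columns to retrieve: a name and a computed total.
rows = connection.execute(
    """
    SELECT customers.name, SUM(purchases.amount) AS total
    FROM customers
    JOIN purchases ON purchases.customer_id = customers.id
    GROUP BY customers.name
    ORDER BY customers.name
    """
).fetchall()
connection.close()

print(rows)
```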