QUESTIONS & CORRECT ANSWERS
Big data - ANS-The collection and analysis of data sets so large and complex that traditional methods
typically brought to bear on the problem would be overwhelmed
Business analytics - ANS-The process of using statistical analysis and modeling to drive business
decisions
Case - ANS-A case is an individual about whom or which we have data
Categorical/qualitative variable - ANS-A variable that names categories is called categorical or
qualitative, whether with words or numerals
Context - ANS-The context ideally tells who was measured, what was measured, how the data were
collected, where the data were collected, and when and why the study was performed
Cross-sectional data - ANS-Data taken from situations that vary over time but measured at a single time
instant are said to be a cross-section of the time series
Data - ANS-Recorded values, whether numbers or labels, together with their context
Data mining - ANS-The process of using a variety of statistical tools to analyze large databases or data
warehouses
Data table - ANS-An arrangement of data in which each row represents a case and each column
represents a variable
Data warehouse - ANS-A large database of information collected by a company or other organization
usually to record transactions that the organization makes, but also used for analysis via data mining
Experimental unit - ANS-An individual in a study for which or for whom data values are recorded. Human
experimental units are usually called subjects or participants
Identifier variable - ANS-A categorical variable that records a unique value for each case, used to
name or identify it
Meta-data - ANS-Auxiliary information about variables in a database, typically including how, when, and
where, and possibly why, the data were collected; who each case represents; and the definition of all
variables
Nominal variable - ANS-The term nominal can be applied to a variable whose values are used only to
name categories
Ordinal variable - ANS-The term ordinal can be applied to a variable whose categorical values possess
some kind of order
Participant - ANS-A human experimental unit, also called a subject
Quantitative variable - ANS-A variable in which the numbers are values of measured quantities with
units
Record - ANS-Information about an individual in a database
Relational database - ANS-A relational database stores and retrieves information. Within the database,
information is kept in data tables that can be related to each other
Respondent - ANS-Someone who answers or responds to a survey
Spreadsheet - ANS-A spreadsheet is a layout designed for accounting that is often used to store and
manage data tables; an example is Excel
Subject - ANS-A human experimental unit, also called a participant
Time series - ANS-Data measured over time. Usually the time intervals are equally or regularly
spaced
Transactional data - ANS-Data collected to record the individual transactions of a company or
organization
Units - ANS-A quantity or amount adopted as a standard of measurement such as dollars, hours, or
grams
Variable - ANS-A variable holds information about the same characteristic for many cases
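
Example (illustrative only) - Several of the terms above (case, variable, data table, identifier variable, units, metadata) can be pictured with a minimal Python sketch. The customer records, field names, and metadata layout below are invented for this example and do not come from any particular database.

```python
# Hypothetical data table: each row (dict) is a case, each key is a variable.
data_table = [
    {"customer_id": "C001", "region": "East", "purchase": 25.50},
    {"customer_id": "C002", "region": "West", "purchase": 40.00},
    {"customer_id": "C003", "region": "East", "purchase": 12.25},
]

# Hypothetical metadata: the kind of each variable and, if quantitative, its units.
metadata = {
    "customer_id": {"kind": "identifier", "units": None},
    "region": {"kind": "categorical", "units": None},
    "purchase": {"kind": "quantitative", "units": "dollars"},
}


def column(table, variable):
    """Return the values of one variable across all cases."""
    return [case[variable] for case in table]


def has_unique_values(table, variable):
    """True if the variable records a different value for every case."""
    values = column(table, variable)
    return len(set(values)) == len(values)


# Averaging only makes sense for a quantitative variable with units.
purchases = column(data_table, "purchase")
mean_purchase = sum(purchases) / len(purchases)
```

Here `has_unique_values(data_table, "customer_id")` is true, which is what we expect of an identifier variable, while `region` repeats values across cases. The kind of each variable is declared in the metadata rather than guessed from the stored type, since a categorical variable may also be recorded with numerals.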
Area principle - ANS-In a statistical display, each data value is represented by the same amount of area
Bar chart/relative frequency bar chart - ANS-A chart that represents the count or percentage of each
category in a categorical variable as a bar, allowing easy visual comparisons across categories
Cell - ANS-Each location in a contingency table, representing the values of two categorical variables, is
called a cell
Column percent - ANS-The proportion of each column contained in the cell of a frequency table
Conditional distribution - ANS-The distribution of a variable restricting the "who" to consider only a
smaller group of individuals
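
Example (illustrative only) - The terms cell, column percent, and conditional distribution can be sketched in a few lines of Python. The (region, plan) cases below are made up for the example, and the helper names `column_percents` and `conditional_distribution` are hypothetical, not part of any library.

```python
from collections import Counter

# Hypothetical cases: one (region, plan) pair per individual.
# Rows of the contingency table are regions, columns are plans.
cases = [
    ("East", "Basic"), ("East", "Premium"), ("East", "Basic"),
    ("West", "Premium"), ("West", "Premium"), ("West", "Basic"),
    ("East", "Basic"), ("West", "Premium"),
]

# Each (row, column) key is a cell; its value is the count of cases in that cell.
cells = Counter(cases)


def column_percents(cells, col):
    """Percent of one column's total that falls in each cell of that column."""
    col_total = sum(n for (_, c), n in cells.items() if c == col)
    return {r: 100 * n / col_total for (r, c), n in cells.items() if c == col}


def conditional_distribution(cases, given_region):
    """Distribution of plan, restricted to the cases in one region."""
    plans = [plan for region, plan in cases if region == given_region]
    counts = Counter(plans)
    return {plan: n / len(plans) for plan, n in counts.items()}


premium_by_region = column_percents(cells, "Premium")
plan_given_east = conditional_distribution(cases, "East")
```

With these made-up cases, the Premium column holds 4 individuals, 1 from East and 3 from West, so its column percents are 25% and 75%. Restricting attention to the 4 East cases gives a conditional distribution of 0.75 Basic and 0.25 Premium.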