DESCRIPTIVE STATISTICS EXAMINATION
Statistics is the branch of mathematics that focuses - answer-On analysis
and interpretation of data.
Distributions are a way to visualize the shape of your data, how - answer-
Descriptive statistics is a subset within statistics that allows - answer-You to
describe (or explain) and summarize the data you collect and then present it
in an efficient and meaningful way.
Descriptive statistics allows you to - answer-Organize, display, and describe
data using data visualizations, which can turn difficult-to-understand
variables across a large dataset into bite-sized descriptions.
Descriptive statistics will allow you to - answer-Quantitatively and
qualitatively describe the main features of a collection of information, such
as where the middle of the data is, how spread out it is, and where clumps of
values or anomalies may exist.
A single value from a dataset can carry plenty of meaning, but it can mean
even more when - answer-Compared to a larger dataset (such as your math
test score when compared to the scores of the entire class).
Measures of central tendency are used to - answer-Indicate and describe the
central position of a group of data. These measures are important because
they help you condense data, find representative values, make comparisons,
and perform further statistical analysis.
The three measures of central tendency are - answer-mean, median, and
mode.
Mean - answer-(often referred to as average) is the sum of all values divided
by the number of values in the dataset.
Median - answer-Is the middle value that separates the higher half from the
lower half of a dataset.
Mode - answer-Is the value in a dataset that appears most frequently.
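As a quick illustration of the three measures defined above, here is a minimal sketch using Python's standard `statistics` module. The `scores` list is made up for the example and is not data from this guide.

```python
import statistics

# Hypothetical test scores, invented for illustration only.
scores = [70, 85, 85, 90, 100]

mean_score = statistics.mean(scores)      # sum of all values / number of values
median_score = statistics.median(scores)  # middle value of the sorted list
mode_score = statistics.mode(scores)      # value that appears most frequently

print("mean:", mean_score)
print("median:", median_score)
print("mode:", mode_score)
```

Here the median and the mode are both 85, while the mean is slightly higher because it takes every score into account.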
The mean is the most popular measure of central tendency because - answer-It uses
every data point in its calculation. Mean is calculated using the following
formula:
Mean = Sum of All Values / Number of All Values
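The formula above can be written out directly in code. This is only a sketch: the function name `mean` and the `quiz_scores` values are assumptions chosen for the example.

```python
def mean(values):
    """Return the sum of all values divided by the number of values."""
    if not values:
        # The formula is undefined for an empty dataset (division by zero).
        raise ValueError("mean requires at least one value")
    return sum(values) / len(values)


# Hypothetical quiz scores, invented for illustration only.
quiz_scores = [4, 8, 6, 10]

# (4 + 8 + 6 + 10) / 4 = 28 / 4
result = mean(quiz_scores)
print(result)
```

Every value contributes to the sum, which is why the mean uses every data point in its calculation.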
An outlier is a data point that - answer-Sticks out from the rest. It does not fit
in a trend that the other data shows, or it falls outside (above/below) a range
of values in which we would expect the data to fall.
Outliers can easily do what to data? - answer-Skew the data.
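To see how a single outlier can skew a result, the sketch below compares the mean of a small salary list before and after one very large salary is added. All figures are hypothetical. The median is shown alongside for comparison, since it moves far less.

```python
import statistics

# Hypothetical salaries, invented for illustration only.
salaries = [40_000, 42_000, 45_000, 48_000, 50_000]

# The same list with one very large salary added as an outlier.
with_outlier = salaries + [1_000_000]

mean_before = statistics.mean(salaries)
mean_after = statistics.mean(with_outlier)
median_before = statistics.median(salaries)
median_after = statistics.median(with_outlier)

print("mean without outlier:", mean_before)
print("mean with outlier:", round(mean_after, 2))
print("median without outlier:", median_before)
print("median with outlier:", median_after)
```

One extra value pulls the mean from 45,000 to more than 200,000, even though five of the six salaries are at or below 50,000.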
Although outliers skew the data, they can also tell analysts a lot of
information. As an analyst, you should always question - answer-Why the
outlier is in the data:
Could it be an error from when the data was collected?
Maybe it really is an employee's salary — for example, the CEO's?
Sometimes outliers can be explained, and sometimes they occur by chance.
Other times, they may require further investigation. As an analyst, you will
need to - answer-Ask questions when investigating why an outlier might be
in your data. Most of the time, these outliers can give you insight into your
data.
Outliers are important to investigate, but how do they fit into the larger
process of data analysis? - answer-In one instance, they can exist in the
initial data given to you. In this case, they should be cleaned or dealt with
accordingly when you've reached that step in the data wrangling checklist.
In other cases, they appear in your results at the end of your analysis. In
these instances, they do not - answer-Get cleaned and instead become part
of your findings, which turn into inferences and insights.
There is no function in Google Sheets called "mean." However, there is an -
answer-AVERAGE function that has the same functionality. You will use the
AVERAGE function when finding the mean of a set of values. Below is the
Google Sheets documentation for reference.
The median is the middle value in a list of values. It is very important that
the list of values is - answer-Sorted from least to greatest, or greatest to
least, when calculating the median by hand.
Steps to calculate the median: - answer-Order your list of values in
ascending order.
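The guide lists only the first step here (ordering the values). The sketch below fills in the remaining steps from the standard definition of the median given earlier: take the middle value of the sorted list, or the average of the two middle values when the count is even. The function name `median_by_hand` and the sample lists are assumptions made for the example.

```python
def median_by_hand(values):
    """Compute the median by sorting, then picking the middle value(s)."""
    if not values:
        raise ValueError("median requires at least one value")

    # Step 1: order the list of values in ascending order.
    ordered = sorted(values)

    # Step 2: locate the middle position.
    n = len(ordered)
    mid = n // 2

    # Step 3: odd count -> the single middle value;
    # even count -> the average of the two middle values.
    if n % 2 == 1:
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2


# Hypothetical inputs, invented for illustration only.
odd_result = median_by_hand([7, 1, 5])      # sorted: 1, 5, 7
even_result = median_by_hand([8, 2, 6, 4])  # sorted: 2, 4, 6, 8

print(odd_result)
print(even_result)
```

Skipping the sort would return whatever value happens to sit in the middle of the unsorted list, which is why the ordering step matters.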