100% satisfaction guarantee Immediately available after payment Both online and in PDF No strings attached 4.2 TrustPilot
logo-home
Exam (elaborations)

DESCRIPTIVE STATISTICS

Rating
-
Sold
-
Pages
15
Grade
A+
Uploaded on
03-09-2024
Written in
2024/2025

Statistics is the branch of mathematics that focuses - answer-On analysis and interpretation of data. Distributions are a way to visualize the shape of your data, how - answer- Descriptive statistics is a subset within statistics that allows - answer-You to describe (or explain) and summarize the data you collect and then present it in an efficient and meaningful way. Descriptive statistics allows you to - answer-Organize, display, and describe data using data visualizations, which can turn difficult-to-understand variables across a large dataset into bite-sized descriptions. Descriptive statistics will allow you to - answer-Quantitatively and qualitatively describe the main features of a collection of information, such as where the middle of the data is, how spread out it is, and where clumps of values or anomalies may exist. A single value from a dataset can carry plenty of meaning, but it can mean even more when - answer-Compared to a larger dataset (such as your math test score when compared to the scores of the entire class). Measures of central tendency are used to - answer-Indicate and describe the central position of a group of data. These measures are important because they help you condense data, find representative values, make comparisons, and perform further statistical analysis. The three measures of central tendency are - answer-mean, median, and mode. Mean - answer-(often referred to as average) is the sum of all values divided by the number of values in the dataset. Median - answer-Is the middle value that separates the higher half from the lower half of a dataset. Mode - answer-Is the value in a dataset that appears most frequently. It is the most popular measure of central tendency because - answer-It uses every data point in its calculation. Mean is calculated using the following formula: Mean=Sum of All Values/Number of All Values An outlier is a data point that - answer-Sticks out from the rest. It does not fit in a trend that the other data shows, or it falls outside (above/below) a range of values in which we would expect the data to fall. Outliers easily does what to data - answer-Skew the data. Although outliers skew the data, they can also tell analysts a lot of information. As an analyst, you should always question - answer-Why the outlier is in the data: Could it be an error from when the data was collected? Maybe it really is an employee's salary — for example, the CEO's? Sometimes outliers can be explained, and sometimes they occur by chance. Other times, they may require further investigation. As an analyst, you will need to - answer-Ask questions when investigating why an outlier might be in your data. Most of the time, these outliers can give you insight into your data. Outliers are important to investigate, but how do they fit into the larger process of data analysis? - answer-In one instance, they can exist in the initial data given to you. In this case, they should be cleaned or dealt with accordingly when you've reached that step in the data wrangling checklist In other cases, they appear in your results at the end of your analysis. In these instances, they do not - answer-Get cleaned and instead become part of your findings, which turn into inferences and insights. There is no function in Google Sheets called "mean." However, there is an - answer-AVERAGE function that has the same functionality. You will use the AVERAGE function when finding the mean of a set of values. Below is the Google Sheets documentation for reference. The median is the middle value in a list of values. It is very important that the list of values is - answer-Sorted from least to greatest, or greatest to least, when calculating the median by hand. Steps to calculate the median: - answer-Order your list of values in ascending order. Count the number of values in your list and determine if the number of values you have is odd or even. HOW TO FIND THE MEDIAN WITH AN ODD SET OF NUMBERS: : - answer-If your list of values is an odd number, then you simply need to find the value that falls in the exact middle of the sorted list. If you have a large number of values, use a formula below to calculate the middle position. Median=n + 12Median=n + 12 In this formula, n represents the number of values in the list. HOW TO FIND THE MEDIAN WITH AN EVEN SET OF NUMBERS: - answer-If your list of values is an even number, you will need to take an extra step to find the mean of the two middle values. The mean of the two middle values ($42,000 and $43,000) will be the median of an even-numbered dataset. Median vs. Mean - answer-Unlike the mean, the median is less affected by outliers and skewed data. This is because the median only uses the central value(s) for its calculation, which signifies that the only impact an outlier can have is in shifting your median number over by one position. When you are working with thousands of rows of data, quickly spotting your outliers will not be as simple. These methods involve using the mean and median together. RESULTING IN - answer-If there are no extreme outliers in a dataset, the mean and median will be similar. If outliers do exist in the dataset, then the mean and median will be very different. The mean is affected by outliers, while the median is not. For this reason, - answer-Both provide value to analysts even though they both measure central tendency. When calculating the median of a set of values, you can use the - answer-MEDIAN function in Google Sheets. The last measure of central tendency is the mode. Mode is the value that appears - answer-Most frequently in the dataset. It is often referred to as the most "popular" value. There is no specific calculation to find the mode: you simply look for - answer-the most frequent values. For example, take a look at this list: [11, 12, 72, 12, 49, 11, 77, 13, 12]. The mode would be 12 because it appears the most often in the list (three times). Data cannot have a mode if all the values appear the same number of times. There can also be multiple modes if - answer-Two or more values appear the same number of times. The mode is most useful when the variable is discrete, since there are likely multiple occurrences of one value. When you need to figure out which discrete value occurs the most, the mode will t - answer-Tell you this information. When the data is continuous, it is less likely to have repeat values because there are typically more marginal differences (e.g., 3.00 vs. 3.01) between values. When calculating the mode of a set of values, you can use - answer-The MODE function in Google Sheets. MODE will return the first mode it finds. You would be able to identify this quickly in this set of values because it is fairly small, but if you had a larger set of values, you would not immediately notice that the MODE function did not return all the modes. Thus, - answer-In Google Sheets, you will use the MODE.MULT function. MODE.MULT stands for - answer-"mode multiple," and it will return every mode in the set of values selected. Once you identify the center of your dataset, your next step is to find out how . - answer-Spread out your data is. This allows you to see how well the measures of central tendency represent the data. Depending on how wide the spread of values are, the measures of central tendency will be - answer-More or less representative of the dataset. Measures of spread help you uncover how - answer-Spread out your data points are. In a larger spread, there are likely bigger spaces between the values within a dataset. You can discover how spread out your data is by using another type of descriptive statistics called - answer-Measures of spread. Measures of spread describe - answer-How similar or varied a set of values is from the central values. These measures include range, quartiles, interquartile range, variance, and standard deviation The first three measures of spread you will learn are - answer-range, quartiles, and interquartile range. Range: - answer-The difference between the lowest and the highest value within a dataset. Quartiles: - answer-Values that divide your dataset into quarters. Similar to how the median divides the dataset in half, the quartiles split your data into four equal parts. Interquartile range: - answer-A measure that describes the difference between the third quartile and the first quartile, which tells you about the range of the middle half of the values. The range is the simplest measure of spread, and it allows you to - answer-see the boundaries of your dataset.

Show more Read less
Institution
DESCRIPTIVE STATISTICS
Course
DESCRIPTIVE STATISTICS









Whoops! We can’t load your doc right now. Try again or contact support.

Written for

Institution
DESCRIPTIVE STATISTICS
Course
DESCRIPTIVE STATISTICS

Document information

Uploaded on
September 3, 2024
Number of pages
15
Written in
2024/2025
Type
Exam (elaborations)
Contains
Questions & answers

Content preview

1
1


DESCRIPTIVE STATISTICS EXAMINATION

Statistics is the branch of mathematics that focuses - answer-On analysis
and interpretation of data.

Distributions are a way to visualize the shape of your data, how - answer-

Descriptive statistics is a subset within statistics that allows - answer-You to
describe (or explain) and summarize the data you collect and then present it
in an efficient and meaningful way.

Descriptive statistics allows you to - answer-Organize, display, and describe
data using data visualizations, which can turn difficult-to-understand
variables across a large dataset into bite-sized descriptions.

Descriptive statistics will allow you to - answer-Quantitatively and
qualitatively describe the main features of a collection of information, such
as where the middle of the data is, how spread out it is, and where clumps of
values or anomalies may exist.

A single value from a dataset can carry plenty of meaning, but it can mean
even more when - answer-Compared to a larger dataset (such as your math
test score when compared to the scores of the entire class).

Measures of central tendency are used to - answer-Indicate and describe the
central position of a group of data. These measures are important because
they help you condense data, find representative values, make comparisons,
and perform further statistical analysis.

The three measures of central tendency are - answer-mean, median, and
mode.

Mean - answer-(often referred to as average) is the sum of all values divided
by the number of values in the dataset.

Median - answer-Is the middle value that separates the higher half from the
lower half of a dataset.

Mode - answer-Is the value in a dataset that appears most frequently.




[Type here]

, 1
1

It is the most popular measure of central tendency because - answer-It uses
every data point in its calculation. Mean is calculated using the following
formula:
Mean=Sum of All Values/Number of All Values

An outlier is a data point that - answer-Sticks out from the rest. It does not fit
in a trend that the other data shows, or it falls outside (above/below) a range
of values in which we would expect the data to fall.

Outliers easily does what to data - answer-Skew the data.

Although outliers skew the data, they can also tell analysts a lot of
information. As an analyst, you should always question - answer-Why the
outlier is in the data:
Could it be an error from when the data was collected?

Maybe it really is an employee's salary — for example, the CEO's?

Sometimes outliers can be explained, and sometimes they occur by chance.
Other times, they may require further investigation. As an analyst, you will
need to - answer-Ask questions when investigating why an outlier might be
in your data. Most of the time, these outliers can give you insight into your
data.

Outliers are important to investigate, but how do they fit into the larger
process of data analysis? - answer-In one instance, they can exist in the
initial data given to you. In this case, they should be cleaned or dealt with
accordingly when you've reached that step in the data wrangling checklist

In other cases, they appear in your results at the end of your analysis. In
these instances, they do not - answer-Get cleaned and instead become part
of your findings, which turn into inferences and insights.

There is no function in Google Sheets called "mean." However, there is an -
answer-AVERAGE function that has the same functionality. You will use the
AVERAGE function when finding the mean of a set of values. Below is the
Google Sheets documentation for reference.

The median is the middle value in a list of values. It is very important that
the list of values is - answer-Sorted from least to greatest, or greatest to
least, when calculating the median by hand.

Steps to calculate the median: - answer-Order your list of values in
ascending order.

[Type here]

Get to know the seller

Seller avatar
Reputation scores are based on the amount of documents a seller has sold for a fee and the reviews they have received for those documents. There are three levels: Bronze, Silver and Gold. The better the reputation, the more your can rely on the quality of the sellers work.
TOPDOCTOR Abacus College, Oxford
View profile
Follow You need to be logged in order to follow users or courses
Sold
10
Member since
2 year
Number of followers
5
Documents
3393
Last sold
2 months ago
TOPGRADER!!

Looking for relevant and updated study material to help you ace your exams? TOPTIERGRADES has your back!!! I have essential exams, test-banks, study bites, assignments all graded A+, Have Complete solutions, and are updated regularly. Please feel free to message me if you are looking for a specific test bank that is not listed on my profile or want a test bank or exam sent to you directly as google doc link. In the event that any of the materials have an issue, please let me know and I\'ll do my best to resolve it or provide an alternative. Thank You & All The Very BEST!!!!!

Read more Read less
5.0

1 reviews

5
1
4
0
3
0
2
0
1
0

Recently viewed by you

Why students choose Stuvia

Created by fellow students, verified by reviews

Quality you can trust: written by students who passed their tests and reviewed by others who've used these notes.

Didn't get what you expected? Choose another document

No worries! You can instantly pick a different document that better fits what you're looking for.

Pay as you like, start learning right away

No subscription, no commitments. Pay the way you're used to via credit card and download your PDF document instantly.

Student with book image

“Bought, downloaded, and aced it. It really can be that simple.”

Alisha Student

Frequently asked questions