100% satisfaction guarantee Immediately available after payment Both online and in PDF No strings attached 4.2 TrustPilot
logo-home
Summary

STA2020 (Applied Statistics) Regression Summary

Rating
-
Sold
1
Pages
18
Uploaded on
22-06-2021
Written in
2020/2021

This is a complete and comprehensive summary of the regression section section of the STA2020 - Applied Statistics course. It contains everything you need to know to fully understand and do well in this section. Ideal for test/exam preparation! The sub-sections covered are an introduction to statistical inference, hypothesis testing, correlation analysis, simple linear regression, multiple linear regression, model building and logistic regression.

Show more Read less









Whoops! We can’t load your doc right now. Try again or contact support.

Document information

Uploaded on
June 22, 2021
Number of pages
18
Written in
2020/2021
Type
Summary

Content preview

STA2020F: Applied Statistics

Regression Notes



Statistical inference: The attempt to reach a conclusion concerning all members of a
class (population) from observations of only some of them (sample).



Population: A collection of observations.

• Numerical descriptor = A parameter.
• Population size = N.
• Mean (a measure of centre) = .
• Variance (a measure of dispersion) = 2.
• Standard deviation = .



Sample: A part or subset of a population.

• Numerical descriptor = A statistic.
• Sample size = n.
• Sample mean = 𝑥̅ .
• Sample variance = s2.
• Sample standard deviation = s.



Hypothesis testing:

• Define the null hypothesis, H0.
• Define the alternative hypothesis, Ha, where Ha is a form of “not H0”.
• Define the type 1 error (probability of falsely rejecting the null hypothesis), ,
usually 0.05.
𝑥̅ − 𝜇
• Calculate the test statistic. 𝑧 = .
𝜎
• Calculate the p-value (the probability of getting a result “as or more extreme”
than observed if the null hypothesis is true).
• If the p-value is  , reject H0. Otherwise, fail to reject H0.

, Correlation analysis: Measures the strength and direction of the linear relationship
between two variables.



Correlation coefficient

• Population correlation coefficient (p) – Measures the strength of the association
between the variables.
• Sample correlation coefficient (r) – An estimate of p, used to measure the
strength of the linear relationship in the sample observations.
• The sample correlation coefficient (r):




• Unit free.
• Lies between -1 (strong negative relationship) and 1 (strong positive relationship).
• Significance of the correlation coefficient:

Get to know the seller

Seller avatar
Reputation scores are based on the amount of documents a seller has sold for a fee and the reviews they have received for those documents. There are three levels: Bronze, Silver and Gold. The better the reputation, the more your can rely on the quality of the sellers work.
stuviasisters University of Cape Town
View profile
Follow You need to be logged in order to follow users or courses
Sold
140
Member since
4 year
Number of followers
104
Documents
31
Last sold
3 weeks ago
stuviasisters

4,5

42 reviews

5
28
4
11
3
0
2
1
1
2

Recently viewed by you

Why students choose Stuvia

Created by fellow students, verified by reviews

Quality you can trust: written by students who passed their exams and reviewed by others who've used these notes.

Didn't get what you expected? Choose another document

No worries! You can immediately select a different document that better matches what you need.

Pay how you prefer, start learning right away

No subscription, no commitments. Pay the way you're used to via credit card or EFT and download your PDF document instantly.

Student with book image

“Bought, downloaded, and aced it. It really can be that simple.”

Alisha Student

Frequently asked questions