Class notes

STA2020F Applied Statistics – Lecture Slide Pack: Regression & ANOVA

Pages: 112
Uploaded on: 07-08-2025
Written in: 2025/2026

STA2020F Applied Statistics – Lecture Slide Pack: Regression & ANOVA (Semester 1, 2025). This pack contains the full Regression and Analysis of Variance (ANOVA) lecture slides from STA2020F (Applied Statistics), exactly as taught during Semester 1, 2025.

What’s included:
• Complete lecture slides for Regression & ANOVA
• Two simple summaries to help consolidate key ideas
• Neatly formatted and easy to read

Please note:
• These are purely the lecture slides – no additional notes or annotations
• A few example slides may be missing if they weren’t covered in class
• Still a reliable and structured resource that follows the taught content closely

Perfect for:
• Printing and bringing to lectures to annotate
• Creating your own summaries
• Seeing the full picture of the course content in a clear, organised format

Content preview

Simple Linear Regression
Outline
1. Recap of Foundational Concepts in Statistics
2. The Problem We Want to Solve
3. Example Problem
4. Correlation Analysis
5. Simple Linear Regression

Recap of Foundational Concepts in Statistics
Population vs Sample

• A numerical descriptor of a population is called a parameter, whereas a
numerical descriptor of a sample is called a statistic.
• We use sample statistics to approximate population parameters.
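
As a minimal sketch of this distinction, the Python snippet below treats a small made-up list of marks as the entire population, draws a random sample from it, and compares the sample mean (a statistic) with the population mean (a parameter). The marks, the sample size and the seed are illustrative assumptions, not course data.

```python
import random
import statistics

# Hypothetical "population": marks of all 12 students in a tiny class (made-up values).
population = [52, 61, 47, 75, 68, 59, 83, 44, 70, 66, 58, 73]

# Parameter: a numerical descriptor of the whole population.
population_mean = statistics.mean(population)

# Statistic: the same descriptor computed on a random sample only.
rng = random.Random(1)  # fixed seed so the sketch is repeatable
sample = rng.sample(population, 5)
sample_mean = statistics.mean(sample)

print(f"population mean (parameter): {population_mean:.2f}")
print(f"sample mean (statistic):     {sample_mean:.2f}")
```

Re-running with a different seed gives a different sample mean, while the population mean stays fixed; that variability is what sampling distributions describe.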

Statistical Inference

• Statistical inference is the attempt to reach a conclusion concerning a complete
set of observations (the population) using only a subset thereof (a sample).
• It is important to note that this sample needs to be representative of the
population in order to make accurate inferences.
• We make use of sampling distributions to make inferences.
• Statistical inference is conducted with the help of hypothesis testing.

Hypothesis Testing

Hypothesis testing allows us to make statements about a population from a sample of
that population. It involves the following basic steps:

Step 1: Define the null hypothesis (H0). This is the hypothesis of no statistical
significance.

Step 2: Define the alternative hypothesis (Ha). This is the hypothesis of statistical
significance.

Step 3: Define the significance level (α). This is the type I error rate (the probability
of falsely rejecting H0). Typically, α = 0.05 or α = 0.01 is sufficiently low.

Step 4: Calculate the test statistic. This is calculated differently depending on the
test being conducted.

Step 5: Find the p-value. This is the probability of getting a result as extreme as, or
more extreme than, the observed test statistic, assuming H0 is true. A precise p-value
can be generated using software, or an approximate one can be found by hand using tables.

Step 6: Make a conclusion. If the p-value is ≤ α, we reject H0 and conclude that our
result is statistically significant. Otherwise, we fail to reject H0 and conclude no
statistical significance (this means that the sample does not give us enough evidence to
make a statement about the population).
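
The six steps can be sketched in Python. This is an illustration under stated assumptions, not an example from the slides: it uses a two-sided one-sample z-test with a population standard deviation that is assumed known, and the marks, the hypothesised mean `mu0 = 60` and `sigma = 10` are all made up.

```python
import math
from statistics import NormalDist, mean

# Made-up sample of 10 marks (illustrative only, not course data).
sample = [68, 72, 59, 75, 66, 71, 63, 77, 70, 69]

# Steps 1 and 2: H0: mu = 60 versus Ha: mu != 60 (mu0 is an assumed value).
mu0 = 60.0
sigma = 10.0  # population standard deviation, assumed known for a z-test

# Step 3: significance level.
alpha = 0.05

# Step 4: test statistic z = (xbar - mu0) / (sigma / sqrt(n)).
n = len(sample)
xbar = mean(sample)
z = (xbar - mu0) / (sigma / math.sqrt(n))

# Step 5: two-sided p-value, P(|Z| >= |z|) assuming H0 is true.
p_value = 2 * (1 - NormalDist().cdf(abs(z)))

# Step 6: conclusion.
reject_h0 = p_value <= alpha
print(f"z = {z:.3f}, p-value = {p_value:.4f}, reject H0: {reject_h0}")
```

Only Step 4 and the reference distribution in Step 5 change from test to test; the other steps keep the same shape.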

The problem we want to solve
Describing the relationship between two variables

• How strong is the relationship? (We want to be able to quantify it.)
• Is this observed relationship likely real or just due to chance?
• Can we explain the impact that changing one variable has on another variable?
• Can we predict the value of one variable from another variable?

Lecture example
As part of an experiment, a lecturer recorded the overall course marks and number of
lectures attended for 20 students in the course that they teach. The results of this
experiment are shown below:

Correlation analysis as a method to solve our problem
Correlation is a measure of the strength and direction of a linear relationship between
two variables.

• Correlation is bounded between -1 and 1.
• Correlation does not have a unit.
• Correlation cannot be used to predict one variable from another.

Correlation coefficient

• Correlation is measured using the correlation coefficient (typically the Pearson
correlation coefficient).
• The population correlation coefficient (ρ) measures the direction and strength of
the association between the full set of two variables.
• The sample correlation coefficient (r) is an estimate of ρ and measures the
direction and strength of the association between the two variables in a sample
of the population.
• The sample correlation coefficient is given by:
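
A common form of the Pearson sample correlation coefficient (the slide's own notation may differ) is $r = \frac{\sum_{i=1}^{n}(x_i-\bar{x})(y_i-\bar{y})}{\sqrt{\sum_{i=1}^{n}(x_i-\bar{x})^2 \, \sum_{i=1}^{n}(y_i-\bar{y})^2}}$. The Python sketch below computes this by hand and checks two of the properties listed earlier: r has no unit and lies between -1 and 1. The helper name `pearson_r` and the five data points are hypothetical; they are not the lecture example's 20 students.

```python
import math

def pearson_r(x, y):
    """Sample Pearson correlation coefficient of two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    # Sums of cross-products and squared deviations from the means.
    s_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    s_xx = sum((xi - x_bar) ** 2 for xi in x)
    s_yy = sum((yi - y_bar) ** 2 for yi in y)
    if s_xx == 0 or s_yy == 0:
        raise ValueError("correlation is undefined when a variable is constant")
    return s_xy / math.sqrt(s_xx * s_yy)

# Made-up data: lectures attended and course mark for 5 students.
lectures = [2, 4, 6, 8, 10]
marks = [45, 50, 65, 60, 80]

r = pearson_r(lectures, marks)
print(f"r = {r:.4f}")

# Changing the units of a variable (marks as fractions instead of percentages)
# leaves r unchanged, because r has no unit.
marks_fraction = [m / 100 for m in marks]
r_rescaled = pearson_r(lectures, marks_fraction)
print(f"r after rescaling = {r_rescaled:.4f}")
```

On Python 3.10 or later, `statistics.correlation(lectures, marks)` should give the same value; the hand-written version is used here only to mirror the formula term by term.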

Test your understanding

Calculate the correlation coefficient between X and Y for the following 3 observations:

Example of data with different correlation coefficients

Correlation analysis with our example


Written for

Institution: University of Cape Town (UCT)
Course: STA2020 Applied Statistics (STA2020)


Document information

Uploaded on: August 7, 2025
Number of pages: 112
Written in: 2025/2026
Type: Class notes
Professor(s): Grace Carmichael, Ané Cloete
Contains: All classes

Subjects

lecture slides


Seller: catherineleppan (University of Cape Town)
