3.3. Predicting the outcome of a variable
Exploring the relationship between 2 quantitative variables graphically → scatterplot
Straight-line pattern? → the correlation coefficient describes its strength numerically
Further analysis → finding an equation for the straight line that best describes that pattern
This equation can be used to predict the value of the variable designated as the response variable
from the value of the variable designated as the explanatory variable.
Regression line = predicts the value of the response variable y as a straight-line function of the
value x of the explanatory variable. Let ŷ denote the predicted value of y.
- The equation for the regression line has the form: ŷ = a + bx
- a denotes the y-intercept and b denotes the slope.
y-intercept = the predicted value of y when x = 0
slope = the amount that ŷ changes when x increases by one unit
- For two x values that differ by 1.0, the ŷ values differ by b.
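The prediction equation above can be sketched in a few lines of code. This is a minimal illustration, assuming made-up coefficients a = 2.0 and b = 0.5 (not from any real data set):

```python
# Using a regression equation y-hat = a + b*x for prediction.
# a and b are invented example coefficients, not fitted values.
a, b = 2.0, 0.5

def predict(x):
    """Predicted value y-hat = a + b*x."""
    return a + b * x

# Two x values that differ by 1.0 give predictions that differ by the slope b.
print(predict(10))                # 7.0
print(predict(11))                # 7.5
print(predict(11) - predict(10))  # 0.5, equal to b
```

Note how the difference between the two predictions equals the slope b, matching the interpretation above.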
When the slope is negative, ŷ decreases as x increases. The straight line then goes downward,
and the association is negative.
When the slope = 0, the regression line is horizontal (parallel to the x-axis). ŷ stays constant
at the y-intercept for any value of x. ŷ does not change as x changes, so the variables don't
exhibit association.
The absolute value of the slope describes the magnitude of the change in ŷ for a 1-unit change in x.
The larger the absolute value, the steeper the regression line.
Prediction error / residual = the difference between the actual y value and the predicted y value.
Residual = y − ŷ
Each observation has a residual.
A positive residual occurs when the actual y is larger than ŷ, so that y − ŷ > 0.
A negative residual results when the actual y is smaller than ŷ, so that y − ŷ < 0.
The smaller the absolute value of the residual, the closer the predicted value is to the actual value,
so the better the prediction.
If the predicted value is the same as the actual value, the residual is zero: y − ŷ = 0.
In a scatterplot, the vertical distance between the point and the regression line is the absolute value
of the residual.
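Residuals are simple to compute directly from the definition. A small sketch, using invented observations and the same made-up line ŷ = 2.0 + 0.5x as before:

```python
# Computing residuals y - y-hat for a few observations.
# The data and the line are hypothetical, for illustration only.
a, b = 2.0, 0.5
xs = [1.0, 2.0, 3.0]
ys = [2.7, 3.1, 3.4]   # invented observed y values

predicted = [a + b * x for x in xs]
residuals = [y - yhat for y, yhat in zip(ys, predicted)]

for y, yhat, res in zip(ys, predicted, residuals):
    sign = "positive" if res > 0 else "negative" if res < 0 else "zero"
    print(f"y={y}, y-hat={yhat}, residual={res:+.2f} ({sign})")
```

Points above the line produce positive residuals and points below it produce negative ones, exactly as described above.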
How is the equation for the regression line found?
The actual summary measure used to evaluate regression lines is called the residual sum of squares
residual sum of squares = Σ(residual)² = Σ(y − ŷ)²
This formula squares each vertical distance between a point and the line and then adds up these
squared values. The better the line, the smaller the residuals tend to be, and the smaller the residual
sum of squares tends to be.
For each potential line, we have a set of predicted values, a set of residuals and a residual sum of
squares. The line that the software reports is the one having the smallest residual sum of squares.
This is why selecting a line is called the least squares method.
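The least squares idea can be demonstrated by comparing the residual sum of squares across a few candidate lines. A sketch on invented data that roughly follows y = 2x (the candidate coefficients are also made up):

```python
# Comparing candidate lines by their residual sum of squares (RSS).
# Data are invented for illustration, roughly following y = 2x.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [2.1, 3.9, 6.2, 7.8]

def rss(a, b):
    """Residual sum of squares for the candidate line y-hat = a + b*x."""
    return sum((y - (a + b * x)) ** 2 for x, y in zip(xs, ys))

# The line with the smallest RSS fits best; software searches for the minimum.
for a, b in [(0.0, 2.0), (1.0, 1.5), (0.0, 1.0)]:
    print(f"a={a}, b={b}, RSS={rss(a, b):.2f}")
```

Here the line ŷ = 2x gives a much smaller residual sum of squares than the other two candidates, so among these three it fits the data best.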
This regression line:
- Makes the errors as small as possible
- Has some positive residuals and some negative residuals, and the sum (and mean) of the
residuals equals 0
o Too-high predictions are balanced by too-low predictions
- Passes through the point (x̄, ȳ)
o The center of the data
Formula for the slope: b = r(sy / sx), where sy and sx are the standard deviations of y and x
Formula for the y-intercept: a = ȳ − b·x̄
The slope b is directly related to the correlation r, and the y-intercept depends on the slope.
We’ve used correlation to describe the strength of the association.