100% tevredenheidsgarantie Direct beschikbaar na je betaling Lees online óf als PDF Geen vaste maandelijkse kosten 4.2 TrustPilot
logo-home
Samenvatting

Statistics 2 - Full Summary & Overview

Beoordeling
-
Verkocht
-
Pagina's
6
Geüpload op
20-04-2022
Geschreven in
2021/2022

This document is a short summary and overview of the most important topics covered in Statistics 2, including information from the web lectures and Q&As. Since this is a summary, it may not include ALL the necessary information but this document is helpful for a revision of the most-discussed topics for Statistics 2.

Meer zien Lees minder









Oeps! We kunnen je document nu niet laden. Probeer het nog eens of neem contact op met support.

Documentinformatie

Geüpload op
20 april 2022
Aantal pagina's
6
Geschreven in
2021/2022
Type
Samenvatting

Voorbeeld van de inhoud

WEEK 1 ~ Bivariate Linear Regression
1. Describe what a linear regression is.
- Describes the r.ship between variables by fitting a straight line.
+ B0 => constant (when X=0)
+ B1 => coefficient (when X increases 1)
- Interpretation: DV will decline by approximately *slope* scale points for every unit increase in IV.
- Assumptions: IV can be binary / nominal but DV should be interval ratio.

2. Describe what influences the size of standard errors.
- Bigger sample size smaller error
- More variation in X, smaller error

3. What is the “least squares” line
- Type of linear regression that minimizes the residuals.

4. Calculating predicted values.
- Expected when DV= # : Constant + (Slope * #)
- The difference btw expected values always equals the slope.

5. Assess the statistical significance of model coefficients
- Coefficient / Standard Error = SS
- If CI includes 0 => not significant

WEEK 2 ~ Multiple Linear Regression
1. Assessing “model fit” and its practical considerations
a. R squared
- Shows the amount of variance in Y explained by the model.
- 1 means it explains everything
- Adjusted R Squared corrects for the inflation that occurs when we add additional variables
- Regression / Total = R square
b. F statistics
- F statistics show the ratio of variance explained by the model to unexplained variance.
- Regression / Residual
- Only shows one coefficient is significant- but not which one.
- Higher F means better fit.
- When running a simple or bivariate linear regression F= t^2

2. Describing the concept of “multicollinearity”
- Multicollinearity is the degree of correlation between your IVs
- VIF= -R^2
- should be smaller than 5 ideal
- Tolerance: VIF/1 => should be above 0.2
- Potential solutions: combining into a single variable, collecting more data

3. Ordinal variables
- In ordinal variables, the r.ship between X and Y isn’t linear.
- Treating as categorical => pick a reference category
- Treating as continuous => we assume they’re equally spaced

4. Run and interpret a multiple linear regression in SPSS
- The value of the constant term is $. This represents the expected or mean value of the DV when the
IVs in the model equals 0. In this case, this represents the mean value of the DV among those who

, say… (the reference group). The values for the other coefficients are … We can see that ^ and * are
more supportive of …



WEEK 3 ~ Moderation-Mediation & Outliers-Influential Cases




1. Understand the differences between moderator and mediator variables
- Moderator: a variable that affects the direction/ strength of the r.ship btw. IV and DV.
- Confounding: a variable causes both the IV and DV



- Post treatment variables: (general term) variables that are a consequence
of the IV we care about and also has an influence on the DV. (are
endogenous) We care about X => Y, Z is a nuisance!

+ Mediator: explains the process through which two variables are
related. How much of X => Y goes through Z is a mediator!



- Exogenous: your level of education doesn’t cause your parents level of
education, but your parents level of education may affect your level of
education, thus affecting your views on migration. (potential causes- but not caused by)

- Heterogeneous effect: effect of X on Y varies by Z
- Homogenous effect: effect of X on Y doesn't vary by Z

2. Assess influential cases and potential outliers using SPSS
I. Outliers
- Standardized Residual
+ No cases above 3.29
+ < 1% cases above 2.58
+ < %5 cases above 1.96

II. Influential
- Cook’s Distance below 1
- Standardized DF beta below |-1|
+ All cases - that one case (then standardized) ~ overall
- Adjusted PV (good if it's close to 0)
+ PV for that case from a model in which the case is excluded ~ 1 particular coefficient

3. Create interaction terms out of existing variables
- When the coefficient of a variable is positive, and its interaction with another variable is negative:
effect starts out positive and then grows smaller with a one unit increase.
-
€3,39
Krijg toegang tot het volledige document:

100% tevredenheidsgarantie
Direct beschikbaar na je betaling
Lees online óf als PDF
Geen vaste maandelijkse kosten

Maak kennis met de verkoper
Seller avatar
kaylasagiz
4,0
(1)

Maak kennis met de verkoper

Seller avatar
kaylasagiz Universiteit Leiden
Bekijk profiel
Volgen Je moet ingelogd zijn om studenten of vakken te kunnen volgen
Verkocht
14
Lid sinds
3 jaar
Aantal volgers
12
Documenten
2
Laatst verkocht
1 jaar geleden

4,0

1 beoordelingen

5
0
4
1
3
0
2
0
1
0

Recent door jou bekeken

Waarom studenten kiezen voor Stuvia

Gemaakt door medestudenten, geverifieerd door reviews

Kwaliteit die je kunt vertrouwen: geschreven door studenten die slaagden en beoordeeld door anderen die dit document gebruikten.

Niet tevreden? Kies een ander document

Geen zorgen! Je kunt voor hetzelfde geld direct een ander document kiezen dat beter past bij wat je zoekt.

Betaal zoals je wilt, start meteen met leren

Geen abonnement, geen verplichtingen. Betaal zoals je gewend bent via iDeal of creditcard en download je PDF-document meteen.

Student with book image

“Gekocht, gedownload en geslaagd. Zo makkelijk kan het dus zijn.”

Alisha Student

Veelgestelde vragen