Geschreven door studenten die geslaagd zijn Direct beschikbaar na je betaling Online lezen of als PDF Verkeerd document? Gratis ruilen 4,6 TrustPilot
logo-home
Samenvatting

Summary for Modern Methods in Data Analysis

Beoordeling
-
Verkocht
1
Pagina's
52
Geüpload op
27-10-2023
Geschreven in
2023/2024

An overview/summary of the course 'Modern Methods in Data Analysis' (part of Epidemiology at Utrecht University/UMC Utrecht). Linear Models, Likelihood and Logistic Regression are explained. Also Poisson models and generalized linear models, Survival Analysis, Resampling methods, Longitudinal Data Analysis are explained.

Meer zien Lees minder

Voorbeeld van de inhoud

Modern Methods in Data Analysis
All notes are taken from the lecture material provided by the ‘Modern Methods in Data Analysis (Online)’
course offered by the MSc Epidemiology program of the UMC Utrecht and Utrecht University.



Learning Objectives
By the end of the course, you should be able to:

• Explain the principles of the likelihood theory and maximum likelihood methods
• Explain the principles of the following statistical analysis techniques: Logistic regression analysis, Poisson
regression analysis, Analysis of event history data, including the Cox proportional hazards regression model
• Explain model validation and regression diagnostics
• Describe the basic principles of longitudinal data analysis
• Apply the above-mentioned techniques using common statistical packages (for this course we use R)
• Name the situations in which these techniques can be applied and the conditions that should be met to obtain
reliable results using these techniques
• Explain and interpret the results obtained with these techniques, and apply these results in practice (e.g. to
answer a research question)


Course Structure
This course consists of 9 Learning Units, spread over 9 weeks.

• Learning Unit 1 - Linear Models
• Learning Unit 2 - Likelihood and Logistic Regression 1
• Learning Unit 3 - Logistic Regression 2
• Learning Unit 4 - Poisson models and generalized linear models
• Learning Unit 5 - Survival Analysis
• Learning Unit 6 - Resampling methods
• Learning Unit 7 - Longitudinal Data Analysis 1
• Learning Unit 8 - Longitudinal Data Analysis 2
• Learning Unit 9 - Final Exam and Course Closure

,Week 1 – Linear Models
Linear models: Continuous data and analysis of variance, Linear models and one-way ANOVA, Two-way
ANOVA, Regression analysis, Linear regression, Analysis of covariance (ANCOVA), Multiple regression.

Lecture 1.3 – Linear Models and One-Way ANOVA




All models fit under the multiple linear regression (the others are simplifications of this one).

Model for data-generating process population: yij = μ + (μi - μ) + εij
(i = group, j = observation number in group) obs = const + group effect + residual




Linear model for the ANOVA case is: yij = μ + (μi - μ) + εij
Assumptions: yij is normally distributed (yij ~ N (μi , σ2)) and σ is constant .
Alternatively: εij ~ N (0 , σ2) → this distribution of residuals is normally how you check the assumptions. (ε is
replaced with e, e is the residual from a sample)

Steps in ANOVA analysis, see below chapter 1.

No differences between the groups implies 𝜇1 = 𝜇2 = 𝜇3 = ⋯
→ variance within the groups = variance between the groups, or
𝐹 = 𝑀𝑆𝐺𝑟𝑜𝑢𝑝 / 𝑀𝑆𝑅𝑒𝑠 ≈1
If group means are not all equal then variance between the groups will be (much) larger than the variance
within the groups
𝐹 = 𝑀𝑆𝐺𝑟𝑜𝑢𝑝 / 𝑀𝑆𝑅𝑒𝑠 ≫1
To test 𝐻0:𝜇1 = 𝜇2 = 𝜇3 = ⋯ against 𝐻1: at least one mean differs, use the result that F has an F distribution

,with degrees of freedom dfgroup and dfres
→Use this to calculate p-values

2 ways to get the ANOVA table in R:
Model
> model.an <- lm(y ~ factor(group))

Method 1. Both work for 1-way anova, but drop 1 is better for 2-way (because of single term deletion)

> drop1(model.an, test="F")
Single term deletions
Df Sum of Sq RSS AIC F value Pr(>F)
<none> 14.583 6.3397
factor(group) 1 16.333 30.917 13.3567 11.2 0.007407 **


Method 2.
> anova(model.an)
Df Sum Sq Mean Sq F value Pr(>F)
factor(group) 1 16.333 16.3333 11.2 0.007407 **
Residuals 10 14.583 1.4583

F = residual variance.
Anova: test if there are significant differences in means of different groups.

Drop1: construct ANOVA table, and check whether there are significant differences between groups.
Drop1 drops terms one by one to examine how the model fit changes. It can be used to construct the ANOVA
table.


Lecture 1.6 – Two-Way ANOVA
Two-way ANOVA: two grouping variables/factors (e.g. treatment vs placebo and low vs high dose (could be
more groups, still two-way ANOVA)). Possible combination effects of 2 factors are possible.

Two possibilities with two-way ANOVA:
1. No effect of both factors
2. Effect of one factor, e.g. (main effect of) treatment
3. Effect of other factor, e.g. (main effect of) dose
4. Effect of both factors, but effects are independent; an additive effect (no interactions).
5. Effect of both factors, effects are dependent; an interaction/effect modification.

, 1 2 3




4 5 5 (other example)

So you look at the residuals to check the
normality assumptions
Analysis steps:




> Two-way ANOVA table
with 3 different F statistics:




1. Test differences between treatment groups (factor 1): F = MSgr / MSres
Follows an F-distribution with dfgr and dfres degrees of freedom.
2. Test differences between doses (factor 2): F = MSdose / MSres
Follows an F-distribution with dfdose and dfres degrees of freedom.
3. Test differences between treatment groups and dose (factor 1 and 2): F = MSinteraction / MSres
Follows an F-distribution with dfint and dfres degrees of freedom.

Geschreven voor

Vak
Onbekend

Documentinformatie

Geüpload op
27 oktober 2023
Aantal pagina's
52
Geschreven in
2023/2024
Type
SAMENVATTING

Onderwerpen

€8,48
Krijg toegang tot het volledige document:

Verkeerd document? Gratis ruilen Binnen 14 dagen na aankoop en voor het downloaden kun je een ander document kiezen. Je kunt het bedrag gewoon opnieuw besteden.
Geschreven door studenten die geslaagd zijn
Direct beschikbaar na je betaling
Online lezen of als PDF

Maak kennis met de verkoper

Seller avatar
De reputatie van een verkoper is gebaseerd op het aantal documenten dat iemand tegen betaling verkocht heeft en de beoordelingen die voor die items ontvangen zijn. Er zijn drie niveau’s te onderscheiden: brons, zilver en goud. Hoe beter de reputatie, hoe meer de kwaliteit van zijn of haar werk te vertrouwen is.
lenie22 Vrije Universiteit Amsterdam
Bekijk profiel
Volgen Je moet ingelogd zijn om studenten of vakken te kunnen volgen
Verkocht
74
Lid sinds
6 jaar
Aantal volgers
45
Documenten
14
Laatst verkocht
1 maand geleden

4,1

7 beoordelingen

5
3
4
2
3
2
2
0
1
0

Populaire documenten

Recent door jou bekeken

Waarom studenten kiezen voor Stuvia

Gemaakt door medestudenten, geverifieerd door reviews

Kwaliteit die je kunt vertrouwen: geschreven door studenten die slaagden en beoordeeld door anderen die dit document gebruikten.

Niet tevreden? Kies een ander document

Geen zorgen! Je kunt voor hetzelfde geld direct een ander document kiezen dat beter past bij wat je zoekt.

Betaal zoals je wilt, start meteen met leren

Geen abonnement, geen verplichtingen. Betaal zoals je gewend bent via iDeal of creditcard en download je PDF-document meteen.

Student with book image

“Gekocht, gedownload en geslaagd. Zo makkelijk kan het dus zijn.”

Alisha Student

Veelgestelde vragen