Samenvatting

Samenvatting literatuur week 3 Data analyse gezondheid en maatschappij

Beoordeling

Verkocht

Pagina's

Geüpload op

24-05-2021

Geschreven in

2020/2021

dit is een samenvatting van de aanbevolen literatuur van week 3 van het vak data analyse voor gezondheid en maatschappij

Instelling

Vak

Oeps! We kunnen je document nu niet laden. Probeer het nog eens of neem contact op met support.

Meld schending auteursrecht

Gekoppeld boek

Andy Field Discovering Statistics Using IBM SPSS

Uitgave:november 2017
ISBN:9781526419521
Druk:5

Geschreven voor

Instelling: Wageningen University (WUR)
Studie: Bachelor Gezondheid En Maatschappij
Vak: Data Analysis For Health And Society (YRM21806)

Alle documenten voor dit vak (7)

Documentinformatie

Heel boek samengevat?: Nee
Wat is er van het boek samengevat?: 2.9.6en 2.9.7 , 12.2 -12.3, 12.10 & 12.11 , 13.1 –13.4, 13.6 (niet 13.6.3), 13.7, 13.11 , 4.1 –14.4,
Geüpload op: 24 mei 2021
Aantal pagina's: 20
Geschreven in: 2020/2021
Type: Samenvatting

Onderwerpen

yrm21806
yrm
data analyse
data
analyse
field
literatuur
week
gezondheid
maatschappij
jaar 2
summary
samenvatting
en
gezondheid en maatschappij

Voorbeeld van de inhoud

LITERATUUR WEEK 3

FIELD

CHAPTER 2 THE SPINE OF STATISTICS

2.9.6 TYPE I AND TYPE II ERRORS

When we use test statistics to tell us about the true state of the world, we’re trying to see whether there is an
effect in our population. There are two possibilities. There is , in reality , an effect in the population or there is ,
in reality , no effect in the population. We have no way of knowing which of these possibilities is true. However
we can look at test statistics and their associated probability to help us to decide which of the two is more
likely. There are two mistakes we can make: a type I error and a type II error.
A type I error occurs when we believe that there is a genuine effect in our population, when in fact there isn’t.
The opposite is a type II error, which occurs when we believe that there is no effect in the population when, in
reality, there is.
There is a trade-off between these two errors: if we lower the probability of accepting an effect as genuine
( making alpha smaller) then we increase the probability that we’ll reject an effect that does genuinely exist.

2.9.7 INFLATED ERROR RATES

As we have seen, if a test uses a 0.05 level of significance then the chances of making a type I error are only
5%. Logically, then the probability of no type I error is 95% for each test. If you do three tests and we assume
that each test is independent, then the overall probability of no type I errors will be 0.95 3 = 0.95 x 0.95 x 0.95 =
0.857. Given that the probability of no type I errors is 0.857, then the probability of making at least one type I
error is this number subtracted from 1  1-0.857 = 0.143 or 14.3%. Therefore, across this group of tests the
probability of making a type I error has increased from 5% to 14.3%, a value greater than the criterion that is
typically used. This is called the familywise error rate. This can be calculated with the equation:

Familywise error = 1-0.95n

N is the number of tests carried out on the data. To combat this build-up of errors, we can adjust the level of
significance for individual tests such that the overall type I error rate ( alpha ) across all comparisons remains
0.05. The most popular way is to divide alpha by the number of comparisons k :

CHAPTER 12 GLM 1 :COMPARING SEVERAL INDEPENDENT MEANS

12.2 USING A LINEAR MODEL TO COMPARE SEVERAL MEANS
We saw in chapter 10 that if we include a predictor variable containing two categories into the linear model
then the resulting b for that predictor compares the difference between the mean score for the two categories.
We also saw in chapter 11 that if we want to include a categorical predictor that contains more than two
categories, this can be achieved by recoding that variable into several categorical predictors each of which has
only two categories ( dummy coding ). When we do , the bs for predictors represent differences between
means. Therefore, if we’re interested in comparing more than two means we can use the linear model to do
this. We test the overall fit of a linear model with an F-statistic, we can do the same here. We first use an F to
test whether we significantly predict the outcome variably by using group means and then use the specific
model parameters ( the bs ) to tell us which means differ from which. This chapter will develop what we
discovered in chapter 10 and 11 about using dummy variables in the linear model to compare means.

,Let’s start with an example. Puppy therapy rooms have been set up
to de-stress students and staff at the university of Sussex along with
universities in Bristol, Nottingham, Aberdeen and Lancaster. Despite
the increase in puppies on campuses to reduce stress, the evidence
base is pretty mixed. Imagine we want to contribute to this
literature by running a study in which we randomized people into
three groups: 1. Control group ( treatment as usual, no treatment or
placebo ) 2. 15 minutes of puppy therapy ( low-dose ) and 3. 30
minutes of puppy therapy ( high-dose group). The dependent variable was a measure of happiness ranging
from 0 to 10. We’d predict that any form of puppy therapy should be better than the control ( higher happiness
scores ) but also formulate a dose-response hypothesis that as exposure time increases happiness will increase
too. If we want to predict happiness from group membership we can use the general equation:

Outcomei = (model) + errori

We’ve seen that with two groups we can replace them by dummy variables ( 1 and 0 ) and an associated b -
value would represent the difference between the group
means. We have three groups , but we’ve also seen that this
situation is easily incorporated into the linear model by
including two dummy variables ( each assigned to a b-value )
and that any number of groups can be included by extending
the
number of dummy variables to one less than the number of
groups. We’ve also learnt that when we use dummy
variables we assign one group as the baseline ( 0 ). The
baseline category should be the condition against which
you intend to compare the other groups. In the puppy
therapy example, we can take the control group ( who
receive no puppy therapy ) as the baseline category because
we want to compare the 15 and 30 minute groups to this
group. Let’s call the 30 minutes long and the other short ( 15)
the model as predictors gives:

Happinessi = b0 + b1longi + b2shorti + εi

the baseline category is coded 0 . if a participant received 30 minutes of puppy therapy they are coded with a 1
for the long dummy variable and 0 for short. If the participant received 15 minutes of puppy therapy they are
coded with 1 for short and 0 for long.

Let’s first examine the model for the control group. Both the long and short dummy variables are coded 0
therefore we ignore the error term. The model becomes:

The 15 and 30 minute groups have dropped out of the model ( coded 0 ) and we’re left with b 0. The predicted

value of happiness will be the mean of the control group so we can replace happiness with this
value. This leaves us with that b0 in the model is always the mean of the baseline category. For the 30 minute
group, the value of the dummy variable is 1 and the value for short is 0 . the model becomes;

, Which tells us that predicted happiness for someone in the 30 minute group is the sum of b 0 and the b for the
dummy variable long (b1). We know that b0 is the mean of the control group and the predicted value of

happiness for someone in the 30 minute group is the mean of that group , therefore we can
replace b0 with the mean of the control and happiness with the mean of the 30 minutes group. The result is:

Which shows that the b-value for the dummy variable representing the 30 minute group is the difference
between the means of that group and the control. Finally we do this for the 15 minute group. The short is value
1 and the long is value 0.

Again, we replace bo by the mean of the control. The predicted value of happiness for someone in the 15

minute group is the mean of that group so we can replace happiness with , the result is:

Which shows us that the b-value for the dummy variable representing the 15 minute group is the difference
between means for the 15 minute group and the control.

The output from ANOVA:

The F(2,12) = 5,12 , p=0.025 , gives that our
model represents the group means. This F tells
us that using group means to predict
happiness scores is significantly better than
using the mean of all scores. It says: the group
means are significantly different.

The constant (b0) is equal to the mean of the
base category ( control group ) = 2.2. the b-
value of the first dummy variable b1 is equal to the difference between the means of the 30 minute group and
the control group ( 5.0-2.2 = 2.8). finally the b-value for the second dummy variable (b 2) is equal to the
difference between the mean of the 15 minute group and the control ( 3.2-2.2 = 1). The difference between
the 30 minute group and the control group is significant p=0.008, the difference between the 15 minute and
the control is not p=0.282.

12.1.1 LOGIC OF THE F-STATISTIC

The F-statistic tests the overall fit of a linear
model to a set of observed data. F is the ratio of

€7,49

Krijg toegang tot het volledige document:

100% tevredenheidsgarantie

Direct beschikbaar na je betaling

Lees online óf als PDF

Geen vaste maandelijkse kosten

Maak kennis met de verkoper

Laurieke

4,0

(1)

Maak kennis met de verkoper

Laurieke Wageningen University

Bekijk profiel

Volgen

Verkocht

Lid sinds

6 jaar

Aantal volgers

Documenten

Laatst verkocht

4 jaar geleden

4,0

1 beoordelingen

Recent door jou bekeken

Waarom studenten kiezen voor Stuvia

Gemaakt door medestudenten, geverifieerd door reviews

Kwaliteit die je kunt vertrouwen: geschreven door studenten die slaagden en beoordeeld door anderen die dit document gebruikten.

Niet tevreden? Kies een ander document

Geen zorgen! Je kunt voor hetzelfde geld direct een ander document kiezen dat beter past bij wat je zoekt.

Betaal zoals je wilt, start meteen met leren

Geen abonnement, geen verplichtingen. Betaal zoals je gewend bent via iDeal of creditcard en download je PDF-document meteen.

“Gekocht, gedownload en geslaagd. Zo makkelijk kan het dus zijn.”

Alisha Student

Veelgestelde vragen

Wat krijg ik als ik dit document koop?

Je krijgt een PDF, die direct beschikbaar is na je aankoop. Het gekochte document is altijd, overal en oneindig toegankelijk via je profiel.

Tevredenheidsgarantie: hoe werkt dat?

Onze tevredenheidsgarantie zorgt ervoor dat je altijd een studiedocument vindt dat goed bij je past. Je vult een formulier in en onze klantenservice regelt de rest.

Van wie koop ik deze samenvatting?

Stuvia is een marktplaats, je koop dit document dus niet van ons, maar van verkoper Laurieke. Stuvia faciliteert de betaling aan de verkoper.

Zit ik meteen vast aan een abonnement?

Nee, je koopt alleen deze samenvatting voor €7,49. Je zit daarna nergens aan vast.

Is Stuvia te vertrouwen?

4,6 sterren op Google & Trustpilot (+1000 reviews) Afgelopen 30 dagen zijn er 47073 samenvattingen verkocht Opgericht in 2010, al 15 jaar dé plek om samenvattingen te kopen

Samenvatting literatuur week 3 Data analyse gezondheid en maatschappij

Gekoppeld boek

Geschreven voor

Documentinformatie

Onderwerpen

Voorbeeld van de inhoud

Meer vakken binnen Wageningen University (WUR) > Bachelor Gezondheid En Maatschappij

Maak kennis met de verkoper

Recent door jou bekeken

Waarom studenten kiezen voor Stuvia

Gemaakt door medestudenten, geverifieerd door reviews

Niet tevreden? Kies een ander document

Betaal zoals je wilt, start meteen met leren

Veelgestelde vragen

Wat krijg ik als ik dit document koop?

Tevredenheidsgarantie: hoe werkt dat?

Van wie koop ik deze samenvatting?

Zit ik meteen vast aan een abonnement?

Is Stuvia te vertrouwen?