100% satisfaction guarantee Immediately available after payment Both online and in PDF No strings attached 4.2 TrustPilot
logo-home
Summary

Summary SPSSS Andy Field Ch. 5 - 9

Rating
3.5
(6)
Sold
10
Pages
20
Uploaded on
19-01-2016
Written in
2014/2015

Summary of the book Discovering statistics using SPSS written by Andy Field. It includes chapter 5, 6, 7, 8 & 9.

Institution
Course









Whoops! We can’t load your doc right now. Try again or contact support.

Connected book

Written for

Institution
Study
Course

Document information

Summarized whole book?
No
Which chapters are summarized?
Chapter 5, 6, 7, 8 & 9
Uploaded on
January 19, 2016
Number of pages
20
Written in
2014/2015
Type
Summary

Subjects

Content preview

SPSSS Andy Field – summary Chapter 5

Bias = Things that lead us to a wrong conclusion.

When we estimate a parameter we compute an estimate of how well it
represents the population, such as a standard error or confidence intervals, or
test statistics and their associated probabilities.

Assumption = a condition that ensures that what you’re attempting to do
works. When the assumption is not met, it is called a violation. The main
assumptions we look at are (1) additivity and linearity, (2) normality, (3)
homoscedasticity, and (4) independence.

Outlier = a score very different from the rest of the data. An outlier can bias a
parameter estimate, such as decreasing or increasing the mean. Outliers also
affect the sum of squared error dramatically, because we use squared errors, so
any bias created by the outlier is magnified by the fact that deviations are
squared. If the sum of squared errors is biased, so are the standard error and the
confidence intervals.

The assumption of additivity and linearity means that the outcome variable is,
in reality, linearly related to any predictors (i.e. a straight line). If this assumption
is not true, even if all other assumptions are not met, your model is invalid.

The normal distribution is valid to:
1. Parameter estimates: parameters (such as a mean) are affected by non-
normal distributions (such as outliers). It depends on the parameter how
much they are biased, a median is less biased by a skewed distribution
than the mean.
2. Confidence intervals: the standard normal distribution is used to compute
the confidence intervals around a parameter estimate.
3. Null hypothesis significance testing: to test a hypothesis we use the normal
distribution, because we assume the parameter has a normal distribution.
4. Errors: any model we fit include some error. These residuals need to be
normally distributed

The assumption of normality: The estimate of the confidence interval needs to
come from a normal distribution, and the sampling distribution must be normal,
and the estimates of the parameters must be normal. (This is not the same as
that the data needs to be normally distributed).

The central limit theorem revisited: As our sample sizes get bigger the sampling
distributions become more normal, up to point at which the sample is big enough
that the sampling distribution is normal. This is the central limit theorem:
regardless of the shape of the population, parameter estimates of that population
will have a normal distribution provided the samples are big enough.

The central limit theorem means that there are a variety of situations in which we
can assume normality regardless of the shape of our sample data. If our sample
is large enough we do not need to worry about the assumption of normality.
If you want to estimate parameters of your model then normality doesn’t really
matter.

, Homoscedasticity: assume that each of the samples come from populations
with the same variance.
We have to assume homoscedasticity in order to make sure our estimates of the
parameters that define our model and our significance test are accurate.

Example: 10 people are on tour with the loudest band and are measured for how
many hours after the concert these people had ringing in their ears. The scores
are presented by dots in a graph and the means are presented by blocks. In case
there is homoscedasticity the circles will lay around the dots every time the score
is measured. In case there is no homoscedasticity (thus, heteroscedascitiy) the
dots do not lay equally around the blocks, but differ along the y-axis (see page
175 for the example graphs).

If variances for the outcome variable differ along the predictor variable then the
estimates of the parameters within the model will not be optimal.
Heteroscedascitity creates a bias and inconsistency in the estimate of the
standard error.

Independence: this assumption means that the errors in your model are not
related to each other.
Example: Paul and Julie need to answer whether they have seen certain photos
before. In case they are not able to confer, the scores will be independent.

A histogram or a boxplot is an easy way to spot outliers.

Besides frequency distributions the P-plot (probability-probability plot) is another
useful graph for checking normality; it plots the cumulative probability of a
variable against the cumulative probability of a particular distribution. The data
are ranked and sorted, then for each rank the corresponding z-score is calculated
to create an ‘expected value’ that the score should have in a normal distribution.
Next, the score itself is converted to a z-score. The actual z-score is plotted
against the expected z-score. If the data is normally distributed the z-scores will
be the same, and you will get a perfectly diagonal line.

Graphs are particularly useful for looking at normality in big samples; however, in
smaller samples it can be useful to explore the distribution of variables using the
frequencies command. Analyze  descriptive statistics  frequencies. Select:
quartiles, standard deviation, variance, range, minimum, maximum, standard
error mean, mean, median, mode, skewedness, kurtosis (in statistics).
Positive values of skewness indicate a pile-up of scores on the left of the
distribution, whereas negative values indicate a pile-up on the right. Positive
values of kurtosis indicate a pointy and heavy-tailed distribution, whereas
negative values indicate a flat and light-tailed distribution. The further the value
is from zero, the more likely it is that the date are not normally distributed.
These values can be converted to z-scores, which enables us to (1) compare skew
and kurtosis values in different samples that used different measures, and (2)
calculate a p-value that tells us if the values are significantly different from 0 (i.e.
normal).

Both assumptions relate to the errors in the model we fit to the data. We can
create a scatterplot of the values of the residuals against the values of the
outcome predicted by our model. In doing so we are looking at whether there is a
systematic relationship between what comes out of the model and the errors in
$3.61
Get access to the full document:
Purchased by 10 students

100% satisfaction guarantee
Immediately available after payment
Both online and in PDF
No strings attached

Reviews from verified buyers

Showing all 6 reviews
4 year ago

Chapter 8 is missing and not very extensive

5 year ago

5 year ago

6 year ago

Chapter 8 is missing

6 year ago

6 year ago

3.5

6 reviews

5
2
4
1
3
1
2
2
1
0
Trustworthy reviews on Stuvia

All reviews are made by real Stuvia users after verified purchases.

Get to know the seller

Seller avatar
Reputation scores are based on the amount of documents a seller has sold for a fee and the reviews they have received for those documents. There are three levels: Bronze, Silver and Gold. The better the reputation, the more your can rely on the quality of the sellers work.
jannahollema Erasmus Universiteit Rotterdam
Follow You need to be logged in order to follow users or courses
Sold
507
Member since
12 year
Number of followers
368
Documents
13
Last sold
1 year ago

Samenvattingen/essays/papers etc. van mijn Bachelor opleiding International Leisure Studies (Vrijetijdswetenschappen). Vakken die ik o.a. aanbied zijn Sociology, Anthropology, Research Methods, Marketing, Economy etc. Daarnaast bied ik vanaf 2016 ook samenvattingen aan van mijn Master opleiding Human Resource Management aan de Erasmus Universiteit.

3.8

108 reviews

5
32
4
40
3
24
2
7
1
5

Recently viewed by you

Why students choose Stuvia

Created by fellow students, verified by reviews

Quality you can trust: written by students who passed their tests and reviewed by others who've used these notes.

Didn't get what you expected? Choose another document

No worries! You can instantly pick a different document that better fits what you're looking for.

Pay as you like, start learning right away

No subscription, no commitments. Pay the way you're used to via credit card and download your PDF document instantly.

Student with book image

“Bought, downloaded, and aced it. It really can be that simple.”

Alisha Student

Frequently asked questions