100% satisfaction guarantee Immediately available after payment Both online and in PDF No strings attached 4.2 TrustPilot
logo-home
Class notes

Statistics II Regression Analysis - Answers to all assignments, including final assignment

Rating
-
Sold
-
Pages
99
Uploaded on
07-03-2023
Written in
2017/2018

Here are the answers to all weekly assignments for the PhD course at Maastricht University of Statistics II, focusing on Regression Analysis. With these answers, you will be able to assess your own learning, to see whether you understand what has been learned so far.

Show more Read less
Institution
Course











Whoops! We can’t load your doc right now. Try again or contact support.

Written for

Institution
Study
Course

Document information

Uploaded on
March 7, 2023
Number of pages
99
Written in
2017/2018
Type
Class notes
Professor(s)
B. winkens
Contains
All classes

Subjects

Content preview

1. Correlation
Data file:
Open the SPSS file: colton6re.sav. In this file, age in years (Age) and systolic blood pressure
in mmHg (SBP) are given for 33 women from a Canadian study. An interesting research
question in this study is whether there is an association between age and systolic blood
pressure. Moreover, how strong is this association?

Before you start with the following questions, make your own analysis strategy: what steps
should be taken to answer this research question?

Task 1.1
Make a scatterplot with systolic blood pressure on the y-axis and age on the x-axis to see
whether there are no impossible values and that the possible association between systolic
blood pressure and age is linear.




Question 1a.
Are there any impossible values for systolic blood pressure or for age?

Systolic blood pressure ranges from about 100 to 220 (exact: 99 to 217), which are all
plausible values. Age ranges from about 20 to 80 (exact 22 to 81). Thus, there are also no
impossible values for age.
PhD course Statistics part 2: Regression analysis and SPSS 1

,Question 1b.
Is the association positive or negative? Is there a straight-line association or does it deviate
substantially from a linear line? Why does this latter matter?

Positive association (if age increases, SBP also increases); no clear deviation from linearity,
therefore you may assume linearity. This deviation from linearity matters, since the Pearson
correlation coefficient is a measure of linear association. If this assumption is clearly
violated, one should not use Pearson correlation coefficient on the raw data. A
transformation to make the association linear or use of Spearman correlation (if association
is monotone in- or decreasing) can then be considered.

Question 1c.
Is the association strong or weak? Give an estimation of the correlation coefficient (do not
calculate it).

Whether the association is strong or weak, in other words whether the points are close to a
straight line, is hard to tell. I would say it is a medium to rather strong correlation.
Estimation: points fairly close to a straight line → correlation coefficient between 0.5 and
0.8.

Task 1.2
Compute with SPSS the Pearson and Spearman correlation between age and systolic blood
pressure. Test whether these correlations are significant.

Correlations

Systolic blood
Age in years pressure in mmHg

Age in years Pearson Correlation 1 .718**

Sig. (2-tailed) .000

N 33 33
Systolic blood pressure in mmHg Pearson Correlation .718** 1

Sig. (2-tailed) .000

N 33 33
**. Correlation is significant at the 0.01 level (2-tailed).


Correlations

Systolic blood
pressure in
Age in years mmHg

Spearman's rho Age in years Correlation Coefficient 1.000 .659**

Sig. (2-tailed) . .000

N 33 33

Systolic blood pressure in Correlation Coefficient .659** 1.000
mmHg Sig. (2-tailed) .000 .


PhD course Statistics part 2: Regression analysis and SPSS 2

, N 33 33

**. Correlation is significant at the 0.01 level (2-tailed).


Question 2a.
How large are the Pearson and Spearman correlations? Are these correlations significant,
assuming a significance level (α) of 0.05? Which null hypothesis belongs to these tests?

Pearson: r = 0.718; p = 0.000 (< 0.001), thus significant (smaller than 0.05). H0: ρ = 0
(correlation in population = 0),
Spearman: rs = 0.659; p = 0.000 (< 0.001), thus significant (smaller than 0.05). H0: ρs = 0
(Spearman correlation in population = 0).


Question 2b.
Which correlation (Pearson or Spearman) is preferred? What information is additionally
required to answer this question? Make sure that you obtain this information using SPSS and
try to explain why the Pearson or Spearman correlation is preferred?

The Pearson correlation is preferred if both variables are (approximately) normally
distributed (and the relation is linear, which is checked in question 1). Thus, normality has to
be checked for both variables.




PhD course Statistics part 2: Regression analysis and SPSS 3

, PhD course Statistics part 2: Regression analysis and SPSS 4
$6.64
Get access to the full document:

100% satisfaction guarantee
Immediately available after payment
Both online and in PDF
No strings attached

Get to know the seller
Seller avatar
catherinnaems
1.0
(1)

Get to know the seller

Seller avatar
catherinnaems Maastricht University
Follow You need to be logged in order to follow users or courses
Sold
1
Member since
2 year
Number of followers
1
Documents
0
Last sold
-

1.0

1 reviews

5
0
4
0
3
0
2
0
1
1

Recently viewed by you

Why students choose Stuvia

Created by fellow students, verified by reviews

Quality you can trust: written by students who passed their exams and reviewed by others who've used these notes.

Didn't get what you expected? Choose another document

No worries! You can immediately select a different document that better matches what you need.

Pay how you prefer, start learning right away

No subscription, no commitments. Pay the way you're used to via credit card or EFT and download your PDF document instantly.

Student with book image

“Bought, downloaded, and aced it. It really can be that simple.”

Alisha Student

Frequently asked questions