100% satisfaction guarantee Immediately available after payment Both online and in PDF No strings attached 4.6 TrustPilot
logo-home
Summary

Summary Descriptive Statistics Lecture 3 (H3.1&3.2)

Rating
-
Sold
-
Pages
6
Uploaded on
12-02-2023
Written in
2022/2023

This is a summary for the subject matter of lecture 3 of Descriptive Statistics in the pre-master Orthopedagogy at the University of Amsterdam. It covers chapters 3.1 and 3.2 of Algresti & Franklin (Statistics).

Institution
Course









Whoops! We can’t load your doc right now. Try again or contact support.

Connected book

Written for

Institution
Study
Course

Document information

Summarized whole book?
No
Which chapters are summarized?
H3.1 & 3.2
Uploaded on
February 12, 2023
Number of pages
6
Written in
2022/2023
Type
Summary

Subjects

Content preview

3. Association: contingency, correlation and regression
When we analyse data on two variables, our first step is to distinguish between the response
variable and the explanatory variable

The response variable = the outcome variable on which comparisons are made for different values
of the explanatory variable

The explanatory variable = categorical, it defines the groups to be compared with respect to the
response variable. When the explanatory variable is quantitative, we examine how different
values of the explanatory variable relate to changes in the response variable.

The data analysis examines how the outcome on the response variable depends on or is explained
by the value of the explanatory variable.

Some studies regard either or both variables as response variables. There is no clear distinction as to
which variable would be explanatory for the other.

The main purpose of data analysis with two variables is to investigate whether there is an
association and to describe the nature of that association.

An association exists between two variables is particular values for one variable are more likely to
occur with certain values of the other variable.

3.1. The association between two categorical variables
Contingency table = a display for two categorical variables. Its rows list the categories of one
variable and its columns list the
categories of the other variable. Each
entry in the table is the number of
observations in the sample at a
particular combination of categories
of the two categorical variables.

Example 




Each row and column combination in a contingency table is called a cell.

The process of taking a data file and finding the frequencies for the cells of a contingency table is
referred to as cross-tabulation of the data.

Conditional proportion = a proportion whose formation is conditional on a variable. It refers to a
particular row/column of the contingency table

- The conditional proportions in each row sum to 1.0
- The sample size n for each set of conditional proportions is listed so you can determine the
frequencies on which the conditional proportions were based.
- Whenever we distinguish between a response variable and an explanatory variable, it is
natural to form conditional proportions (based on the explanatory variable) for categories of
the response variable

, Marginal proportion = the proportion of all the values of a variable. It is found using counts in the
margin of the table. It refers to the sum of the row/column of the contingency table.

Side-by-side bar chart = a single bar graphs that shows the bars for the conditional proportions side
by side.

Stacked bar chart = a display that compares the conditional proportions by stacking the proportions
on top of each other.




Both the side-by-side bar graph and the stacked bar chart allow an easy comparison of the
conditional proportions across the explanatory variables.

When forming a contingency table, determine whether one variable should be the response
variable. If there is a clear explanatory/response distinction, that dictates which way we compute
the conditional proportions. In some cases, either variable could be the response variable. Then you
can form conditional proportions in either or both directions. Studying the conditional proportions
helps you judge whether there is an association between the variables.
$4.17
Get access to the full document:

100% satisfaction guarantee
Immediately available after payment
Both online and in PDF
No strings attached


Also available in package deal

Get to know the seller

Seller avatar
Reputation scores are based on the amount of documents a seller has sold for a fee and the reviews they have received for those documents. There are three levels: Bronze, Silver and Gold. The better the reputation, the more your can rely on the quality of the sellers work.
sevendeboer Universiteit van Amsterdam
Follow You need to be logged in order to follow users or courses
Sold
17
Member since
2 year
Number of followers
13
Documents
17
Last sold
3 months ago

5.0

2 reviews

5
2
4
0
3
0
2
0
1
0

Recently viewed by you

Why students choose Stuvia

Created by fellow students, verified by reviews

Quality you can trust: written by students who passed their tests and reviewed by others who've used these notes.

Didn't get what you expected? Choose another document

No worries! You can instantly pick a different document that better fits what you're looking for.

Pay as you like, start learning right away

No subscription, no commitments. Pay the way you're used to via credit card and download your PDF document instantly.

Student with book image

“Bought, downloaded, and aced it. It really can be that simple.”

Alisha Student

Frequently asked questions