100% satisfaction guarantee Immediately available after payment Both online and in PDF No strings attached 4.2 TrustPilot
logo-home
Exam (elaborations)

Inferential statistics cheat sheet test 1

Rating
-
Sold
1
Pages
2
Grade
8-9
Uploaded on
15-06-2025
Written in
2024/2025

Cheat sheet voor test 1 met de volgende onderwerpen: Proportions/means/goodness of fit (15%) You should be able to calculate (and interpret) by hand the se of proportions / means, given the standard deviation and n of a sample. You are thus also able to calculate the confidence interval and the margin of error for a 95% confidence interval. So maybe write down these formula on your cheat sheet :-). You should also be able to do a proportions test, a means test and a goodness of fit test by using R. Associations (15%) You are able to calculate the Chi square in a small table by hand (determine observed and expected values) and interpret that number (see in the R manual how to find the associated p-value of a chi square statistic and the associated degrees of freedom. You are familiar with the interpretation of the measures of association discussed in the course. Basic linear regression descriptive (20%) You are able to translate linear equations into graphs and vice versa. You are able to identify the b-coefficients in output and thus create linear equations and graphs. In the first test attention is restricted to additive linear models (the independent variables can be dummies, nominal or ratio, the max number of independent variables is two (so the most complex model is a nominal variable and a ratio variable as independent variables. Linear regression slightly more advanced / inference (30%) The inferential part includes confidence intervals, standard errors, t values and p values. You can interpret those when given some output and know how these numbers are related. You know what residuals are and are able to analyse those. Since the package modelr may not be available in the test environment and you cannot install your own packages during the exam, I will give clear instructions on how to find and use the residuals without modelr. In one question you will be asked to download a dataset, to construct relevant variables and to do and correctly perform a test using the lm() procedure. Selecting a test (20%) Based on a general problem, you have to select a specific test in R. This implies identifying the problem and selecting the associated test and clarifying the expectation. For example: make sure you know under which circumstances you have to select the Welch t-test and Welch ANOVA.

Show more Read less
Institution
Course








Whoops! We can’t load your doc right now. Try again or contact support.

Written for

Institution
Study
Course

Document information

Uploaded on
June 15, 2025
Number of pages
2
Written in
2024/2025
Type
Exam (elaborations)
Contains
Only questions

Subjects

Content preview

e-07 = 7 0’s before the output number | SE decreases when sample size increases | 68 – 95 – 99.7 |
USE 3 or ALL DECIMALS WHEN CALCULATING
Standard error of a proportion Margin of error 1.96 * π 




The sampling distribution of p is approximately normally distributed if N is fairly large and π is not
close to 0 or 1. A rule of thumb is that the approximation is good if both Nπ and N(1 - π) are greater
than 10.
General confidence interval: estimate ± 2 * SE
For mean: mean ± 2 * (s / sqrt(n))
For proportion: prop ± 2 * sqrt(p * (1 - p)) /(sqrt(n))
From R for proportion: Use prop.test()
For slope (regression): Use estimate (slope) ± 2 * SE, where both estimate and SE come from Routput
For correlation: Use the estimate from R
prop.test([# of positives], [n]) OR binom.test([# of positives], [n])
Lazy formulas for C.I (for the mean!) (change values for the question and put them on separate row)
mean = 5.02 | s = 1.87 | n = 300 | SE = s/sqrt(n) |
upper = mean + 2*SE| lower = mean - 2*SE
Standardizing / t value


How many SD
the sample mean away is from mean mu |
table(dataset) # to see how many 1’s or 0’s
Write in R as (x-mu)/(sd/sqrt(n)) (first make the
values x, mu, sd & n)
Standard error of a one sample t-test : S.e. = s.d./sqrt(n)
P value in R (t value) : 2*pt(-[t], [df]) | If you reject H0, my sample mean is significantly different for
the population mean.
Calculate the Chi square
Expected numbers Chi Calculate the outcomes of all the total tables for each of the cells (percentages
need to be written as 0.54)




Interpreting the chi square statistic



# to find P-value given a chisquare and df | pchisq(3.84[chi], 4[df], lower.tail = FALSE)
# P-value < 0.05 there is a significant association # P-value > 0.05 there is no significant association
# GOODNESS OF FIT TEST: Open question (sample size | proportion fractions)
observed <- c(30,98,80,100) | expected <- c(0.10,0.25,0.30,0.35) | chisq.test(x=observed,
p=expected) #H0: The sample proportions are a good representation of the population proportions
$10.23
Get access to the full document:

100% satisfaction guarantee
Immediately available after payment
Both online and in PDF
No strings attached

Get to know the seller
Seller avatar
marlouvanhoudt1

Also available in package deal

Get to know the seller

Seller avatar
marlouvanhoudt1 Universiteit Twente
Follow You need to be logged in order to follow users or courses
Sold
5
Member since
7 months
Number of followers
0
Documents
3
Last sold
8 hours ago

0.0

0 reviews

5
0
4
0
3
0
2
0
1
0

Recently viewed by you

Why students choose Stuvia

Created by fellow students, verified by reviews

Quality you can trust: written by students who passed their tests and reviewed by others who've used these notes.

Didn't get what you expected? Choose another document

No worries! You can instantly pick a different document that better fits what you're looking for.

Pay as you like, start learning right away

No subscription, no commitments. Pay the way you're used to via credit card and download your PDF document instantly.

Student with book image

“Bought, downloaded, and aced it. It really can be that simple.”

Alisha Student

Frequently asked questions