100% satisfaction guarantee Immediately available after payment Both online and in PDF No strings attached 4.2 TrustPilot
logo-home
Summary

Full Summary of STA3022F course notes

Rating
-
Sold
2
Pages
54
Uploaded on
26-05-2023
Written in
2022/2023

Summary of Part 1 and 2 of the STA3022F course notes, prepared in 2023 covering the theory of the course as well as formulas and methods. Lecture slides are also covered in this document.

Institution
Course











Whoops! We can’t load your doc right now. Try again or contact support.

Written for

Institution
Course

Document information

Uploaded on
May 26, 2023
Number of pages
54
Written in
2022/2023
Type
Summary

Subjects

Content preview

STA3022F Summaries




Chapter 1 - Chapter 10
2023


1

,CH 1: DATA

Data types:
- Numerical variables: measurements that can be recorded on a quantitative scale
where the intervals between two values on the scale have some consistent
meaning

• Ex. Height, age, number of children
• Can further classify numerical variables as continuous if they can take on any
intermediate value on the scale (e.g. height) or discrete if the values a variable
can take on are limited in some way, often to the set of whole numbers (e.g.
number of children).
- Categorical variables: measurements of individuals in terms of groups or
categories where the gap between categories have no intrinsic meaning.
- Ratio-scaled numerical variables are those that have a natural zero point (like age,
height, and income). Called ratio scaled because not sensitive to units of
measurements.
- Interval-scaled variables are still numeric but do not have a natural zero point (IQ
and temperature in degrees Celsius are of this type). Interval-scaled variables
therefore have an arbitrary zero point and an arbitrary scale
- Ordinal categorical variables are those where the categories can be ordered even
if the gaps between them cannot be interpreted (such as level of education, which
can be ordered: none, primary-school, high-school, undergraduate degree,
postgraduate degree)
- Nominal categorical variable cannot be ordered in any meaningful way (such as
race or language group)
- Likert/rating scales:
• measurement scale usually ranging from some negatively worded statement
(e.g. “strongly disagree”, “terrible”) to some positively worded statement (e.g.
“strongly agree”, “excellent”).

• categorical because the numbers are only being used as labels for the written
descriptions, and a gap of one unit cannot be consistently interpreted.




2

,Standardising Data
- Data measured in di erent scales can cause issues in multivariate analysis as it
will give too much in uence on variables measured on larger scales.
- Steps:
• Calculate the mean and standard deviation of each variable in the data matrix
(i.e. these are the column means and the column standard deviations).

• Subtract each element in the data matrix by its column mean.
• Divide the resulting “element minus mean” by its column standard deviation.

Singular Value Decomposition




- D matrix: diagonal matrix with 0’s on o diagonals.
• Number of diagonal entries = min(n,p)
• Values in D are singular values (>= 0)
• Singular values ordered in decreasing order across diagonals
- SVD is the basis for approximating multivariate data by dimension reduction.
- Huygens’ Principle: the approx necessarily includes the centroid so we will centre
data matrix X before doing the approximation. (Unless X already standardised)




3


fffl ff

, CH 2: PRINCIPAL COMPONENT ANALYSIS
- Main aim: Dimension Reduction
- New uncorrelated variables will be denoted by Y1,…,Yr and these will be a linear
combination of original variables X1,…,Xp
- Each principal component Yi is a linear combination of the Xi variables (usually
original ones in standardised form) in such a way that the rst axis (i.e., the rst
principal components) is in the direction containing most variation.

4



fi fi
$7.36
Get access to the full document:

100% satisfaction guarantee
Immediately available after payment
Both online and in PDF
No strings attached

Get to know the seller
Seller avatar
ec12
5.0
(1)

Get to know the seller

Seller avatar
ec12 University of Cape Town
Follow You need to be logged in order to follow users or courses
Sold
5
Member since
5 year
Number of followers
4
Documents
4
Last sold
7 months ago

5.0

1 reviews

5
1
4
0
3
0
2
0
1
0

Recently viewed by you

Why students choose Stuvia

Created by fellow students, verified by reviews

Quality you can trust: written by students who passed their tests and reviewed by others who've used these notes.

Didn't get what you expected? Choose another document

No worries! You can instantly pick a different document that better fits what you're looking for.

Pay as you like, start learning right away

No subscription, no commitments. Pay the way you're used to via credit card and download your PDF document instantly.

Student with book image

“Bought, downloaded, and aced it. It really can be that simple.”

Alisha Student

Frequently asked questions