100% tevredenheidsgarantie Direct beschikbaar na je betaling Lees online óf als PDF Geen vaste maandelijkse kosten 4.2 TrustPilot
logo-home
Samenvatting

Full Summary of STA3022F course notes

Beoordeling
-
Verkocht
2
Pagina's
54
Geüpload op
26-05-2023
Geschreven in
2022/2023

Summary of Part 1 and 2 of the STA3022F course notes, prepared in 2023 covering the theory of the course as well as formulas and methods. Lecture slides are also covered in this document.

Instelling
Vak











Oeps! We kunnen je document nu niet laden. Probeer het nog eens of neem contact op met support.

Geschreven voor

Instelling
Vak

Documentinformatie

Geüpload op
26 mei 2023
Aantal pagina's
54
Geschreven in
2022/2023
Type
Samenvatting

Onderwerpen

Voorbeeld van de inhoud

STA3022F Summaries




Chapter 1 - Chapter 10
2023


1

,CH 1: DATA

Data types:
- Numerical variables: measurements that can be recorded on a quantitative scale
where the intervals between two values on the scale have some consistent
meaning

• Ex. Height, age, number of children
• Can further classify numerical variables as continuous if they can take on any
intermediate value on the scale (e.g. height) or discrete if the values a variable
can take on are limited in some way, often to the set of whole numbers (e.g.
number of children).
- Categorical variables: measurements of individuals in terms of groups or
categories where the gap between categories have no intrinsic meaning.
- Ratio-scaled numerical variables are those that have a natural zero point (like age,
height, and income). Called ratio scaled because not sensitive to units of
measurements.
- Interval-scaled variables are still numeric but do not have a natural zero point (IQ
and temperature in degrees Celsius are of this type). Interval-scaled variables
therefore have an arbitrary zero point and an arbitrary scale
- Ordinal categorical variables are those where the categories can be ordered even
if the gaps between them cannot be interpreted (such as level of education, which
can be ordered: none, primary-school, high-school, undergraduate degree,
postgraduate degree)
- Nominal categorical variable cannot be ordered in any meaningful way (such as
race or language group)
- Likert/rating scales:
• measurement scale usually ranging from some negatively worded statement
(e.g. “strongly disagree”, “terrible”) to some positively worded statement (e.g.
“strongly agree”, “excellent”).

• categorical because the numbers are only being used as labels for the written
descriptions, and a gap of one unit cannot be consistently interpreted.




2

,Standardising Data
- Data measured in di erent scales can cause issues in multivariate analysis as it
will give too much in uence on variables measured on larger scales.
- Steps:
• Calculate the mean and standard deviation of each variable in the data matrix
(i.e. these are the column means and the column standard deviations).

• Subtract each element in the data matrix by its column mean.
• Divide the resulting “element minus mean” by its column standard deviation.

Singular Value Decomposition




- D matrix: diagonal matrix with 0’s on o diagonals.
• Number of diagonal entries = min(n,p)
• Values in D are singular values (>= 0)
• Singular values ordered in decreasing order across diagonals
- SVD is the basis for approximating multivariate data by dimension reduction.
- Huygens’ Principle: the approx necessarily includes the centroid so we will centre
data matrix X before doing the approximation. (Unless X already standardised)




3


fffl ff

, CH 2: PRINCIPAL COMPONENT ANALYSIS
- Main aim: Dimension Reduction
- New uncorrelated variables will be denoted by Y1,…,Yr and these will be a linear
combination of original variables X1,…,Xp
- Each principal component Yi is a linear combination of the Xi variables (usually
original ones in standardised form) in such a way that the rst axis (i.e., the rst
principal components) is in the direction containing most variation.

4



fi fi
€6,30
Krijg toegang tot het volledige document:

100% tevredenheidsgarantie
Direct beschikbaar na je betaling
Lees online óf als PDF
Geen vaste maandelijkse kosten

Maak kennis met de verkoper
Seller avatar
ec12
5,0
(1)

Maak kennis met de verkoper

Seller avatar
ec12 University of Cape Town
Volgen Je moet ingelogd zijn om studenten of vakken te kunnen volgen
Verkocht
5
Lid sinds
5 jaar
Aantal volgers
4
Documenten
4
Laatst verkocht
8 maanden geleden

5,0

1 beoordelingen

5
1
4
0
3
0
2
0
1
0

Waarom studenten kiezen voor Stuvia

Gemaakt door medestudenten, geverifieerd door reviews

Kwaliteit die je kunt vertrouwen: geschreven door studenten die slaagden en beoordeeld door anderen die dit document gebruikten.

Niet tevreden? Kies een ander document

Geen zorgen! Je kunt voor hetzelfde geld direct een ander document kiezen dat beter past bij wat je zoekt.

Betaal zoals je wilt, start meteen met leren

Geen abonnement, geen verplichtingen. Betaal zoals je gewend bent via Bancontact, iDeal of creditcard en download je PDF-document meteen.

Student with book image

“Gekocht, gedownload en geslaagd. Zo eenvoudig kan het zijn.”

Alisha Student

Veelgestelde vragen