Summary

Summary Machine Learning

Uploaded on

10-01-2024

Written in

2022/2023

Summary of all lectures, supplemented with necessary information from the book.

Connected book

Gareth James, Daniela Witten, Trevor Hastie, Robert Tibshirani: An Introduction to Statistical Learning

Edition: 2021
ISBN: 9781071614181

Written for

Institution: Wageningen University (WUR)
Study: Bioinformatics
Course: Machine Learning (FTE35306)

Document information

Summarized whole book?: No
Which chapters are summarized?: Chapters covered in the lectures
Uploaded on: January 10, 2024
Number of pages: 61
Written in: 2022/2023
Type: Summary

Subjects

algorithms
supervised learning
unsupervised learning

Content preview

Week 1
Introduction
The course
Lectures with pen & paper exercises

Lab sessions

Project days

Grade

• 50% project (report & code)
• 50% written exam

Machine learning
Supervised learning => learning the relationship (f) between input (x) & output (y)
based on training data

• Classification
• Regression

Methods for classification

• Logistic regression
• K-nearest neighbours
• Linear/quadratic discriminant analysis
• Decision trees/random forest
• Support vector machines
• Neural networks

Methods for regression

• Linear regression
• Decision trees/random forest
• Neural networks

Unsupervised learning => learning structure in training data without an output
variable to predict

• Clustering
• Structure

Methods for clustering

• K-means
• Expectation maximisation
• Hierarchical clustering

Methods for dimensionality reduction

• Principal component analysis

How to optimally use training/test data?

• Resampling: cross-validation, bootstrapping

Statistical learning (chapter 2)
Statistical learning
Estimating f

• Income = y = response variable
• Years of education = x = predictor
• Unknown relationship between x & y = f
• Random error with mean 0 = E (ε in the book)
- Part of y not explained by f
- Black bars in the figure
• Can also be multivariate
• More than 2 input dimensions (x)
- Number of input dimensions = p
- Number of data points = n
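
The setup above can be sketched with simulated data. This is only an illustration: the function `true_f`, the noise level and the range of x are made-up assumptions, not values from the lecture or the book. In real data f is unknown; here it is fixed so that the error term can be inspected directly.

```python
import random
import statistics

def true_f(years_of_education):
    # Hypothetical "true" relationship, chosen only for this illustration.
    return 20.0 + 3.0 * years_of_education

def simulate(n, noise_sd=5.0, seed=0):
    """Draw n observations with y = f(x) + E, where E has mean 0."""
    rng = random.Random(seed)
    xs = [rng.uniform(10.0, 22.0) for _ in range(n)]   # predictor x
    eps = [rng.gauss(0.0, noise_sd) for _ in range(n)] # random error E
    ys = [true_f(x) + e for x, e in zip(xs, eps)]      # response y
    return xs, ys, eps

xs, ys, eps = simulate(n=2000)

# The error is the part of y that f does not explain; its average is close to 0.
residuals = [y - true_f(x) for x, y in zip(xs, ys)]
mean_error = statistics.fmean(residuals)
print(f"n = {len(xs)}, mean error = {mean_error:.3f}")
```

Here p = 1 (one predictor) and n = 2000 data points.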

Prediction

• y = f(x) + E
- y & f usually unknown
- Estimate f to predict y from known x values → ŷ = f̂(x)
- f estimated using training data
- Error term E
• Error of the model
- Estimated from data set = mean squared error
• Reducible & irreducible error
- Reducible error => can be reduced by applying a more appropriate
learning technique & model, or by adding more training data
- Irreducible error => cannot be reduced, because relevant input is
unmeasured or there is unmeasurable variation
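
A minimal sketch of the two error types, assuming a made-up quadratic `true_f` and noise with standard deviation 1 (both are assumptions for illustration only). A straight line is fitted with least squares and compared with the true f on fresh test data. For a fixed estimate, the expected squared error splits into a reducible part, (f(x) - f̂(x))², and an irreducible part, Var(E); the difference between the two test MSEs below approximates the reducible part.

```python
import random

def true_f(x):
    # Hypothetical true relationship (assumed for illustration only).
    return 2.0 + 0.5 * x * x

def mse(ys, preds):
    """Mean squared error: average squared difference between y and its prediction."""
    return sum((y - p) ** 2 for y, p in zip(ys, preds)) / len(ys)

def fit_line(xs, ys):
    """Least squares fit of y = a + b*x (closed form for a single predictor)."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    b = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sum((x - mx) ** 2 for x in xs)
    return my - b * mx, b

rng = random.Random(1)
noise_sd = 1.0  # standard deviation of E, so Var(E) = 1.0

def sample(n):
    xs = [rng.uniform(-3.0, 3.0) for _ in range(n)]
    ys = [true_f(x) + rng.gauss(0.0, noise_sd) for x in xs]
    return xs, ys

x_train, y_train = sample(500)
x_test, y_test = sample(5000)

a, b = fit_line(x_train, y_train)
mse_line = mse(y_test, [a + b * x for x in x_test])  # model that is too simple
mse_true = mse(y_test, [true_f(x) for x in x_test])  # perfect f: only E remains

print(f"test MSE of fitted line: {mse_line:.2f}")
print(f"test MSE of true f (irreducible, close to Var(E)): {mse_true:.2f}")
print(f"approximate reducible error of the line: {mse_line - mse_true:.2f}")
```

Even the true f cannot get below roughly Var(E); a more appropriate model can only remove the gap between the two numbers.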

Inference

• Again estimate f
- But now: understand how x affects y
• Prediction vs inference
- Prediction => estimate f to get good predictions
- Inference => estimate f to get understanding

Prediction accuracy vs model interpretability

• Linear models => high interpretability & sometimes high accuracy
• Highly non-linear models => low interpretability, high accuracy
• Choice depends on whether the goal is prediction or inference
- Prediction → more likely non-linear
- Inference → more likely linear

Parametric vs non-parametric

• Parametric
- Choose functional form of f
- Learn parameters of f from training data using least squares or
a different method

😊 easier to estimate a set of parameters than to fit an arbitrary function →
less training data needed

☹ if chosen functional form is too far from the truth → results can be poor

• Non-parametric
- No assumptions about functional form of f
- Estimate of f should get close to the data points

😊 potential good fit, even if input-output relations are complex

☹ requires much more training data, risk of overfitting
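
The contrast can be sketched on simulated data. Assumptions for illustration only: the truth is a sine curve, the parametric model is a straight line fitted with least squares, and the non-parametric method is k-nearest-neighbours averaging with k = 10. None of these choices come from the lecture notes.

```python
import math
import random

def true_f(x):
    # Hypothetical non-linear truth, assumed only for this illustration.
    return math.sin(x)

def sample(rng, n, noise_sd=0.2):
    xs = [rng.uniform(0.0, 2.0 * math.pi) for _ in range(n)]
    ys = [true_f(x) + rng.gauss(0.0, noise_sd) for x in xs]
    return xs, ys

def fit_line(xs, ys):
    """Parametric: assume f(x) = a + b*x and learn a, b by least squares."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    b = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sum((x - mx) ** 2 for x in xs)
    return my - b * mx, b

def knn_predict(x0, xs, ys, k):
    """Non-parametric: average the y values of the k training points nearest to x0."""
    nearest = sorted(zip(xs, ys), key=lambda pair: abs(pair[0] - x0))[:k]
    return sum(y for _, y in nearest) / k

def mse(ys, preds):
    return sum((y - p) ** 2 for y, p in zip(ys, preds)) / len(ys)

rng = random.Random(42)
x_train, y_train = sample(rng, 300)
x_test, y_test = sample(rng, 500)

a, b = fit_line(x_train, y_train)
mse_line = mse(y_test, [a + b * x for x in x_test])
mse_knn = mse(y_test, [knn_predict(x, x_train, y_train, k=10) for x in x_test])

print(f"test MSE, straight line: {mse_line:.3f}")
print(f"test MSE, 10-nearest neighbours: {mse_knn:.3f}")
```

The straight line needs only two parameters but its functional form is far from the sine curve, so its test error stays high. The nearest-neighbour estimate makes no assumption about the form of f and follows the curve, at the price of needing enough training points near every x.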

Supervised & unsupervised

• Supervised learning => based on n training examples with p input
dimensions & 1 output (y), fit y = f(x) + E
• Unsupervised learning => n training examples with p input dimensions,
no corresponding outputs (y)
- Find structure in data: clustering or dimensionality reduction

Regression & classification

• Regression
- Response is quantitative (i.e. numerical)
• Classification
- Response is qualitative/categorical

Accuracy of a model

Get to know the seller

michouweimar
