100% satisfaction guarantee Immediately available after payment Both online and in PDF No strings attached 4.2 TrustPilot
logo-home
Summary

Summary Data Science for Ecology

Rating
-
Sold
-
Pages
30
Uploaded on
02-05-2022
Written in
2021/2022

Summary of lectures and practicals. Includes the material for both exams (skills & concepts).

Institution
Course










Whoops! We can’t load your doc right now. Try again or contact support.

Connected book

Written for

Institution
Study
Course

Document information

Summarized whole book?
No
Which chapters are summarized?
Hoofdstukken uit colleges
Uploaded on
May 2, 2022
Number of pages
30
Written in
2021/2022
Type
Summary

Subjects

Content preview

25% written exam (concept & theory)

25% PC exam (R skills)

50% group work (50% zip, 25% group, 25% ppt)



Concepts & theory
Data science
Top down view: generating value from data



Knowledge pyramid

 Data  info  knowledge  wisdom
 Raw data  meaningful data



Blend of principles & methods

 Ecology (domain) + computer science + maths and statistics



Trends in ecological research

 Large, complex datasets
 Specialised tech
 Data driven multidisciplinary science
 Analysing patterns



OSEMN pipeline

 Obtaining data
 Scrubbing (cleaning) data
 Exploring data
 Modelling data
 INterpreting results



Effective workflow

 Clear data structure
 Concise
 Understandable
 Reproducible
 Transferable



, 1. Import
2. Tidy
3. Transform
4. Visualise
5. Model (transform & visualise again when needed)
6. Communicate



Data science (DS) vs empirical science

 DS based on scientific method
 But: not all data science = science
 Different scale
 Empirical science => small #correlations  causal?
 DS => can identify unlimited #correlations



Data driven vs hypothesis driven

 Data driven
- Inductive
- Starts with data analysis
 Hypothesis driven
- Deductive
- Starts with hypothesis



3Vs of data

 Volume
 Variety
 Velocity

But: DS project can also be based on smaller, simpler data



DS workflow
1. (acquire data)
2. Import
3. Tidy
4. Transform
5. Visualise
6. Model (transform & visualise again when needed)  already 10 steps in
itself
7. Communicate
8. (act)



Gaining insight: transform, visualise, model

, Not 1 template workflow, but: similar steps



Data preparation (1)

 Tidy data in workable format
- Table with rows & columns
- Numeric data
 Convert categorical data  dummy vars
- n classes  n-1 dummy vars
 Deal with missing data
- Remove obs (r)
- Remove vars (c)
- Data imputation
 Correct errors or noise



Feature engineering (2)

 Use domain knowledge to extract features from raw data
 Compute interpretable features/vars from tidy data
- Data mining
 Creativity
- Many features  interactions
- Logical features  simpler models
 Takes lot of time



Algorithm selection (3)

 Problem def
- Classification vs regression  prediction
- Supervised vs unsupervised
- Prediction vs interference
 Multiple algorithms per category/problem
 All algorithms optimise cost/loss function



Feature standardisation (4)

 Centering & scaling
- Standardisation = (x – mean)/sd
- Normalisation = (x – xmin)/(xmax – xmin)
 Improves fit of algorithm
 Improves inference of results



Set division (5)

 Many data points: risk of overfitting

Get to know the seller

Seller avatar
Reputation scores are based on the amount of documents a seller has sold for a fee and the reviews they have received for those documents. There are three levels: Bronze, Silver and Gold. The better the reputation, the more your can rely on the quality of the sellers work.
michouweimar Wageningen University
Follow You need to be logged in order to follow users or courses
Sold
48
Member since
5 year
Number of followers
33
Documents
34
Last sold
1 month ago

3.0

5 reviews

5
0
4
1
3
3
2
1
1
0

Recently viewed by you

Why students choose Stuvia

Created by fellow students, verified by reviews

Quality you can trust: written by students who passed their tests and reviewed by others who've used these notes.

Didn't get what you expected? Choose another document

No worries! You can instantly pick a different document that better fits what you're looking for.

Pay as you like, start learning right away

No subscription, no commitments. Pay the way you're used to via credit card and download your PDF document instantly.

Student with book image

“Bought, downloaded, and aced it. It really can be that simple.”

Alisha Student

Frequently asked questions