Clustering: grouping data points based on similarity
Data points within a cluster are similar to each other, and dissimilar to data points in other clusters
→ useful to find groups that are assumed to exist in reality (e.g. vegetation type, animal behaviour)
Clustering = partitioning (same term)
- Clustering is not about revealing gradients
→ ordination is about revealing gradients
→ clustering is about detecting discrete groups with small differences between members
- Clustering is not the same as classification
→ classification is about assigning observations to predefined groups based on known labels
Similarity and dissimilarity are essential components of clustering analysis
Distance between pairs of:
→ points
→ clusters of points
Types of clustering:
- Flat clustering (K-means clustering): creates a flat set of clusters without any structure
- Hierarchical clustering: creates a hierarchy of clusters (thus with internal structure)
Flat clustering (K-means)
K-means: the simplest clustering algorithm, where we must define a target number K, which refers to
the number of means (centers) around which we want to partition our dataset.
→ Each observation is assigned to the cluster with the nearest mean
→ Only deals with differences between clusters, not with the structure within clusters
Steps:
1. Randomly locate initial cluster centers
2. Assign records to nearest cluster mean
3. Compute new cluster means
4. Repeats 2 & 3 a few iterations
→ new data points can be assigned to the cluster with the nearest center
→ disadvantage: the number of clusters K has to be chosen by eye
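A minimal NumPy sketch of steps 1–4 above; the toy data, K = 3 and the fixed number of iterations are assumptions for illustration only:

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(150, 2))   # toy data: 150 observations, 2 variables (assumption)
K = 3                           # target number of clusters (assumption)

# 1. Randomly locate initial cluster centers (here: K random observations)
centers = X[rng.choice(len(X), size=K, replace=False)]

for _ in range(10):             # 4. repeat steps 2 & 3 a few iterations
    # 2. Assign each record to the nearest cluster mean (Euclidean distance)
    dists = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
    labels = dists.argmin(axis=1)
    # 3. Compute new cluster means (a full implementation would also handle
    #    the rare case of a cluster losing all its members)
    centers = np.array([X[labels == k].mean(axis=0) for k in range(K)])

# New data points can be assigned to the cluster with the nearest center
new_point = np.array([0.5, -0.2])
new_label = np.linalg.norm(centers - new_point, axis=1).argmin()
```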
Learning algorithm: an algorithm that learns; it tries a few times (iterations from random starting
centers) and then settles on an outcome.
→ does not necessarily result in exactly the same outcome when the analysis is repeated, because the initial centers are random
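In practice the same steps can be run with scikit-learn's KMeans; this sketch assumes toy data and uses n_init and random_state to deal with the random initialisation mentioned above:

```python
import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(0)
X = rng.normal(size=(150, 2))   # toy data (assumption)

# n_init reruns K-means from several random initial centers and keeps the best
# solution; fixing random_state makes a repeated analysis reproducible.
km = KMeans(n_clusters=3, n_init=10, random_state=42).fit(X)
labels = km.labels_             # cluster assignment of each observation
centers = km.cluster_centers_   # the K cluster means
```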
Hierarchical clustering
Hierarchical clustering does not require us to pre-specify the number of clusters to be generated, and results in a dendrogram
Dendrogram: tree-like diagram that records the sequence of merges or splits
Root node: top node to which all samples belong
Leaf (terminal node): cluster with only one sample
→ the similarity of two observations is based on the height at which the branches containing those two observations are first fused
→ we cannot use the proximity of two observations along the horizontal axis for similarity
Types of hierarchical clustering:
- Agglomerative clustering (merges): builds nested clusters by merging smaller clusters with a bottom-up approach
- Divisive clustering (splits): builds nested clusters by splitting larger clusters with a top-down approach
Disadvantage: when a new data point is added, the entire dendrogram needs to be recalculated
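A minimal sketch of agglomerative clustering with SciPy; the toy data and the average-linkage choice are assumptions (linkage methods are discussed below):

```python
import numpy as np
import matplotlib.pyplot as plt
from scipy.cluster.hierarchy import linkage, dendrogram, fcluster

rng = np.random.default_rng(1)
X = rng.normal(size=(20, 2))            # small toy data set (assumption)

# Bottom-up (agglomerative): every observation starts as its own leaf and the
# two closest clusters are merged step by step until only the root remains.
Z = linkage(X, method="average")

# The dendrogram records the sequence of merges; the fusion height of two
# observations reflects their dissimilarity.
dendrogram(Z)
plt.show()

# The number of clusters is chosen afterwards by cutting the tree, so it does
# not have to be pre-specified.
labels = fcluster(Z, t=3, criterion="maxclust")
```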
Similarity and dissimilarity
Distance between pairs (computed in the sketch below):
- Euclidean: "as the crow flies" (straight-line distance)
- Manhattan: "as the taxi drives" → distance along the axes
- Jaccard: intersection/union, relative similarity → for binary (presence/absence) data
  → Jaccard distance of 0.67 → 4 out of 6 species differ between the sites
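The three distances in a minimal SciPy sketch; the coordinate pair and the species presence/absence vectors are assumptions for illustration:

```python
from scipy.spatial.distance import euclidean, cityblock, jaccard

a, b = [1.0, 2.0], [4.0, 6.0]
euclidean(a, b)    # 5.0 -> "as the crow flies"
cityblock(a, b)    # 7.0 -> Manhattan: distance along the axes

# Jaccard distance on binary presence/absence data: of the 6 species that
# occur at either site, 4 differ between the sites -> 4/6 = 0.67
site1 = [1, 1, 1, 1, 0, 0]
site2 = [1, 1, 0, 0, 1, 1]
jaccard(site1, site2)   # 0.666...
```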
If variables differ in measurement units (e.g. temperature and weight), scale the columns to mean = 0, sd = 1
→ same scale, so no single variable dominates the distances
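A quick sketch of column scaling; the toy temperature/weight matrix is an assumption:

```python
import numpy as np
from sklearn.preprocessing import StandardScaler

# Two variables on very different scales: temperature (°C) and weight (g)
X = np.array([[12.0, 5200.0],
              [18.0, 4800.0],
              [25.0, 6100.0]])

# Scale each column to mean = 0, sd = 1
X_scaled = StandardScaler().fit_transform(X)
# by hand: (X - X.mean(axis=0)) / X.std(axis=0)
```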
Linkage: how we quantify the dissimilarity between clusters (compared in the sketch after this list):
- Single: minimum distance between clusters
  - Often leads to clusters of different sizes
  - Shape of clusters can become elongated
- Complete: maximum distance between clusters
  - Clusters become more compact
- Average: average distance between clusters
  - Handles outliers and noise well
- Ward: minimum variance method
  - Leads to more uniformly sized clusters
  - More difficult to compute, thus slower for large datasets
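The four linkage methods side by side in SciPy, on the same assumed toy distance matrix:

```python
import numpy as np
from scipy.cluster.hierarchy import linkage
from scipy.spatial.distance import pdist

rng = np.random.default_rng(2)
X = rng.normal(size=(30, 2))    # toy data (assumption)
D = pdist(X)                    # condensed matrix of pairwise Euclidean distances

# Same distances, different definitions of between-cluster dissimilarity:
Z_single   = linkage(D, method="single")    # minimum distance between clusters
Z_complete = linkage(D, method="complete")  # maximum distance between clusters
Z_average  = linkage(D, method="average")   # average distance between clusters
Z_ward     = linkage(D, method="ward")      # minimum-variance (Ward) criterion
```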