100% satisfaction guarantee Immediately available after payment Both online and in PDF No strings attached 4.2 TrustPilot
logo-home
Summary

Data in Sport & Health: summary lectures

Rating
-
Sold
5
Pages
20
Uploaded on
21-03-2020
Written in
2019/2020

Summary of the lectures of the course data in sport and health.

Institution
Course










Whoops! We can’t load your doc right now. Try again or contact support.

Written for

Institution
Study
Course

Document information

Uploaded on
March 21, 2020
Number of pages
20
Written in
2019/2020
Type
Summary

Subjects

Content preview

Data in Sport & Health

Lecture 1 – an introduction to data science
Data types
Structured
- Quantitative data
- Relational what can be found in a database
- Used in decision making → <50%

Unstructured data
- Qualitative data
- Can be found in the ‘wild’ → extract properties of
data
- Analysed → <1%




Big data
Data science is a multi-disciplinary field that uses scientific methods, processes, algorithms and
systems to extract knowledge and insights from structured and unstructured.

Knowledge discoveries in databases (KDD):
Data: Raw pieces of data
- Redm 192.234.234.453.674, v2.0
Information: Data is useful, organized and structured
- South facing traffic light on corner of Pitt and George
streets has turned red
Knowledge: Information is read, heard or seen and integrated and
understood
- The traffic light I am driving towards turned red
Wisdom: Informed decision making
- I better stop the car!

,Data science vs statistics:
- Not a very strict separation
- Each has its merits, statistical foundations of data science should not be ignored
- Primary differences:
• Statistics starts with hypothesis, DS with data
• Statistics shines with limited data, DS with lots of data
- Cultural differences between statisticians and data scientists.

Statistics
Old, extensive and mature (somewhat conservative) field
- Hypothesis → data collection → statistics
- Limited data
• Just about the hypothesis
• Few data points (test subjects, ethical considerations)
- Limited data demands carefulness and rigour
• A danger of drawing unfounded conclusions
- Hypothesis testing

Data science
Relatively young, methodology still growing
- Dataset → analysis → hypothesis
- Extensive data (long and wide)
- Leave room for exploration and discovery
- Extensive data demands carefulness and rigour
• Danger of drawing unfounded conclusions
- Hypothesis generation
• Findings are not unequivocal, publishable result
• Process doesn’t stop with DS

Data science lifecycle

,

Get to know the seller

Seller avatar
Reputation scores are based on the amount of documents a seller has sold for a fee and the reviews they have received for those documents. There are three levels: Bronze, Silver and Gold. The better the reputation, the more your can rely on the quality of the sellers work.
mikeverweij Vrije Universiteit Amsterdam
Follow You need to be logged in order to follow users or courses
Sold
40
Member since
9 year
Number of followers
38
Documents
0
Last sold
1 year ago

4.4

5 reviews

5
2
4
3
3
0
2
0
1
0

Recently viewed by you

Why students choose Stuvia

Created by fellow students, verified by reviews

Quality you can trust: written by students who passed their tests and reviewed by others who've used these notes.

Didn't get what you expected? Choose another document

No worries! You can instantly pick a different document that better fits what you're looking for.

Pay as you like, start learning right away

No subscription, no commitments. Pay the way you're used to via credit card and download your PDF document instantly.

Student with book image

“Bought, downloaded, and aced it. It really can be that simple.”

Alisha Student

Frequently asked questions