100% satisfaction guarantee Immediately available after payment Both online and in PDF No strings attached 4.2 TrustPilot
logo-home
Class notes

Videolectures notes week 1-3

Rating
-
Sold
-
Pages
21
Uploaded on
25-02-2020
Written in
2019/2020

Videolectures 1-3 of the course Data Mining for Business and Governance

Institution
Course










Whoops! We can’t load your doc right now. Try again or contact support.

Written for

Institution
Study
Course

Document information

Uploaded on
February 25, 2020
Number of pages
21
Written in
2019/2020
Type
Class notes
Professor(s)
Unknown
Contains
All classes

Subjects

Content preview

Video lecture 1

What is Data Science?
= Data science is a "concept to unify statistics, data analysis and their related methods" in order to
"understand and analyze actual phenomena" with data.




What makes a Data Scientist?
Data scientists use their data and analytical ability to find and interpret rich data sources; manage large
amounts of data; create visualizations to aid in understanding data; build mathematical models using the
data; and present and communicate the data insights/findings.




Machine learning = we want the machines
to be better than humans

Data Mining = dealing with data /
pre-processing / requires you to visualise
data




One commonality between all these fields = ​data-driven science




1

,What is data?
Example




Given these three conditions, what will the child do?

- We don't have enough data, so at one point we have to make a
prediction
- We have to convert the data into numerical representations, so
that the computer can process the data




Converting into numerical representations:
● Sunny = 1
● Cloudy = 0
● Rainy = 2

Binary representations:
● Yes = 1
● No = 0

Features = attributes

We can also convert the data into specific measurements...




Or visualize the data...




2

, Interpreting data

Algorithms look for rules, to be able to predict some kind of behavior / actions




Formally notations



Our prediction = y-​hat
Our target = y

- The difference is that ​y ​is given​, and ​y-hat is
a ​prediction of a model and therefore not
necessarily correct




The notations applied to the example above:




3

Get to know the seller

Seller avatar
Reputation scores are based on the amount of documents a seller has sold for a fee and the reviews they have received for those documents. There are three levels: Bronze, Silver and Gold. The better the reputation, the more your can rely on the quality of the sellers work.
ioumi Tilburg University
Follow You need to be logged in order to follow users or courses
Sold
11
Member since
12 year
Number of followers
9
Documents
1
Last sold
4 year ago
BSc Political Science: International Relations / MSc Data Science & Society

Hi there! I studied the BSc Political Science with a specialization in IR at the Vrije Universiteit Amsterdam. Currently I'm studying Data Science & Society at Tilburg University. Writing summaries has always be my way of learning: hopefully my documents will make the exam period easier for you!

5.0

1 reviews

5
1
4
0
3
0
2
0
1
0

Recently viewed by you

Why students choose Stuvia

Created by fellow students, verified by reviews

Quality you can trust: written by students who passed their tests and reviewed by others who've used these notes.

Didn't get what you expected? Choose another document

No worries! You can instantly pick a different document that better fits what you're looking for.

Pay as you like, start learning right away

No subscription, no commitments. Pay the way you're used to via credit card and download your PDF document instantly.

Student with book image

“Bought, downloaded, and aced it. It really can be that simple.”

Alisha Student

Frequently asked questions