100% satisfaction guarantee Immediately available after payment Both online and in PDF No strings attached 4.6 TrustPilot
logo-home
Summary

Summary Data Analytics (INFOB3DA) 2020/2021

Rating
1.0
(2)
Sold
8
Pages
40
Uploaded on
23-02-2021
Written in
2020/2021

Summary Data Analytics (INFOB3DA) 2020/2021 - 8,3 met with this summary - no guarantee for english students

Institution
Module











Whoops! We can’t load your doc right now. Try again or contact support.

Written for

Institution
Study
Module

Document information

Uploaded on
February 23, 2021
Number of pages
40
Written in
2020/2021
Type
Summary

Subjects

Content preview

Data Analytics
HC1: Introduction to the course
Course construction

1. Basics
a. Introduction
b. Definitions & concepts
c. Data foundation
2. Data mining
a. Classification
b. Clustering & outer analysis
c. Association rules
3. Visualization
a. Human perception
b. Design of data visualization techniques
c. Visualization techniques for non-spatial data
d. Visualization techniques for temporal data
e. Visualization techniques for geo-spatial data
f. Visualization techniques for 3D Spatial data
You might be aware of is that huge amounts of data automatically connected whether you are
• Supermarket, YouTube, Netflix

Knowledge Discovery in Databases (KDD): the process of (semi-) automatic extraction of
knowledge from databases which is
- Valid: there’s somehow a model from which I can derive this knowledge and you can prop
that model several times with the same input and it shows the same output
- Previously unknown
- And potentially useful

Interdisciplinary field:
Database systems:
• Scalability for large datasets
• Integration from different sources
• Novel data types (e.g., text)
Data statistics
• Probabilistic knowledge (certainty and uncertainty)
• Model-based inferences
• Evaluation of knowledge
Machine learning
• Different paradigms of learning
• Supervised learning
• Hypothesis spaces and search strategies

,KDD Process Model

,Hands-on-questions

Bioinformatics: What is the data mining task?
A. Classification
B. Clustering
C. Association Rules

A: classification. You have to ask: what do you
want with this data? All answers are somehow
correct if you know what you want, and you can
argument it.

Network Security: What is the data mining task?
A. Classification
B. Clustering
C. Association Rules

B. Clustering (detection), and association rules
is also a possible solution


Visualization: data is coming in; you
visualize it and you gain insights

Visual analytics: computers are incredibly
fast, accurate, and stupid, humans are
incredibly slow, inaccurate, and brilliant,
together they are powerful beyond
imagination.

How to design good visualizations?
What are the goals of visualization?
Presentation
• Starting point: facts to be presented are fixed a priority
• Process: choice of appropriate presentation techniques
• Result: high-quality visualization of the data to present facts

Confirmatory Analysis
• Starting point: hypotheses about data
• Process: goal-oriented examination of the hypotheses
• Result: visualization of data to confirm or reject the hypotheses

Exploratory analysis
• Starting point: no hypotheses about the data
• Process: interactive usually undirected search for structures, trends
• Result: visualization of data to lead to hypotheses about the data

What is visualization?
• Visualization is the process of presenting data in a form that allows rapid understanding
of relationships and findings that are not readily evident frow raw data (National Center
for Statistics and Analysis)
• The use of computer-generated, interactive, visual representations of abstract data to
amplify cognition (Card, Mackinlay, Shneiderman)

, Visual Analytics

Reviews from verified buyers

Showing all 2 reviews
2 year ago

3 year ago

Not top... The preview seems good, but the rest of the document is very incomplete and cluttered..

1.0

2 reviews

5
0
4
0
3
0
2
0
1
2
Trustworthy reviews on Stuvia

All reviews are made by real Stuvia users after verified purchases.

Get to know the seller

Seller avatar
Reputation scores are based on the amount of documents a seller has sold for a fee and the reviews they have received for those documents. There are three levels: Bronze, Silver and Gold. The better the reputation, the more your can rely on the quality of the sellers work.
marreslikker Universiteit Utrecht
Follow You need to be logged in order to follow users or courses
Sold
53
Member since
4 year
Number of followers
39
Documents
11
Last sold
8 months ago
Summaries for Information Science Bachelor at the Utrecht University

Hi! I\'m selling all of my Summaries for Information Science Bachelor at the Utrecht University. My average grade for the last study year has been 8+ so I decided to help you with sharing my summaries. I normally never do this, but hopefully it will be helpful. Please leave a rating!

3.2

12 reviews

5
3
4
4
3
1
2
0
1
4

Why students choose Stuvia

Created by fellow students, verified by reviews

Quality you can trust: written by students who passed their exams and reviewed by others who've used these revision notes.

Didn't get what you expected? Choose another document

No problem! You can straightaway pick a different document that better suits what you're after.

Pay as you like, start learning straight away

No subscription, no commitments. Pay the way you're used to via credit card and download your PDF document instantly.

Student with book image

“Bought, downloaded, and smashed it. It really can be that simple.”

Alisha Student

Frequently asked questions