100% satisfaction guarantee Immediately available after payment Both online and in PDF No strings attached 4.2 TrustPilot
logo-home
Summary

Summary Data Analytics and Privacy (R_DAP)

Rating
-
Sold
7
Pages
28
Uploaded on
25-01-2023
Written in
2022/2023

In this file, I have summarized all the exam-related material. This includes a summary per week of the lecture material along with a summary of the literature and a summery per week of each tutorial. To top that off, in week 6, I have added the tutorial assignment with all the answers (this is highly important for the exam). And in week 7, I have added the mock exam questions with all the right answers. Good luck with the exam ;)

Show more Read less
Institution
Course










Whoops! We can’t load your doc right now. Try again or contact support.

Written for

Institution
Study
Course

Document information

Uploaded on
January 25, 2023
Number of pages
28
Written in
2022/2023
Type
Summary

Subjects

Content preview

Data Analytics and Privacy (R_DAP)
All Lectures and Tutorials Summary

,Lecture 1 - Summary
Course introduction, overview, and why privacy is important
Data analysis is a process of inspecting, cleansing, transforming and modelling data with the goal of
discovering useful information, informing conclusions and supporting decision-making.

Data science is an interdisciplinary field that uses scientific methods, processes, algorithms and systems to
extract knowledge and insights from many structural and unstructured data.

Data collection and pre-processing are always the first steps of a Business Analytics project

• Descriptive (what happened?) → just grasping is what is in your data
- Activities
- Results
• Diagnostic (Why did it happen?) → whatever we can understand from the data
- Content correlations
- W/L analysis
• Predictive (What will happen next?)
- Lead scoring
- Sales forecast
• Prescriptive (How can we make it happen?)
- Content recommendations based on passed activities & demographics
- Opportunity prioritization
Goal: reach the prescriptive state


Data
Big data is data with 3Vs
1. Volume - Enormous amounts of data (zettabytes)
2. Velocity - Real time stream of data
3. Variety - Data from a range of sensors, with different types


Problems with big data
What makes privacy of Big Data a problem different to traditional privacy? Scale!

- Lack of control and transparency (about what is being collected from us and what is happening with it)
- Data reusability (data is used for other things than the initial purpose)
- Data inference and re-identification

Most BA projects do not involve big data, but use with relatively small and structured data sets.

Structured data sets:
Used by most predictive techniques. Usually consists of entries (e.g. people) with attributes (e.g., name,
income, sex, nationality).

Unstructured data sets:
Has no structure. It might be data from cameras, social media sites, text entered in free text fields, etc..
Unstructured data is the majority of the data that is stored today, and it is often also big data. When
working with unstructured data, the first step is often to extract features to make it structured and
therefore suitable as input for an algorithm working with structured data (e.g., images from road-side
cameras are used to extract license plates which are then used to analyze the movement of cars).

, Tutorial 1 - Tutorial Notes
Privacy is Dead! Long Live Privacy!
Workgroup Discussion:
1. In what ways could data compromise our autonomy? Our human dignity? Our
rationality?
2. Are there ‘no-go’ areas for computer scientists? Should there be?
3. What role for law in computer science? What role for computer science in law?
4. Where should the intervention of law be in building digital technology?

Tutorial attendees will be asked to think about the design of an app (description will be
provided). Students will be asked to identify what parts of their lives might be
compromised by the design of the app.




Important questions to think about:
What app data can infer what private data? For example:
• Location data can infer religious data (if someone is at the location of a church every Sunday)
• Diet + physical + medical data can infer religion (if someone is not eating for an entire day during
Ramadan)

Apps get a lot of data, and each data combination can infer something as well, like habits, religion, diets.

Speed of typing becomes a diagnostic test, people who are typing at a certain rate can have cross
references with a dementia patient.

Get to know the seller

Seller avatar
Reputation scores are based on the amount of documents a seller has sold for a fee and the reviews they have received for those documents. There are three levels: Bronze, Silver and Gold. The better the reputation, the more your can rely on the quality of the sellers work.
tigovangerven Vrije Universiteit Amsterdam
Follow You need to be logged in order to follow users or courses
Sold
52
Member since
4 year
Number of followers
31
Documents
40
Last sold
8 months ago
Artificial Intelligence Bachelor at the VU

4.4

5 reviews

5
4
4
0
3
0
2
1
1
0

Recently viewed by you

Why students choose Stuvia

Created by fellow students, verified by reviews

Quality you can trust: written by students who passed their tests and reviewed by others who've used these notes.

Didn't get what you expected? Choose another document

No worries! You can instantly pick a different document that better fits what you're looking for.

Pay as you like, start learning right away

No subscription, no commitments. Pay the way you're used to via credit card and download your PDF document instantly.

Student with book image

“Bought, downloaded, and aced it. It really can be that simple.”

Alisha Student

Frequently asked questions