Written by students who passed Immediately available after payment Read online or as PDF Wrong document? Swap it for free 4.6 TrustPilot
logo-home
Summary

Summary - Data Science and Society (INFOMDSS)

Rating
-
Sold
2
Pages
71
Uploaded on
30-03-2025
Written in
2024/2025

Full summary of all final exam parts A-E. Includes figures, equations and extra examples.

Institution
Course

Content preview

📊
Data Science and Society
Created @October 30, 2024 10:10 AM

Class INFOMDSS

Part A: Data Science & Processes,
Analytics
Data science involves analyzing data through specific frameworks and tools,
which shape perceptions of reality while presenting challenges related to time
and potential pitfalls from improper tool selection.

e.g. Twitter as a data source during a flood case: bias may arise, no
electricity means no way to communicate.

e.g. Using statistics to argue the odds of disease for baby case: incorrect
assumption of independence

e.g. Recidivist assessment case: bias towards woman and black people,
bias/unfairness




Data Science and Society 1

, Data science covers a wide range of tasks and models, including collecting
data, deployment of models and business understanding. Data science models
and insights affect individuals and society, and therefore data scientists should
be aware of these risks.




Analytics types
Descriptive analytics: answering the question of what happend, a
retrospective analysis of historic data. Done using data visualisation,
dashboards, statistics.
Predictive analytics: what is likely to happen in the future? Looking at past
data to predict the future. Done by using data mining, text mining, forecasting.

Prescriptive analytics: aims to determine the best possible decision based on
the data. Uses descriptive and predictive to create alternatives, and determines
the best one. Done by using optimization, simulation, heuristic programming.




Data Science and Society 2

, Business understanding
ML DevOps: the practice of integrating machine learning model development
with DevOps principles to streamline the deployment, monitoring, and
management of models in production, ensuring they remain efficient, reliable,
and scalable.


CRISP-DM: Cross Industry Standard Process for Data Mining

Methodology for structuring and managing data mining and data science
projects.

Business Understanding: Clearly define the project’s objectives from a
business perspective. Identifying the business problem, understanding
what the organization needs to achieve, and translating that into a data
science goal.

Data Understanding: Collect and assess the quality and characteristics of
the data. Gathering data, exploring it to discover initial insights, and
identifying any quality issues or patterns relevant to the business goal.



Data Science and Society 3

, Data Preparation: Clean, transform, and structure the data for analysis.
Involves selecting the relevant data, handling missing values, removing
noise, creating new variables (feature engineering), and preparing datasets
that are ready for modeling.

Modeling: Develop models using appropriate techniques. Modeling
techniques (e.g., regression, classification, clustering) are applied to the
data. It may involve trying out different algorithms, tuning parameters, and
selecting the best models based on performance metrics.

Evaluation: Evaluate the model and ensure it meets business objectives.
Evaluating in terms of both its accuracy and its relevance to the business
objectives. Ensures that the model’s performance is aligned with the
business problem and is not just technically good.

Deployment: Implement the model in the real-world environment. Involves
automating the model, integrating it into software systems, or presenting
results through reports or dashboard




Data Science and Society 4

Written for

Institution
Study
Course

Document information

Uploaded on
March 30, 2025
Number of pages
71
Written in
2024/2025
Type
SUMMARY

Subjects

$9.07
Get access to the full document:

Wrong document? Swap it for free Within 14 days of purchase and before downloading, you can choose a different document. You can simply spend the amount again.
Written by students who passed
Immediately available after payment
Read online or as PDF

Get to know the seller

Seller avatar
Reputation scores are based on the amount of documents a seller has sold for a fee and the reviews they have received for those documents. There are three levels: Bronze, Silver and Gold. The better the reputation, the more your can rely on the quality of the sellers work.
matthiaslouws Universiteit Utrecht
Follow You need to be logged in order to follow users or courses
Sold
79
Member since
5 year
Number of followers
35
Documents
31
Last sold
1 day ago

3.4

7 reviews

5
3
4
1
3
1
2
0
1
2

Trending documents

Recently viewed by you

Why students choose Stuvia

Created by fellow students, verified by reviews

Quality you can trust: written by students who passed their tests and reviewed by others who've used these notes.

Didn't get what you expected? Choose another document

No worries! You can instantly pick a different document that better fits what you're looking for.

Pay as you like, start learning right away

No subscription, no commitments. Pay the way you're used to via credit card and download your PDF document instantly.

Student with book image

“Bought, downloaded, and aced it. It really can be that simple.”

Alisha Student

Frequently asked questions