100% tevredenheidsgarantie Direct beschikbaar na je betaling Lees online óf als PDF Geen vaste maandelijkse kosten 4.2 TrustPilot
logo-home
Samenvatting

Summary - Digital Methods (K001194A)

Beoordeling
-
Verkocht
4
Pagina's
89
Geüpload op
04-08-2023
Geschreven in
2022/2023

Full summary Digital Methods: All (guest) lectures and notes,












Oeps! We kunnen je document nu niet laden. Probeer het nog eens of neem contact op met support.

Documentinformatie

Geüpload op
4 augustus 2023
Aantal pagina's
89
Geschreven in
2022/2023
Type
Samenvatting

Voorbeeld van de inhoud

DIGITAL METHODS
CL AS S 1: INT RODUCT ION DIG IT AL M ETHODS


DIGITAL METH ODS: CL OSE READING, DIS TANT READIN G AN D CO MMON CHARACT ERIS TICS OF B IG DATA


• Data: online activities
• Can have risks: like overgeneralizing


SITUATIN G TH E COURS E

“The best way to create this powerful hybrid is not to focus on abstract social theory or fancy machine learning. The best
place to start is research design. If you think of social research as the process of asking and answering questions about
human behavior, then research design is the connective tissue; research design links questions and answers (Salganik, 2017
p.6).”


CLOSE READING (QUALITATIVE)

• Offline-online continuum

• Danah Boyd  making sense of teen life
o Research approach
 Immersion in teen pop culture and subculture
= Consuming media that is popular among teens
 Participant observation and content analysis of traces on social media
= Digital traces: observe behavior on social media that is public/semi-public (looking at things
that are trending)
 Deep hanging out in physical spaces
= Deep understanding of what they are doing (going to playgrounds, schools, malls)
 Semistructured face-to-face interviews (with teens)

Social media certainly make it much easier to peek into people’s lives, but it is also quite easy to misinterpret online traces.

Example: Gang isignia on MySpace profile –> indication of Gang involvement?

• Essay on gangs: someone from a gang wrote essay about wanting to leave the gang and go to college (Ivy League
College)
• BUT His MySpace account reveals that he is still involved in the gang community

> Using online content to get more information, but you need to take the content into account

• Close reading: detailed analysis of content is necessary
• Carefully reading and reflecting on a piece of content

ALTERNATIVE EXPLANATION:


“Without knowing the specific boy involved, I surmised that he was probably focused on fitting in, staying safe, or, more
directly, surviving in his home environment. Most likely he felt as though he needed to perform gang affiliation online—
especially if he was not affiliated—in order to make certain that he was not physically vulnerable.’


1

, • Importance of close reading: interpretation based on detailed analysis
o importance of context in combination with quantitative data
o Try to understand the meaning (close reading  carefully reading + reflecting)


OVERVIEW

• Lecture 7: Walk-through, going along and scrolling back
• Lecture 8: Digital ethnography I
• Lecture 9: Digital ethnography II


DISTANT READING (QUANTITATIVE)

“Big Data”: Advances in technology & analysis

• huge amounts of data left behind by people & furthermore aggregated by companies
- Data that is left behind
- How that data is being used/analysed by big tech corporations
- Not about one datasets but about multiple datasets and combinations of data sets
 Gather/analyze and link datasets
 Using the large datasets to identify trends/patterns  make conclusions

• Example: Investigating health and wealth in Rwanda
- Traditional social science survey: asking people specific questions (demographics)
- Call records (of approx. 1,5 million people)
- combined the 2 data sources: used survey to train machine learning model to predict a person’s wealth
 Result: high resolution maps of the distribution of wealth in Rwanda – combining 2 data
sources

Importance of distant reading: analyzing large numbers of data(sets)

OVERVIEW

• Lecture 2: Computational Social Science and Open Science
• Lecture 3: Data Vizualization
• Lecture 4: Collecting data from the web
• Lecture 5: The use of AI in Social Science Research
• Lecture 6: Working with Sensors and VR
• Lecture 10: The rise of the person-specific paradigm
• Lecture 11: Digital trace data

Practicals

• Practical 1: Assignment information and Installing R
• Practical 2: Getting acquainted with R
• Practical 3: Visualizing data with ggplot
• Practical 4: Webscraping and APIs
• Practical 5: working with sensors and VR
• Practical 6: Walk-through method
• Practical 7: Working with intensive longitudinal data


READYMADE VERSUS CUSTOMMADE DATA

• Readymade: Duchan (artist): takes an ordinary object and repurposes that as art (urinal)
• Custommade: Michelangelo (artist) statue of David (from Michelangelo) more than 3 years of labeling


2

, • Same when talking about data
o Readymade  repurpose data sets
o Custommade  start your own data set to answer research question


10 CHARACTERIS TICS OF BIG DAT A SOURC ES (DIVIDED IN HEL PF UL AND PROBL EMAT IC )

Digital traces  Big Data  Digital methods

• Generally helpful for research:
- Big
- always-on
- nonreactive
• Generally problematic for research:
- Incomplete
- Inaccessible
- Nonrepresentative
- Drifting
- algorithmically confounded
- dirty
- sensitive

Big data

• Repurposing
• Found versus designed data
- Found: data that has been found by researchers but designed by someone else
- Designed: data designed by researcher to then use it in their research
• What should the ideal data set look like?
• ‘Twitter’ versus ‘social survey’ data


1. BIG (HELPFUL)

• Is all that data really doing anything?
- More data does not always mean better data
• Big datasets are never an end in themselves, but do allow for the study of rare cases, detection small differences,
and estimation of heterogeneity

Example: Analysis of the 2016 US Presidential Campaign on Twitter (Kollanyi, Howard, Woolley, 2016)

• Total of 18 910 250 tweets were analyzed (analyzed 2 hashtags)
- 39.1% debat tweets pro Trump hashtag (e.g., #MAGA)
- 13.6% debat tweets pro Clinton (e.g., #ImWithHer)
• However… (many accounts were not real)
- 32.7% pro tweets Trump originated from bots
- 22.3% pro tweets Clinton originated from bots


2. ALWAYS-ON (HELPFUL)

(we are living in media, always connected, constant stream of data)

• Unexpected events
• Real-time measurements




3

, Example: hurricane Sandy-related Twitter and Foursquare data

• Data scientist combined foursquare data and Twitter data
• Surprising findings (that could otherwise not be found)
- People went out grocery shopping more the night before the storm
- People went out after the storm (kind of celebrating that it is over)


3. NONREACTIVE (HELPFUL)

• Measurement in big data sources is much less likely to change behavior.
- People not being aware that their data is being captured: so they do not change their behavior
• However, social desirability bias can still be present
- not neutral platforms
- example: social media: focusing on the good things in our life (personal milestones,….)


4. INCOMPLETE (PROBLEMATIC)

• Usually the following information is missing/ incomplete
- Demographic information about participants
- Behavior on other platforms
- Data to operationalize theoretical constructs
• Construct validity
o a type of validity that refers to the degree to which a measurement tool or method accurately measures
the underlying construct or concept that it is intended to measure.

Example: measuring social capital

• Articulated networks  contacts
• Behavioral networks  communication
• Construct validity?
- For example: spending more time on the phone with your colleague does not imply that they are more
important that spouses
- Will not always be perfect: context is important


5. INACCESSIBLE (PROBLEMATIC)

• Data held by companies and governments are difficult for researchers to access.

Examples: ‘Facebook reportedly provided inaccurate data of misinformation researchers

• Sometimes only semipublic available


6. NONREPRESENTATIVE (PROBLEMATIC)

• Nonrepresentative data are bad for out-of-sample generalizations, but can be quite useful for within-sample
comparisons.




4

Maak kennis met de verkoper

Seller avatar
De reputatie van een verkoper is gebaseerd op het aantal documenten dat iemand tegen betaling verkocht heeft en de beoordelingen die voor die items ontvangen zijn. Er zijn drie niveau’s te onderscheiden: brons, zilver en goud. Hoe beter de reputatie, hoe meer de kwaliteit van zijn of haar werk te vertrouwen is.
alinebruckner Universiteit Gent
Bekijk profiel
Volgen Je moet ingelogd zijn om studenten of vakken te kunnen volgen
Verkocht
81
Lid sinds
4 jaar
Aantal volgers
39
Documenten
12
Laatst verkocht
2 maanden geleden

4,0

1 beoordelingen

5
0
4
1
3
0
2
0
1
0

Recent door jou bekeken

Waarom studenten kiezen voor Stuvia

Gemaakt door medestudenten, geverifieerd door reviews

Kwaliteit die je kunt vertrouwen: geschreven door studenten die slaagden en beoordeeld door anderen die dit document gebruikten.

Niet tevreden? Kies een ander document

Geen zorgen! Je kunt voor hetzelfde geld direct een ander document kiezen dat beter past bij wat je zoekt.

Betaal zoals je wilt, start meteen met leren

Geen abonnement, geen verplichtingen. Betaal zoals je gewend bent via Bancontact, iDeal of creditcard en download je PDF-document meteen.

Student with book image

“Gekocht, gedownload en geslaagd. Zo eenvoudig kan het zijn.”

Alisha Student

Veelgestelde vragen