Resumen

Large scale analysis of biomedical data summary

Puntuación

Vendido

Páginas

Subido en

28-10-2025

Escrito en

2024/2025

summary of all the powerpoints together with own notes

Institución

Grado

Vista previa del contenido

LARGE SCALE ANALYSIS OF
BIOMEDICAL DATA
1. LESSON 1

1.1 DEFINE GOALS AND OBJECTIVES
Data science process
- Collection
- Cleaning
- Exploratory data analysis
- Model building
- Model deployments

GOALS
f.e.
- Development and implementation of protein-based assay to diagnose a specific cancer type
- Evaluation of a novel drug to treat colon cancer
- Prediction of cardiovascular disease based on blood counts and genomic data

(DATA-MINING) OBJECTIVE

Focus on data mining objective (Focusing on a data mining objective means identifying a specific goal or
question that you want to answer using data mining techniques.)
f.e.
- What proteins are differentially expressed in healthy vs diseased tissue
- What regulatory pathways are affected upon drug treatment in cell lines
- Does treatment A result in more pronounced tumor shrinkage in mice compared to conventional
therapy
- Comparative analysis of the blood cell count in 2 patientsgroup
SMART ->
 Specific = who and what
 Measurable = by how much
 Achievable= how
 Relevant= why
 Time-bound= when

Know what research you will perform exploratory vs descriptive
 Exploratory  see the difference between f.e. healthy and unhealthy tissue
 Descriptive  you know specific what you want to check
 Then you can try to get your data

1.2 EXPERIMENTAL/ STUDY DESIGN: SAMPLE NUMBERS, COFOUNDING,…

,1. How will your study be designed?
 Garbage in= garbage out
 (start with which kind of data you need and what kind and how many samples you need)
 Data type samples experiment

2. What datatype is needed to answer the question?
 Clinical data
 Imaging data
 Transcriptomics, genomics data
 Flow cytometry
 Proteomics

3. Will you generate own data or repurpose published data?
 Sometimes database already available and you don’t need to go to the lab
There is a lot online  test your hypotheses= repurpose data

4. What sample will be used?
A prospective study watches for outcomes, such as the development of a disease, during the study
period and relates this to other factors such as suspected risk or protection factor(s). The study
usually involves taking a cohort of subjects and watching them over a long period. The outcome of
interest should be common; otherwise, the number of outcomes observed will be too small to be
statistically meaningful (indistinguishable from those that may have arisen by chance). All efforts
should be made to avoid sources of bias such as the loss of individuals to follow up during the study.
Prospective studies usually have fewer potential sources of bias and confounding than retrospective
studies. Prospective investigation is required to make precise estimates of either the incidence of an
outcome or the relative risk of an outcome based on exposure. Retrospective

A retrospective study looks backwards and examines exposures to suspected risk or protection factors
in relation to an outcome that is established at the start of the study. Many valuable case-control
studies were retrospective investigations. Most sources of error due to confounding and bias are more
common in retrospective studies than in prospective studies. For this reason, retrospective
investigations are often criticized. If the outcome of interest is uncommon, however, the size of
prospective investigation required to estimate relative risk is often too large to be feasible. In
retrospective studies the odds ratio provides an estimate of relative risk. You should take special care
to avoid sources of bias and confounding in retrospective studies.

case-control studies
Case-Control studies are usually but not exclusively retrospective. The following notes relate case-
control to cohort
studies:
 outcome is measured before exposure/biomarker test
 controls are selected on the basis of not having the outcome
 good for rare outcomes
 quicker to complete
 prone to selection bias
 prone to recall/retrospective bias (do not remember previous events)

, cohort studies
Cohort studies are usually but not exclusively prospective. The following notes relate cohort to case-
control studies:
 outcome is measured after exposure/biomarker test
 yields true incidence rates and relative risks
 may uncover unanticipated associations with outcome
 best for common outcomes
 takes a long time to complete
 prone to attrition bias (unequal loss of participants)
 prone to the bias of change in methods over time

WHAT IS ECPERIMENT DESIGN?  HOW DO YOU ORGANISE YOUR EXPERIMENT AND GENERATE
THE DATA TO LEARN ABOUT AN A PRIORI DEFINED HYPOTHESIS OR ANSWER THE BIOLOGICAL
QUESTION OF INTEREST

Define the factors you are interested in
 different treatment or combination to test
 concentrations of compound
be aware of confounding factor, influence the result but you are not interested in
= the inability to distinguish the effect of one factor from the effect of another: interesting vs nuisance
You can have biological effect, technical effect
 batch (tissue from different city= different moment, hospital, machine)
 plate effect (Example: Imagine a study investigating the effect of a new drug on blood pressure. If a
confounding variable (like age) is not controlled for, older patients—who may have higher blood
pressure and be more likely to be prescribed the drug—might skew the results, making it appear that
the drug is more effective or less effective than it actually is.)
example  see dia 25
 layout of 96 well-plate
 organisation of mice in cages
 batches of materials used
batches:
how to know whether you have batches fe in rna seq experiment
 were all RNA isolations performed on the same day
 were all library preparation performed on the same day
 did the same person perform the RNA isolation/ library preparation for all samples
 did you use the same reagents for all samples
 did you perform the rna isolation/ library preparation in the same location
 if any of the answer is no  batches

how bad is confounding
 complete confounding= impossible to fix after experiment
Dit treedt op wanneer de associatie tussen de blootstelling en de uitkomst volledig wordt vertekend
door de confounder. Met andere woorden, als de confounder niet wordt gecontroleerd, lijkt er een
sterke relatie tussen de blootstelling en de uitkomst te zijn, die in werkelijkheid volledig te wijten is
aan de confounder.
 with incomplete confounding  try to wark around it in the analysis, often statistical power suffers
Dit treedt op wanneer de confounder de associatie tussen de blootstelling en de uitkomst verstoort,
maar niet volledig. De confounder heeft invloed, maar er blijft nog steeds een deel van de associatie
dat niet door de confounder wordt verklaard.

,  Some confounding is worse than others, depends on the effect of the confounding factor

How to find confounding
 Be weary of unexpectedly good separation between groups
 Make plots for visualizing all factor in experiment
 Dia 30  design 2 is better  for design 1: if you see a different => don’t know if it is bcs of the
treatment or the line
Solution
 Easiest to avoid confounding
o Exclude nuisance factors if possible, balance biological factors if possible + randomise if
possible
o Full randomization= lot of work, but often acceptable compromise exist
o Third well is what you are going to do

to avoid confounding in mice experiment
 Ensure animal in each condition are all the same sex, age, litter, batch
 If not possible  ensure to split the animals equally between condition
Avoid batch effect
 Design experiment in a way to avoid batches, if possible
 If unable to avoid batches
o Do not confound your experiment by batch:
o Do split replicates of the different sample groups across batches  the more replicate the
better (effectiviteit van de behandeling niet moet verwarren met de batc)
o Do include batch information in your experimental metadata (Voeg informatie over de
batches toe aan je experimentdata. Dit helpt later bij de analyse.)
During the analysis, we can regress out the variation due to batch if not confounded so it
doesn’t affect our result if we have that information
o Important= randomization
Three important things to avoid batch effect
 No shuffling of an effect leads to uncorrectable confounding
 Randomization
 Blocking of an effect  balanced design

decision about type and number of replicate

genuine replication vs pseudoreplication
 Genuinee replicate  increases ample size

Informar violación de derechos de autor

Escuela, estudio y materia

Institución: Universiteit Gent (UGent)
Estudio: Biomedische Wetenschappen
Grado: D013628A

Todos documentos para esta materia (1)

Información del documento

Subido en: 28 de octubre de 2025
Número de páginas: 50
Escrito en: 2024/2025
Tipo: RESUMEN

Temas

slides
summary
notes

$9.26

Accede al documento completo:

100% de satisfacción garantizada

Inmediatamente disponible después del pago

Leer en línea o como PDF

No estas atado a nada

Conoce al vendedor

laurinevandewiele92

3.0

(4)

Conoce al vendedor

laurinevandewiele92

Ver perfil

Seguir

Vendido

Miembro desde

3 año

Número de seguidores

Documentos

Última venta

3 meses hace

3.0

4 reseñas

Documentos populares

Recientemente visto por ti

Por qué los estudiantes eligen Stuvia

Creado por compañeros estudiantes, verificado por reseñas

Calidad en la que puedes confiar: escrito por estudiantes que aprobaron y evaluado por otros que han usado estos resúmenes.

¿No estás satisfecho? Elige otro documento

¡No te preocupes! Puedes elegir directamente otro documento que se ajuste mejor a lo que buscas.

Paga como quieras, empieza a estudiar al instante

Sin suscripción, sin compromisos. Paga como estés acostumbrado con tarjeta de crédito y descarga tu documento PDF inmediatamente.

“Comprado, descargado y aprobado. Así de fácil puede ser.”

Alisha Student

Preguntas frecuentes

What do I get when I buy this document?

You get a PDF, available immediately after your purchase. The purchased document is accessible anytime, anywhere and indefinitely through your profile.

100% de satisfacción garantizada: ¿Cómo funciona?

Nuestra garantía de satisfacción le asegura que siempre encontrará un documento de estudio a tu medida. Tu rellenas un formulario y nuestro equipo de atención al cliente se encarga del resto.

Who am I buying this summary from?

Stuvia is a marketplace, so you are not buying this document from us, but from seller laurinevandewiele92. Stuvia facilitates payment to the seller.

Will I be stuck with a subscription?

No, you only buy this summary for $9.26. You're not tied to anything after your purchase.

Can Stuvia be trusted?

4.6 stars on Google & Trustpilot (+1000 reviews) 45,681 summaries were sold in the last 30 days Founded in 2010, the go-to place to buy summaries for 16 years now