Notas de lectura

Analytical Epidemiology II Lecture Notes for the second exam

Puntuación

Vendido

Páginas

106

Subido en

13-03-2025

Escrito en

2024/2025

In this document, you’ll find all the relevant slides along with my notes. I often used bullet points to keep things clear and organized. You can use these notes during the exam. They really helped me pass!

Institución

Grado

Ups! No podemos cargar tu documento ahora. Inténtalo de nuevo o contacta con soporte.

Informar violación de derechos de autor

Escuela, estudio y materia

Institución: Wageningen University (WUR)
Estudio: Msc Nutrition and Health
Grado: Analytical Epidemiology II

Todos documentos para esta materia (2)

Información del documento

Subido en: 13 de marzo de 2025
Número de páginas: 106
Escrito en: 2024/2025
Tipo: Notas de lectura
Profesor(es): Hans verhoef
Contiene: Todas las clases

Temas

analytical epidemiology ii

Vista previa del contenido

Analytical Epidemiology II: Lecture notes for the second exam

Module 8 Count modelling: understanding count data

Module learning objectives:

After successful completion of this module, students are expected to be able to:

1. Identify and distinguish count variables from other types of variables (binary, ordinal,
nominal, continuous).
2. Identify and distinguish between count variables with peculiar distributions.
3. Explain differences between distributions with continuous outcomes and count outcomes.
4. Describe the following terms: binary variable, ordinal variable, nominal variable, count
variable, continuous variable, categorical variable, numerical variable, discrete data,
censoring, truncation, probability density function, probability mass function.

Critique of categorisation:

● Issue with categorisation: It can lead to a loss of information and reduced statistical
precision.
● Example: Counts or continuous variables are sometimes dichotomised (e.g., converting
counts to binary for logistic regression).
● Recommendation: Avoid categorising variables unnecessarily, as it limits the accuracy of
statistical analysis.

Continuous data

• Continuous numbers are real numbers, ∈ ℝ.
• Continuous data have an infinite number of possibilities.

1

, • Between any two numbers is always another number.
• How to analyse continuous outcome variables?
o t-tests
o Linear regression
o Analysis of variance, ANOVA

Discrete data

• Finite set or an infinite sequence of numbers.
• The set is countable.
• Between any two numbers there is not always a third number.

Discrete data: binary outcome data

• Outcome only has two possible classes
o Y/N disease (cancer, diabetes, etc.)

• Binary outcome models, e.g.,:
o (Binary) logit regression model (yields odds ratios)
o (Binary) probit regression model (yields odds ratios)
o Binomial regression model (yields risk ratios)

The method used depends on the nature of the variable.

● Health sciences: Typically analysed using logistic regression (also called logit regression).
● Social sciences: More common to use probit analysis, though it usually gives similar results
to logistic regression.
● Epidemiology: Increasing use of binomial regression, which allows results to be expressed as
risk ratios instead of odds ratios.

Discrete data: ordered outcome data

Outcome has finite number of ordered classes:

• Mild, moderate or severe case
• Adherence to treatment (poor, reasonable, good, excellent)
• Likert scale

Ordinal outcomes are usually analysed by ordinal logit or ordinal probit regression.

Discrete data: non-ordered (nominal) outcome data

• Outcome has finite number of non-ordered classes.
• Health outcome: died, hospitalised, sick, healthy.
• Birth type (vaginal delivery, Caesarian section, miscarriage).

By contrast, nominal variables are typically analysed by polytomous logit or polytomous

2

,probit analysis. Keep in mind that regression analysis makes no assumptions about the distributions
of the independent variables. That should not be a concern in the selection of the appropriate type
of model.

This illustrates how count data might appear in a dataset.

Dataset structure:

1. First column: Participant ID – Identifies individual study participants.
2. Second column: Sex – A binary variable (e.g., male or female).
3. Third Column: Number of traffic offenses, a count variable (values range from 0 to infinity)
generated by a counting process. Count Data refers to the collection of these individual
count values.
4. Last two columns: Exposure variables: Used in count modelling to account for differences in
exposure time between individuals. These variables allow results to be expressed as rates
(e.g., traffic offenses per person-months at risk or per kilometres travelled).

3

, 1. Number of lightning strikes experienced by individual persons
o Count data (non-negative integers).
o Exposure variable: Person (value of 1 for each person, so it is effectively ignored).
o Special distribution: None in particular.

2. Number of mosquito larvae caught in a scoop of water
o Count data.
o Exposure variable: Scoop size or number of scoops, if these vary.
o Special distribution: May have a disproportionate number of zeros if samples are
taken from areas without mosquito breeding.

3. Number of beverages consumed per day
o Count data.
o Exposure variable:
o Not needed for a 24-hour recall (fixed period).
o Required if the number of days varies across participants.
o Special distribution: None in particular.

4. Number of ‘n’-s that appear on a printed page
o Count data.
o No exposure variable needed (fixed observation unit – a page).
o Special distribution: None in particular.

5. Number of ‘n’-s minus the number of ‘p’-s that appear on a printed page
o Not count data – Subtraction can produce negative values, which are not valid for
counts.
o No exposure variable applies.

6. Number of items bought by customers in a cash transaction report
o Count data.
o No exposure variable (each transaction is a fixed unit of observation).
o Special distribution: Zero-truncated – No zeros because only paying customers are
recorded.

7. Number of items bought by people walking around in a shopping mall
o Count data.
o No exposure variable (each person is a unit of observation).
o Special distribution: Excess zeros – Many people may not buy anything.

4

$8.39

Accede al documento completo:

100% de satisfacción garantizada

Inmediatamente disponible después del pago

Tanto en línea como en PDF

No estas atado a nada

Conoce al vendedor

elmadewolf20001

3.0

(2)

Conoce al vendedor

elmadewolf20001 Hanzehogeschool Groningen

Ver perfil

Seguir

Vendido

Miembro desde

3 año

Número de seguidores

Documentos

Última venta

1 mes hace

3.0

2 reseñas

Recientemente visto por ti

Por qué los estudiantes eligen Stuvia

Creado por compañeros estudiantes, verificado por reseñas

Calidad en la que puedes confiar: escrito por estudiantes que aprobaron y evaluado por otros que han usado estos resúmenes.

¿No estás satisfecho? Elige otro documento

¡No te preocupes! Puedes elegir directamente otro documento que se ajuste mejor a lo que buscas.

Paga como quieras, empieza a estudiar al instante

Sin suscripción, sin compromisos. Paga como estés acostumbrado con tarjeta de crédito y descarga tu documento PDF inmediatamente.

“Comprado, descargado y aprobado. Así de fácil puede ser.”

Alisha Student

Preguntas frecuentes

What do I get when I buy this document?

You get a PDF, available immediately after your purchase. The purchased document is accessible anytime, anywhere and indefinitely through your profile.

100% de satisfacción garantizada: ¿Cómo funciona?

Nuestra garantía de satisfacción le asegura que siempre encontrará un documento de estudio a tu medida. Tu rellenas un formulario y nuestro equipo de atención al cliente se encarga del resto.

Who am I buying this summary from?

Stuvia is a marketplace, so you are not buying this document from us, but from seller elmadewolf20001. Stuvia facilitates payment to the seller.

Will I be stuck with a subscription?

No, you only buy this summary for $8.39. You're not tied to anything after your purchase.

Can Stuvia be trusted?

4.6 stars on Google & Trustpilot (+1000 reviews) 45,681 summaries were sold in the last 30 days Founded in 2010, the go-to place to buy summaries for 16 years now