100% satisfaction guarantee Immediately available after payment Both online and in PDF No strings attached 4.2 TrustPilot
logo-home
Summary

Summary of Empirical Methods

Rating
-
Sold
-
Pages
10
Uploaded on
23-02-2025
Written in
2024/2025

This document summarizes empirical methods in data analysis, focusing on A/B Testing, Analysis of (Co-)Variance, Regression Analysis, Cluster Analysis, and Conjoint Analysis. It includes core concepts, statistical test selection, model assumptions, decision trees, and validation checklists.

Show more Read less
Institution
Course









Whoops! We can’t load your doc right now. Try again or contact support.

Written for

Institution
Study
Course

Document information

Uploaded on
February 23, 2025
File latest updated on
February 24, 2025
Number of pages
10
Written in
2024/2025
Type
Summary

Subjects

Content preview

Empirical Methods - Summary
Dennis Klappe
February 24, 2025


General
Data Types and Examples
• Categorical Data: Represents distinct groups; non-numeric.
– Example: Browser type (e.g., Chrome, Firefox)
• Nominal Data: Categories without a meaningful order.
– Example: Types of cuisine (e.g., Italian, Chinese, Mexican)
– Nominal Scale: Size of number is not related to the amount of the characteristic measured.
– Example: Eye color.
• Ordinal Data: Ordered categories; unequal intervals.
– Example: Education level (e.g., Bachelor’s, Master’s)
– Ordinal Scale: Larger numbers indicate more (or less) of the characteristic, but not how much
more (or less).
– Example: Military ranks, ranking of teams in a tournament.
• Continuous Data: Numeric values within a range; measured.
– Example: Height (e.g., 175 cm)
• Ratio Scale: Contains interval properties, with a natural zero point, allowing interpretation of ratios.
– Example: Height.
• Binary Data: Two possible outcomes.
– Example: Pass/Fail
• Discrete Data: Countable numeric values; distinct.
– Example: Number of children (0, 1, 2, ...)
• Interval Scale: Contains ordinal properties, with equal differences between scale points.
– Example: Temperature [°C].

Topics Overview
• A/B Testing: Used to compare two versions (A vs. B) to determine which performs better on specific
metrics.
– Example: Comparing click-through rates between two webpage designs.
• ANOVA (Analysis of Variance): Tests for significant differences between means across 3+ groups.
– Example: Examining differences in mean test scores across multiple teaching methods.
• Regression Analysis: Models relationships between a dependent variable and one or more indepen-
dent variables.
– Example: Predicting house prices based on size, location, and age.
• Cluster Analysis: Groups observations with similar characteristics into homogeneous groups.
– Example: Segmenting customers into distinct profiles based on purchasing behavior.
• Conjoint Analysis: Determines how consumers value different attributes of a product.
– Example: Evaluating trade-offs consumers make between price and features in smartphones.


A/B Testing
Core Concept
A/B testing compares two variants (A vs. B) to determine which performs better on specific metrics.
Use it when:


1

, • Testing UI changes, pricing strategies, or marketing campaigns
• Evaluating algorithm changes or feature rollouts
• Needing statistical confidence before full implementation

Statistical Test Selection Guide
• Categorical Data (e.g., conversion rates):
– Small samples: Fisher’s Exact Test
– Large samples: Pearson’s Chi-Square Test
• Continuous Data (e.g., revenue):
– Normal distribution/large samples:
∗ Equal variances: Student’s t-test
∗ Unequal variances: Welch’s t-test
– Non-normal/small samples: Mann-Whitney U Test

Key Considerations
• Use Levene’s Test to determine equal vs. unequal variances:
– p-value > 0.05: Equal variances, use Student’s t-test
– p-value < 0.05: Unequal variances, use Welch’s t-test
• Welch’s t-test is generally preferred over Student’s t-test (more robust to unequal variances)
• Mann-Whitney tests distributions/medians, not means
• Large samples (> 30) can use t-tests via CLT even with non-normal data
• For skewed data, consider transformations or median-based analysis

Decision Tree
1. Categorical outcome?
• Small sample → Fisher’s Exact
• Large sample → Chi-Square
2. Continuous outcome?
• Normal/large sample:
– Equal variances → Student’s t-test
– Unequal variances → Welch’s t-test
• Non-normal/small sample → Mann-Whitney


Analysis of (Co-)Variance
Core Concept
• ANOVA: Tests differences between 3+ group means
• ANCOVA: ANOVA with continuous control variables (covariates)
• Sum of Squares (SS):
– SS Total: Total variance in the data.
Interpretation: High SS Total indicates high overall variability in your data.
– SS Between: Variance due to differences between group means.
Interpretation: High SS Between means greater differences between groups, potentially significant
if the F-statistic is also high.
– SS Within: Variance within groups, capturing unaccounted variance.
Interpretation: High SS Within suggests more variability within each group, which can reduce the
likelihood of finding significant between-group effects.
– SS Covariate (ANCOVA): Variance explained by covariates.
Interpretation: High SS Covariate indicates that covariates account for a significant portion of the
variance, improving the analysis by controlling for confounders.
• Key output: F-statistic F = Between-group variance
Within-group variance
– Interpretation:
∗ High F-value: Greater between-group variance compared to within-group variance, indicating
significant differences.


2

Get to know the seller

Seller avatar
Reputation scores are based on the amount of documents a seller has sold for a fee and the reviews they have received for those documents. There are three levels: Bronze, Silver and Gold. The better the reputation, the more your can rely on the quality of the sellers work.
d3nnis Universiteit Twente
Follow You need to be logged in order to follow users or courses
Sold
21
Member since
11 months
Number of followers
0
Documents
2
Last sold
1 week ago

5.0

1 reviews

5
1
4
0
3
0
2
0
1
0

Recently viewed by you

Why students choose Stuvia

Created by fellow students, verified by reviews

Quality you can trust: written by students who passed their tests and reviewed by others who've used these notes.

Didn't get what you expected? Choose another document

No worries! You can instantly pick a different document that better fits what you're looking for.

Pay as you like, start learning right away

No subscription, no commitments. Pay the way you're used to via credit card and download your PDF document instantly.

Student with book image

“Bought, downloaded, and aced it. It really can be that simple.”

Alisha Student

Frequently asked questions