
Summary Schafer, J. & Graham, J. – Missing Data: Our view of the State of the Art

Pages
3
Uploaded on
05-04-2019
Written in
2018/2019

Article by Schafer and Graham on missing data

Schafer, J. & Graham, J. – Missing Data: Our view of the State of the Art

Most data analysis procedures were not designed for missing data. Missingness is usually a
nuisance, not the main focus of inquiry. Most researchers resort to editing the data to lend
an appearance of completeness. Unfortunately, this can lead to biased, inefficient, and
unreliable answers.

What Is a Missing Value?
Missing values are part of the more general concept of coarsened data, which includes
numbers that have been grouped, aggregated, rounded, censored, or truncated, resulting in
partial loss of information. Closely related to missing data are latent variables:
unobservable quantities (e.g. intelligence) that are imperfectly measured by test or
questionnaire items.

Historical Development
Until the 1970s, missing values were handled primarily by editing. The formulation of the
EM (expectation-maximization) algorithm made it feasible to compute ML (maximum
likelihood) estimates in many missing-data problems. ML treats the missing data as random
variables to be removed from (i.e. integrated out of) the likelihood function, as if they
had never been sampled.
Later the idea of MI (multiple imputation) was introduced, in which each missing value is
replaced with m>1 simulated values prior to analysis.

Goals and Criteria
A missing-value treatment cannot be properly evaluated apart from the modeling, estimation,
or testing procedure in which it is embedded. For example, mean substitution (replacing each
missing value of a variable with the average of the observed values) may accurately predict
the missing data, but it distorts estimated variances and correlations.
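The effect of mean substitution can be illustrated with a small simulation. This is only a sketch under stated assumptions: the data are simulated (not taken from the article), the variable names are hypothetical, and 40% of the values of y are deleted completely at random.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 10_000

# Hypothetical complete data: x and y with a true correlation of about 0.6
x = rng.normal(size=n)
y = 0.6 * x + np.sqrt(1 - 0.6**2) * rng.normal(size=n)

# Assumption for this sketch: 40% of y is deleted completely at random
missing = rng.random(n) < 0.4
y_obs = np.where(missing, np.nan, y)

# Mean substitution: every missing y becomes the mean of the observed y
y_filled = np.where(missing, np.nanmean(y_obs), y_obs)

var_complete = y.var(ddof=1)
var_filled = y_filled.var(ddof=1)
corr_complete = np.corrcoef(x, y)[0, 1]
corr_filled = np.corrcoef(x, y_filled)[0, 1]

print(f"variance:    complete {var_complete:.3f}, mean-substituted {var_filled:.3f}")
print(f"correlation: complete {corr_complete:.3f}, mean-substituted {corr_filled:.3f}")
```

The filled-in variable keeps the observed mean, but its variance and its correlation with x both shrink, because every imputed value sits exactly at the mean.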
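Let Q be a population quantity and ^Q an estimate of Q based on a sample of data. If the
procedure works well, ^Q will be close to Q. We thus want the bias, the difference between
the average value of ^Q and Q, to be small, and we want the variance of ^Q to be small as
well. Bias and variance are often combined into the mean square error, the average value of
(^Q-Q)² over repeated samples. These quantities describe the behavior of the estimate, but
they do not yet address the measures of uncertainty reported alongside it (e.g. standard
errors and confidence intervals).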
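Bias, variance, and mean square error can be approximated by repeating a procedure over many simulated samples. The sketch below is an illustration only: the population (standard normal, so Q = 0), the missingness rule (positive values are lost with probability 0.5), and the estimator (the mean of the observed values) are all assumptions chosen for the example, not taken from the article.

```python
import numpy as np

rng = np.random.default_rng(1)
Q = 0.0              # true population mean (assumed for this illustration)
n, reps = 200, 2000  # sample size and number of repeated samples

estimates = np.empty(reps)
for r in range(reps):
    y = rng.normal(loc=Q, scale=1.0, size=n)
    # Hypothetical missingness rule: positive values go missing with prob. 0.5
    observed = ~((y > 0) & (rng.random(n) < 0.5))
    estimates[r] = y[observed].mean()   # estimate ^Q from observed values only

bias = estimates.mean() - Q             # average of ^Q minus Q
variance = estimates.var()              # spread of ^Q over repeated samples
mse = np.mean((estimates - Q) ** 2)     # average of (^Q - Q)^2

print(f"bias {bias:.3f}, variance {variance:.5f}, MSE {mse:.5f}")
print(f"bias^2 + variance = {bias**2 + variance:.5f}")
```

Because the missingness here depends on the values themselves, the estimate is biased downward, and the mean square error equals the squared bias plus the variance.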
When missing values occur for reasons beyond our control, we must make assumptions
about the processes that create them. These are usually untestable.
Finally, one should avoid tricks that apparently solve the missing-data problem but actually
redefine the parameters or the population.

Types and Patterns of Nonresponse
Unit nonresponse is when the entire data collection procedure fails (e.g. the sampled person
is not at home). Item nonresponse is when partial data are available (e.g. the sampled
person does not respond to certain items). In longitudinal studies, participants may be
present for some waves of data collection and missing for others, which is referred to as
wave nonresponse. Attrition (dropout), a special case of wave nonresponse, is when one
leaves the study and does not return.
A univariate pattern is when missing values occur on an item Y, but a set of p other items
X1, X2, ..., Xp is completely observed (see figure 1a).
A monotone pattern is when items or item groups (Y1, Y2, ..., Yp) may be ordered in such a
way that if Yj is missing for a unit, then Yj+1, ..., Yp are missing as well (see figure 1b).
An arbitrary pattern is when any set of variables may be missing for any unit (see figure
1c).
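The three patterns can be told apart mechanically from a matrix of observed/missing indicators. The function below is a minimal sketch, not a method from the article: the name classify_pattern is hypothetical, and as a simplification it calls a pattern univariate only when exactly one item is incomplete (a group of items that are always missing together is reported as monotone).

```python
import numpy as np

def classify_pattern(R):
    """Classify a missingness pattern.

    R is a 2-D array with one row per unit and one column per item;
    a truthy entry means "observed", a falsy entry means "missing".
    """
    R = np.asarray(R, dtype=bool)
    n_incomplete = int((~R).any(axis=0).sum())  # items with any missing value
    if n_incomplete == 0:
        return "complete"
    if n_incomplete == 1:
        return "univariate"
    # Reorder items from most to least observed; if a monotone ordering
    # exists, this ordering is one of them.
    order = np.argsort(-R.sum(axis=0), kind="stable")
    S = R[:, order]
    # Monotone: within a unit, an item is never observed after a missing one
    if np.all(S[:, 1:] <= S[:, :-1]):
        return "monotone"
    return "arbitrary"

print(classify_pattern([[1, 1, 1], [1, 1, 0], [1, 1, 1]]))
print(classify_pattern([[1, 1, 1], [1, 1, 0], [1, 0, 0]]))
print(classify_pattern([[1, 0, 1], [0, 1, 1], [1, 1, 0]]))
```

The three calls use small made-up matrices with a univariate, a monotone, and an arbitrary pattern, in that order.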

The Distribution of Missingness
R is referred to as the missingness. The form of R depends on the complexity of the
pattern. In the univariate case, R is a binary indicator for each unit: R=1 indicates that
Y is observed, and R=0 indicates that Y is missing.
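For a single variable, the indicator R can be derived directly from the data. A small sketch, assuming missing values are coded as NaN (the values of Y are made up for illustration):

```python
import numpy as np

# Hypothetical variable Y with two missing values coded as NaN
Y = np.array([2.1, np.nan, 3.4, np.nan, 5.0])

# R = 1 where Y is observed, R = 0 where Y is missing
R = (~np.isnan(Y)).astype(int)

print(R.tolist())
```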
