Journal Home
Search for

Volume 19, Issue 12, Pages 908-914 (December 2009)


View previous. 12 of 14 View next.

Multiple Imputation for Missing Laboratory Data: An Example from Infectious Disease Epidemiology

Zuber D. Mulla, PhDabCorresponding Author Informationemail address, Byungtae Seo, PhDc, Ramaswami Kalamegham, PhDad, Bahij S. Nuwayhid, MD, PhDa

Received 13 May 2009; accepted 9 August 2009. published online 07 October 2009.

Purpose

To present multiple imputation (MI) as an appropriate method to address missing values for a laboratory parameter (serum albumin) in an epidemiologic study.

Methods

A data set of patients who were hospitalized for invasive group A streptococcal infections was accessed. Age was the exposure of interest. The outcome was hospital mortality. Several variables, including serum albumin, were considered to be potential confounders. Of the 201 records, 91 had missing values for serum albumin. The MI procedure in SAS was used to perform 20 imputations of serum albumin by using a Markov chain Monte Carlo approach. Logistic regression was then performed on each of the 20 filled-in data sets, and the results were appropriately combined by using the MIANALYZE procedure.

Results

Age (≥55 years vs. 0–54 years) was not a risk factor for hospital mortality in the complete-case analysis (n=110): adjusted odds ratio (OR)=2.43 (95% confidence interval [CI]: 0.79–7.53). Age was a significant risk factor in the imputed data set (n=201): adjusted OR=3.08 (95% CI: 1.22–7.78).

Conclusions

Epidemiologists frequently encounter data sets that contain missing values. Traditional missing data techniques such as the complete-subject analysis may lead to biased results. We have demonstrated the use of a novel technique, MI, to account for missing data.

a Department of Obstetrics and Gynecology, Paul L. Foster School of Medicine, Texas Tech University Health Sciences Center, El Paso

b Department of Epidemiology and Biostatistics, University of South Florida College of Public Health, Tampa

c Department of Mathematics and Statistics, Texas Tech University, Lubbock

d Department of Pathology, Paul L. Foster School of Medicine, Texas Tech University Health Sciences Center, El Paso

Corresponding Author InformationAddress correspondence to: Zuber D. Mulla, Department of Obstetrics and Gynecology, Paul L. Foster School of Medicine, Texas Tech University Health Sciences Center, 4800 Alberta Ave., El Paso, TX 79905. Tel: (915) 545-6710. Fax: (915) 545-6946.

PII: S1047-2797(09)00285-3

doi:10.1016/j.annepidem.2009.08.002


View previous. 12 of 14 View next.