Application of Multiple Imputation Method in Analyzing Data with Missing Continuous Covariates

  • Published : 2008.08.31


Missing continuous covariates are pervasive in the use of generalized linear models for medical data. Multiple imputation is the most common and easy-to-do method of dealing with missing covariate data. However, there are always serious warnings in using this method. There should be concern to make imputed values more proper. In this paper, proper imputation from posterior predictive distribution is developed for implementing with arbitrary priors. We use empirical distribution of the posterior for approximating the posterior predictive distribution, to sample from it. This method is preferable in comparison with a presented imputation method of us which uses a full model to impute missing values using available software. The proposed methods are implemented on glucocorticoid data.



  1. Ibrahim, J. G., Chen, M. H. and Lipsitz, S. R. (2002). Bayesian methods for generalized linear models with covariates missing at random, The Canadian Journal of Statistics / La Revue Canadienne de Statistique, 30, 55-78
  2. Ibrahim, J. G., Chen, M. H., Lipsitz, S. R. and Herring, A. H. (2005). Missing-data methods for generalized linear models: A comparative review, Journal of the American Statistical Association, 100, 332-347
  3. Little, R. J. A. and Rubin, D. B. (2002). Statistical Analysis with Missing Data, John Wiley & Sons, New York
  4. Nielsen, S. F. (2003). Proper and improper multiple imputation, International Statistical Review, 71, 593-607
  5. Rubin, D. B. (1976). Inference and missing data, Biometrika, 63, 581-592
  6. Rubin, D. B. (1977a). Formalizing subjective notions about the effect of nonrespondents in sample surveys, Journal of the American Statistical Association, 72, 538-543
  7. Rubin, D. B. (1977b). The Design of a General and Flexible System for Handling Non-Response in Sample Surveys, working document prepared for the U.S. Social Security Administration
  8. Rubin, D. B. (1987). Multiple Imputation for Nonresponse in Surveys, John Wiley & Sons, New York
  9. Schafer, J. L. (1997). Analysis of Incomplete Multivariate Data, Chapman & Hall/CRC, New York
  10. van Buuren, S. and Oudshoorn, K. (1999). Flexible multivariate imputation by MICE, Leiden, The Netherlands: TNO Prevention Center
  11. van Marter, L. J., Leviton, A., Kuban, K. C. K., Pagano, M. and Allred, E. N. (1990). Maternal glucocorticoid therapy and reduced risk of bronchopulmonary dysplasia, Pediatrics, 86, 331-336