DOI QR코드

DOI QR Code

A Note on Cook's Distance in the Multivariate Linear Model

  • Bae, Whasoo (Department of Data Science/Institute of Statistical Information, Inje University) ;
  • Hwang, Hyunmi (Department of Statistics, Pusan National University) ;
  • Kim, Choongrak (Department of Statistics, Pusan National University)
  • Received : 2012.10.16
  • Accepted : 2012.11.28
  • Published : 2013.01.31

Abstract

We propose a version of Cook's distance (called local distance) in the multivariate linear model. The proposed version is a matrix, while the existing version of Cook's distance (called global distance) is a scalar. The existing Cook's distance is the trace of the proposed Cook's distance. In addition, we argue that the proposed Cook's distance has a more natural extension of the Cook's distance in the univariate linear model than the existing Cook's distance. An illustrative example based on a real data set is given.

Keywords

References

  1. Altunkaynak, B. and Ekni, M. (2002). Detection of influential observation vectors for multivariate linear regression, Journal of Mathematics and Statistics, 31, 139-151.
  2. Banerjee, M. and Frees, E. W. (1997). Influence diagnostics for linear longitudinal models, Journal of the American Statistical Association, 92, 999-1005. https://doi.org/10.1080/01621459.1997.10474055
  3. Caroni, C. (1987). Residuals and influence in the multivariate linear model, The Statistician, 36, 365-370. https://doi.org/10.2307/2348833
  4. Cook, R. D. (1977). Detection of influential observations in linear regression, Technometrics, 19, 15-18. https://doi.org/10.2307/1268249
  5. Cook, R. D. (1986). Assessment of local influence, Journal of the Royal Statistical Society, Series B, 48, 133-169.
  6. Cook, R. D., Pena, D. and Weisberg, S. (1988). The likelihood displacement, Communications in Statistics - Theory and Methods, 17, 623-640. https://doi.org/10.1080/03610928808829645
  7. Diaz-Garcia, J. A., Rohas, M. G. and Leiva-Sanchez, V. (2003). Influence diagnostics for elliptical multivariate linear regression models, Communications in Statistics - Theory and Methods, 32, 625-641. https://doi.org/10.1081/STA-120018555
  8. Fung, W. K. (1999). Outlier diagnostics in several multivariate samples, Journal of the Royal Statistical Society, Series D, 48, 73-84. https://doi.org/10.1111/1467-9884.00172
  9. Green, P. E. (1978). Analyzing Multivariate Data, Hindsdale, Ill, The Dryden Press.
  10. Lindsey, P. J. and Lindsey, J. K. (2000). Diagnostic tools for random effects in the repeated measures growth curve model, Computational Statistics and Data Analysis, 33, 79-100. https://doi.org/10.1016/S0167-9473(99)00049-3
  11. Preisser, J. S. and Qaqish, B. F. (1996). Deletion diagnostics for generalized estimating equations, Biometrika, 83, 551-562. https://doi.org/10.1093/biomet/83.3.551
  12. Tang, M. K. and Fung, W. K. (1997). Case-deletion diagnostics for test statistics in multivariate regression, Australian and New Zealand Journal of Statistics, 39, 345-353. https://doi.org/10.1111/j.1467-842X.1997.tb00696.x