DOI QR코드

DOI QR Code

Estimating the AUC of the MROC curve in the presence of measurement errors

  • G, Siva (Department of Statistics, Pondicherry University) ;
  • R, Vishnu Vardhan (Department of Statistics, Pondicherry University) ;
  • Kamath, Asha (Department of Data Science, MAHE)
  • Received : 2022.01.01
  • Accepted : 2022.05.19
  • Published : 2022.09.30

Abstract

Collection of data on several variables, especially in the field of medicine, results in the problem of measurement errors. The presence of such measurement errors may influence the outcomes or estimates of the parameter in the model. In classification scenario, the presence of measurement errors will affect the intrinsic cum summary measures of Receiver Operating Characteristic (ROC) curve. In the context of ROC curve, only a few researchers have attempted to study the problem of measurement errors in estimating the area under their respective ROC curves in the framework of univariate setup. In this paper, we work on the estimation of area under the multivariate ROC curve in the presence of measurement errors. The proposed work is supported with a real dataset and simulation studies. Results show that the proposed bias-corrected estimator helps in correcting the AUC with minimum bias and minimum mean square error.

Keywords

References

  1. Anderson TW and Bahadur RR (1962). Classification into two multivariate normal distributions with different covariance matrices, The Annals of Mathematical Statistics, 33, 420-431. https://doi.org/10.1214/aoms/1177704568
  2. Bamber D (1975). The area above the ordinal dominance graph and the area below the receiver operating characteristic graph, Journal of Mathematical Psychology, 12, 387-415. https://doi.org/10.1016/0022-2496(75)90001-2
  3. Begg CB and Greenes RA (1983). Assessment of diagnostic tests when disease verification is subject to selection bias, Biometrics, 39, 207-215. https://doi.org/10.2307/2530820
  4. Begg CB and McNeil BJ (1988). Assessment of radiologic tests: control of bias and other design considerations, Radiology, 167, 565-569. https://doi.org/10.1148/radiology.167.2.3357976
  5. Berbaum KS, Dorfman DD, and Franken Jr EA (1989). Measuring observer performance by ROC analysis: indications and complications, Investigative Radiology, 24, 228-233. https://doi.org/10.1097/00004424-198903000-00011
  6. Coffin M and Sukhatme S (1996). A parametric approach to measurement errors in receiver operating characteristic studies, In Lifetime Data: Models in Reliability and Survival Analysis, 71-75.
  7. Coffin M and Sukhatme S (1997). Receiver operating characteristic studies and measurement errors, Biometrics, 53, 823-837. https://doi.org/10.2307/2533545
  8. Dunn G (1989). Design and Analysis of Reliability Studies: The Statistical Evaluation of Measurement Errors, Edward Arnold Publishers, New York.
  9. Egan JP (1975). Signal Detection Theory and ROC Analysis, Academic Press, New York.
  10. Faraggi D (2000). The effect of random measurement error on receiver operating characteristic (ROC) curves, Statistics in Medicine, 19, 61-70. https://doi.org/10.1002/(SICI)1097-0258(20000115)19:1<61::AID-SIM297>3.0.CO;2-A
  11. Fuller WA (2009). Measurement Error Models, John Wiley & Sons, New York.
  12. Guilherme AB and Ajalmar Rago RN (2011). Department of Teleinformatics Engineering, Federal University of Ceara, Fortaleza, Ceara, Brazil, UCI Machine Learning Repository, Available from: https://archive.ics.uci.edu/ml/datasets/Vertebral+Column
  13. Hand DJ (2010). Evaluating diagnostic tests: the area under the ROC curve and the balance of errors, Statistics in Medicine, 29, 1502-1510. https://doi.org/10.1002/sim.3859
  14. Kim J and Gleser LJ (2000). SIMEX approaches to measurement error in ROC studies, Communications in Statistics-Theory and Methods, 29, 2473-2491. https://doi.org/10.1080/03610920008832617
  15. Perkins NJ, Schisterman EF, and Vexler A (2009). Generalized ROC curve inference for a biomarker subject to a limit of detection and measurement error, Statistics in Medicine, 28, 1841-1860. https://doi.org/10.1002/sim.3575
  16. Reiser B (2000). Measuring the effectiveness of diagnostic markers in the presence of measurement error through the use of ROC curves, Statistics in Medicine, 19, 2115-2129. https://doi.org/10.1002/1097-0258(20000830)19:16<2115::AID-SIM529>3.0.CO;2-M
  17. Sameera G, Vardhan RV, and Sarma KVS (2016). Binary classification using multivariate receiver operating characteristic curve for continuous data, Journal of Biopharmaceutical Statistics, 26, 421-431. https://doi.org/10.1080/10543406.2015.1052479
  18. Schisterman EF (1999). Lipid peroxidation and cardiovascular disease: an ROC approach (Doctoral dissertation, State University of New York at Buffalo ProQuest Dissertations Publishing, New York.
  19. Schisterman EF, Faraggi D, Reiser B, and Trevisan M (2001). Statistical inference for the area under the receiver operating characteristic curve in the presence of random measurement error, American Journal of Epidemiology, 154, 174-179. https://doi.org/10.1093/aje/154.2.174
  20. Schisterman EF, Faraggi D, and Reiser B (2004). Adjusting the generalized ROC curve for covariates, Statistics in Medicine, 23, 3319-3331. https://doi.org/10.1002/sim.1908
  21. Shear L, Burke GL, Freedman DS, Webber LS, and Berenson GS (1987). Designation of children with high blood pressure-considerations on percentile cut points and subsequent high blood pressure: the Bogalusa Heart Study, American Journal of Epidemiology, 125, 73-84. https://doi.org/10.1093/oxfordjournals.aje.a114513
  22. Su JQ and Liu JS (1993). Linear combinations of multiple diagnostic markers, Journal of the American Statistical Association, 88, 1350-1355. https://doi.org/10.1080/01621459.1993.10476417
  23. Tosteson TD, Buonaccorsi JP, Demidenko E, and Wells WA (2005). Measurement error and confidence intervals for ROC curves, Biometrical Journal: Journal of Mathematical Methods in Biosciences, 47, 409-416. https://doi.org/10.1002/bimj.200310159
  24. Vexler A, Schisterman EF, and Liu A (2008). Estimation of ROC curves based on stably distributed biomarkers subject to measurement error and pooling mixtures, Statistics in Medicine, 27, 280-296. https://doi.org/10.1002/sim.3035
  25. Yin J and Tian L (2014). Optimal linear combinations of multiple diagnostic biomarkers based on Youden index, Statistics in Medicine, 33, 1426-1440. https://doi.org/10.1002/sim.6046
  26. Yuan Z and Ghosh D (2008). Combining multiple biomarker models in logistic regression, Biometrics, 64, 431-439. https://doi.org/10.1111/j.1541-0420.2007.00904.x