DOI QR코드

DOI QR Code

Antibiotics-Resistant Bacteria Infection Prediction Based on Deep Learning

딥러닝 기반 항생제 내성균 감염 예측

  • Oh, Sung-Woo (Graduate School of Information, Yonsei University) ;
  • Lee, Hankil (College of Pharmacy, Yonsei University) ;
  • Shin, Ji-Yeon (Graduate School of Information, Yonsei University) ;
  • Lee, Jung-Hoon (Graduate School of Information, Yonsei University)
  • Received : 2019.02.11
  • Accepted : 2019.02.22
  • Published : 2019.02.28

Abstract

The World Health Organization (WHO) and other government agencies aroundthe world have warned against antibiotic-resistant bacteria due to abuse of antibiotics and are strengthening their care and monitoring to prevent infection. However, it is highly necessary to develop an expeditious and accurate prediction and estimating method for preemptive measures. Because it takes several days to cultivate the infecting bacteria to identify the infection, quarantine and contact are not effective to prevent spread of infection. In this study, the disease diagnosis and antibiotic prescriptions included in Electronic Health Records were embedded through neural embedding model and matrix factorization, and deep learning based classification predictive model was proposed. The f1-score of the deep learning model increased from 0.525 to 0.617when embedding information on disease and antibiotics, which are the main causes of antibiotic resistance, added to the patient's basic information and hospital use information. And deep learning model outperformed the traditional machine hospital use information. And deep learning model outperformed the traditional machine learning models.As a result of analyzing the characteristics of antibiotic resistant patients, resistant patients were more likely to use antibiotics in J01 than nonresistant patients who were diagnosed with the same diseases and were prescribed 6.3 times more than DDD.

세계보건기구(WHO)를 비롯해 세계 각국의 정부기관은 항생제 오남용에 따른 항생제 내성균 감염에 대해 심각하게 경고하며 이를 예방하기 위한 관리와 감시를 강화하고 있다. 하지만 감염을 확인하기 위한 감염균 배양에 수일의 시간이 소요되면서 격리와 접촉주의를 통한 감염확산 방지 효과가 떨어져 선제적 조치를 위한 신속하고 정확한 예측 및 추정방법이 요구되고 있다. 본 연구는 Electronic Health Records에 포함된 질병 진단내역과 항생제 처방내역을 neural embedding model과 matrix factorization을 통해 embedding 하였고, 이를 활용한 딥러닝 기반분류 예측 모형을 제안하였다. 항생제 내성균 감염의 주요 원인인 질병과 항생제 정보를 embedding하여 환자의 기본정보와 병원이용 정보에 추가했을 때 딥러닝 예측 모형의 f1-score는 0.525에서 0.617로 상승하였고, 딥러닝 모형은 Super Learner와 같은 기존 기계학습 모형보다 더 나은 성능을 보여주었다. 항생제 내성균 감염환자의 특성을 분석한 결과, 감염환자는 동일한 질병을 진단받은 비감염환자에 비교해 J01 계열 항생제 사용이 많았고 WHO 권고기준(DDD)을 크게 벗어나는 오남용 청구사례가 6.3배 이상 높게 나타났으며 항생제 오남용과 항생제 내성균 감염간의 높은 연관성이 발견되었다.

Keywords

References

  1. A Medium Corporation, "I'll tell you why Deep Learning is so popular and in demand," https://medium.com/swlh/ill-tell-you-why-deep-learning-is-so-popular-and-in-demand-5aca72628780, 2019. 02. 22.
  2. Arango-Argoty, G., Garner, E., Pruden, A., Heath, L. S., Vikesland, P., and Zhang, L., "DeepARG: a deep learning approach for predicting antibiotic resistance genes from metagenomic data," Microbiome, Vol. 6, No. 1, p. 23, 2018. https://doi.org/10.1186/s40168-018-0401-z
  3. Bullinaria, J. A. and Levy, J. P., "Extracting semantic representations from word co-occurrence statistics: stop-lists, stemming, and SVD," Behavior Research Methods, Vol. 44, No. 3, pp. 890-907, 2012. https://doi.org/10.3758/s13428-011-0183-8
  4. Chen, M. L., Doddi, A., Royer, J., Freschi, L., Schito, M., Ezewudo, M., and Farhat, M., "Deep Learning Predicts Tuberculosis Drug Resistance Status from Whole-Genome Sequencing Data," BioRxiv, 2018.
  5. Choi, J. W. and Lee, H. J., "An Integrated Perspective of User Evaluating Personalized Recommender Systems: Performance-Driven or User-Centric," The Journal of Society for e-Business Studies, Vol. 17, No. 3, pp. 85-103, 2012. https://doi.org/10.7838/jsebs.2012.17.3.085
  6. Chung, J., Bhat, A., Kim, C. J., Yong, D., and Ryu, C. M., "Combination therapy with polymyxin B and netropsin against clinical isolates of multidrug-resistant Acinetobacter baumannii," Scientific Reports, Vol. 6, p. 28168, 2016. https://doi.org/10.1038/srep28168
  7. Dumais, S. T., "Latent semantic analysis," Annual Review of Information Science and Technology, Vol. 38, No. 1, pp. 188-230, 2004. https://doi.org/10.1002/aris.1440380105
  8. Fonarev, A. Matrix Factorization Methods For Training Embeddings, 2018.
  9. Gamallo, P. and Bordag, S., "Is Singular Value Decomposition Useful for Word Similarity Extraction?," Lang. Resour. Eval., Vol. 45, No. 2, pp. 95-119, 2011. https://doi.org/10.1007/s10579-010-9129-5
  10. Gao, X. W. and Qian, Y., "Prediction of Multidrug-Resistant TB from CT Pulmonary Images Based on Deep Learning Techniques," Molecular Pharmaceutics, Vol. 15, No. 10, pp. 4326-4335, 2018. https://doi.org/10.1021/acs.molpharmaceut.7b00875
  11. Ho, J. C., Ghosh, J., and Sun, J., "Marble: High-throughput Phenotyping from Electronic Health Records via Sparse Nonnegative Tensor Factorization," In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 115-124, New York, NY, USA: ACM, 2014.
  12. Ho, J. C., Ghosh, J., Steinhubl, S. R., Stewart, W. F., Denny, J. C., Malin, B. A., and Sun, J., "Limestone: High-throughput candidate phenotype generation via tensor factorization," Journal of Biomedical Informatics, Vol. 52, pp. 199-211, 2014. https://doi.org/10.1016/j.jbi.2014.07.001
  13. Jensen, P. B., Jensen, L. J., and Brunak, S., "Mining electronic health records: towards better research applications and clinical care," Nature Reviews Genetics, Vol. 13, No. 6, p. 395, 2012. https://doi.org/10.1038/nrg3208
  14. Kim, L., Sakong, J., Kim, Y., Kim, S., Kim, S., Tchoe, B., and Lee, T., "Developing the Inpatient Sample for the National Health Insurance Claims Data," Health Policy and Management, Vol. 23, No. 2, pp. 152-161, 2015. https://doi.org/10.4332/KJHPA.2013.23.2.152
  15. Kim, Y. H., Shin, G. W., and Lee, Y. H., "The Forecast of Future Technology Based on Deep Learning," Proceedings of KIIT Conference, pp. 219-220, 2015.
  16. Koren, Y., Bell, R., and Volinsky, C., "Matrix Factorization Techniques for Recommender Systems," Computer, Vol. 42, No. 8, pp. 30-37, 2009. https://doi.org/10.1109/MC.2009.263
  17. Krizhevsky, A., Sutskever, I., and Hinton, G. E., "ImageNet Classification with Deep Convolutional Neural Networks," In Proceedings of the 25th International Conferenceon Neural Information Processing Systems, Vol. 1, pp. 1097-1105, USA: Curran Associates Inc, 2012.
  18. Levy, O. and Goldberg, Y., "Neural Word Embedding As Implicit Matrix Factorization," In Proceedings of the 27th International Conference on Neural Information Processing Systems, Vol. 2, pp. 2177-2185, Cambridge, MA, USA: MIT Press, 2014.
  19. Liang, D., Altosaar, J., Charlin, L., and Blei, D. M., "Factorization Meets the Item Embedding: Regularizing Matrix Factorization with Item Co-occurrence," In Proceedings of the 10th ACM Conference on Recommender Systems, pp. 59-66, New York, NY, USA: ACM, 2016.
  20. Martinez, J. L., Baquero, F., and Andersson, D. I., "Predicting antibiotic resistance," Nature Reviews Microbiology, Vol. 5, No. 12, pp. 958-956, 2007. https://doi.org/10.1038/nrmicro1796
  21. Mcadam, A. J., Hooper, D. C., Demaria, A., Limbago, M. B., O'brien, T. F., and Mccaughey, B., "Antibiotic Resistance: How Serious Is the Problem, and What Can Be Done?," Clinical Chemistry, Vol. 58, No. 8, pp. 1182-1186, 2012. https://doi.org/10.1373/clinchem.2011.181636
  22. Mikolov, T., Sutskever, I., Chen, K., Corrado, G., and Dean, J., "Distributed Representations of Words and Phrases and Their Compositionality," In Proceedings of the 26th International Conference on Neural Information Processing Systems, Vol. 2, pp. 3111-3119, USA: Curran Associates Inc, 2013.
  23. Moradigaravand, D., Martin, P., Anne, F., Ville, M., Warringer, J., and Parts, L., "Prediction of antibiotic resistance in Escherichia coli from large-scale pan-genome data," PLOS Computational Biology, Vol. 14, No. 12, pp. 1-17, 2018.
  24. Omlin, C. W. and Giles, C. L., "Stable Encoding of Large Finite-State Automata in Recurrent Neural Networks with Sigmoid Discriminants," Neural Computation, Vol. 8, No. 4, pp. 675-696, 1996. https://doi.org/10.1162/neco.1996.8.4.675
  25. Park, S. H., "Management of multi-drug resistant organisms in healthcare settings," Vol. 61, No. 1, pp. 26-35, 2018. https://doi.org/10.5124/jkma.2018.61.1.26
  26. Polley, E. C. and van der Laan, M. J., "Super Learner in Prediction," U.C. Berkeley Division of Biostatistics Working Paper, 1-19, 2010.
  27. Purushotham, S., Meng, C., Che, Z., and Liu, Y., "Benchmark of Deep Learning Models on Large Healthcare MIMIC Datasets," CoRR, 2017.
  28. Rajkomar, A., Oren, E., Chen, K., Dai, A. M., Hajaj, N., Hardt, M., and Dean, J., "Scalable and accurate deep learning with electronic health records," Npj Digital Medicine, Vol. 1, No. 1, p. 18, 2018. https://doi.org/10.1038/s41746-018-0029-1
  29. Roh, J. H., Kim, H. J., and Chang, J. Y., "Improving Hypertext Classification Systems through WordNet-based Feature Abstraction," The Jounal of Society for e-Business Studies, Vol. 18, No. 2, pp. 95-110, 2013. https://doi.org/10.7838/jsebs.2013.18.2.095
  30. Santos, R. P., Mayo, T. W., and Siegel, J. D., "Active Surveillance Cultures and Contact Precautions for Control of Multidrug-Resistant Organisms-Ethical Considerations," Clinical Infectious Disease, Vol. 47, No. 1, pp. 110-116, 2008. https://doi.org/10.1086/588789
  31. Socher, R., Lin, C. C. Y., Ng, A. Y., and Manning, C. D., "Parsing Natural Scenes and Natural Language with Recursive Neural Networks," In Proceedings of the 28th International Conference on International Conference on Machine Learning, pp. 129-136, USA: Omnipress, 2011.
  32. Song, J. H., "Current status and future strategies of antimicrobial resistance in Korea," The Korean Journal of Medicine, Vol. 77, No. 2, pp. 143-151, 2009.
  33. Van der Laan, M. J., Polley, E. C., and Hubbard, A. E., "Super Learner," Statistical Applications in Genetics and Molecular Biology, Vol. 6, No. 1, 2007.
  34. Xiang, T., Ray, D., Lohrenz, T., Dayan, P., and Montague, P. R., "Computational Phenotyping of Two-Person Interactions Reveals Differential Neural Response to Depth-of-Thought," PLOS Computational Biology, Vol. 8, No. 12, e1002841, 2012. https://doi.org/10.1371/journal.pcbi.1002841
  35. Xu, X., Liang, T., Zhu, J., Zheng, D., and Sun, T., "Review of classical dimensionality reduction and sample selection methods for large-scale data processing," Neurocomputing, Vol. 328, pp. 5-15, 2019. https://doi.org/10.1016/j.neucom.2018.02.100
  36. Yang, Y., Niehaus, K. E., and Clifton, D. A., "Predicting antibiotic resistance from genomic data," In Machine learning for healthcare technologies, pp. 203-226, IET, 2016.
  37. Young, S., Abdou, T., and Bener, A., "Deep super learner: A deep ensemble for classification problems," Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 10832 LNAI, pp. 84-95, 2018.
  38. Zhang, M., Hu, B., Shi, C., Wu, B., and Wang, B. (n.d.)., "Matrix Factorization meets Social Network Embedding for Rating Prediction," In Asia-Pacific Web (APWeb) and Web-Age Information Management (WAIM) Joint International Conference on Web and Big Data, pp. 121-129, Springer, Cham, 2018.