DOI QR코드

DOI QR Code

A Novel Approach to COVID-19 Diagnosis Based on Mel Spectrogram Features and Artificial Intelligence Techniques

  • Alfaidi, Aseel (Department of Computer Science and Artificial Intelligence, College of Computer Science and Engineering, University of Jeddah) ;
  • Alshahrani, Abdullah (Department of Computer Science and Artificial Intelligence, College of Computer Science and Engineering, University of Jeddah) ;
  • Aljohani, Maha (Department of Software Engineering, College of Computer Science and Engineering, University of Jeddah)
  • Received : 2022.09.05
  • Published : 2022.09.30

Abstract

COVID-19 has remained one of the most serious health crises in recent history, resulting in the tragic loss of lives and significant economic impacts on the entire world. The difficulty of controlling COVID-19 poses a threat to the global health sector. Considering that Artificial Intelligence (AI) has contributed to improving research methods and solving problems facing diverse fields of study, AI algorithms have also proven effective in disease detection and early diagnosis. Specifically, acoustic features offer a promising prospect for the early detection of respiratory diseases. Motivated by these observations, this study conceptualized a speech-based diagnostic model to aid in COVID-19 diagnosis. The proposed methodology uses speech signals from confirmed positive and negative cases of COVID-19 to extract features through the pre-trained Visual Geometry Group (VGG-16) model based on Mel spectrogram images. This is used in addition to the K-means algorithm that determines effective features, followed by a Genetic Algorithm-Support Vector Machine (GA-SVM) classifier to classify cases. The experimental findings indicate the proposed methodology's capability to classify COVID-19 and NOT COVID-19 of varying ages and speaking different languages, as demonstrated in the simulations. The proposed methodology depends on deep features, followed by the dimension reduction technique for features to detect COVID-19. As a result, it produces better and more consistent performance than handcrafted features used in previous studies.

Keywords

Acknowledgement

The researchers would like to express their gratitude and appreciation to the Department of Computer Science and Technology at the University of Cambridge for making the dataset available to them in order for them to complete their research.

References

  1. D. Cucinotta and M. Vanelli, "WHO declares COVID19 a pandemic," Acta Biomed., vol. 91, no. 1, pp. 157-160, 2020, doi: 10.23750/abm.v91i1.9397.
  2. V. Corman et al., "Detection of 2019 novel coronavirus (2019-nCoV) by real-time RT-PCR," Euro Surveill, vol. 25, no. 3, pp. 1-8, 2020.
  3. R. W. Peeling, P. L. Olliaro, D. I. Boeras, and N. Fongwen, "Scaling up COVID-19 rapid antigen tests: promises and challenges," Lancet Infect. Dis., vol. 21, no. 9, pp. e290-e295, 2021, doi: 10.1016/S1473-3099(21)00048-7.
  4. S. Swayamsiddha, K. Prashant, D. Shaw, and C. Mohanty, "The prospective of Artificial Intelligence in COVID-19 Pandemic," Health Technol. (Berl)., vol. 11, no. 6, pp. 1311-1320, 2021, doi: 10.1007/s12553-021-00601-2.
  5. J. Shuja, E. Alanazi, W. Alasmary, and A. Alashaikh, "COVID-19 open source data sets : a comprehensive survey," pp. 1296-1325, 2021.
  6. T. F. Quatieri, T. Talkar, and J. S. Palmer, "A Framework for Biomarkers of COVID-19 Based on Coordination of Speech-Production Subsystems," IEEE Open J. Eng. Med. Biol., vol. 1, pp. 203-206, 2020, doi: 10.1109/ojemb.2020.2998051.
  7. S. Sondhi et al., "Voice Processing For Covid-19 Scanning And Prognostic Indicator," Heliyon, vol. 7, no. 10, p. e08134, 2021, doi: 10.1016/j.heliyon.2021.e08134.
  8. Z. Ren, N. Cummins, J. Han, S. Schnieder, and J. Krajewski, "Evaluation of the Pain Level from Speech : Introducing a Novel Pain Database and Benchmarks Evaluation of the Pain Level from Speech : Introducing a Novel Pain Database and Benchmarks 1 Introduction," no. October, 2018.
  9. L. Lin, X. Chen, Y. Shen, and L. Zhang, "Towards Automatic Depression Detection : A BiLSTM / 1D CNN-Based Model," pp. 1-20, 2020, doi: 10.3390/app10238701.
  10. L. Verde, G. D. E. Pietro, and G. Sannino, "Voice Disorder Identification by Using Machine Learning Techniques," IEEE Access, vol. 6, pp. 16246-16255, 2018, doi: 10.1109/ACCESS.2018.2816338.
  11. P. Vizza, G. Tradigo, D. Mirarchi, R. B. Bossio, and P. Veltri, "On the use of voice signals for studying sclerosis disease," Computers, vol. 6, no. 4, pp. 1-12, 2017, doi: 10.3390/computers6040030.
  12. L. Zahid et al., "A Spectrogram-Based Deep Feature Assisted Computer-Aided Diagnostic System for Parkinson's Disease," IEEE Access, vol. 8, pp. 35482-35495, 2020, doi: 10.1109/ACCESS.2020.2974008.
  13. M. Kiran Reddy et al., "The automatic detection of heart failure using speech signals," Comput. Speech Lang., vol. 69, p. 101205, 2021, doi: 10.1016/j.csl.2021.101205.
  14. B. W. Schuller, D. M. Schuller, K. Qian, J. Liu, H. Zheng, and X. Li, "COVID-19 and Computer Audition: An Overview on What Speech & Sound Analysis Could Contribute in the SARS-CoV-2 Corona Crisis," Front. Digit. Heal., vol. 3, no. March, 2021, doi: 10.3389/fdgth.2021.564906.
  15. A. Imran et al., "AI4COVID-19: AI enabled preliminary diagnosis for COVID-19 from cough samples via an app," Informatics Med. Unlocked, vol. 20, p. 100378, 2020, doi: 10.1016/j.imu.2020.100378.
  16. C. Brown et al., "Exploring Automatic Diagnosis of COVID-19 from Crowdsourced Respiratory Sound Data," Proc. ACM SIGKDD Int. Conf. Knowl. Discov. Data Min., pp. 3474-3484, 2020, doi: 10.1145/3394486.3412865.
  17. H. Coppock, A. Gaskell, P. Tzirakis, A. Baird, L. Jones, and B. Schuller, "End- - to- - end convolutional neural network enables COVID-19 detection from breath and cough audio : a pilot study," pp. 356-362, 2021, doi: 10.1136/bmjinnov-2021-000668.
  18. J. Han et al., "An early study on intelligent analysis of speech under COVID-19: Severity, sleep quality, fatigue, and anxiety," Proc. Annu. Conf. Int. Speech Commun. Assoc. INTERSPEECH, vol. 2020-Octob, no. October, pp. 4946-4950, 2020, doi: 10.21437/Interspeech.2020-2223.
  19. L. Verde, G. De Pietro, and G. Sannino, "Artificial Intelligence Techniques for the Non-invasive Detection of COVID-19 Through the Analysis of Voice Signals," Arab. J. Sci. Eng., vol. 19, 2021, doi: 10.1007/s13369-021-06041-4.
  20. M. Usman, V. K. Gunjan, M. Wajid, M. Zubair, and K. N. Siddiquee, "Speech as a Biomarker for COVID-19 Detection Using Machine Learning," vol. 2022, 2022.
  21. J. Han et al., "Exploring automatic covid-19 diagnosis via voice and symptoms from crowdsourced data," ICASSP, IEEE Int. Conf. Acoust. Speech Signal Process. - Proc., vol. 2021-June, pp. 8328-8332, 2021, doi: 10.1109/ICASSP39728.2021.9414576.
  22. B. Stasak, Z. Huang, S. Razavi, D. Joachim, and J. Epps, "Automatic Detection of COVID-19 Based on ShortDuration Acoustic Smartphone Speech Analysis," J. Healthc. Informatics Res., vol. 5, no. 2, pp. 201-217, 2021, doi: 10.1007/s41666-020-00090-4.
  23. M. T. Garcia-Ordas, J. A. Benitez-Andrades, I. GarciaRodriguez, C. Benavides, and H. Alaiz-Moreton, "Detecting respiratory pathologies using convolutional neural networks and variational autoencoders for unbalancing data," Sensors (Switzerland), vol. 20, no. 4, 2020, doi: 10.3390/s20041214.
  24. L. Vavrek, M. Hires, D. Kumar, and P. Drotar, "Deep convolutional neural network for detection of pathological speech," SAMI 2021 - IEEE 19th World Symp. Appl. Mach. Intell. Informatics, Proc., pp. 245-249, 2021, doi: 10.1109/SAMI50585.2021.9378656.
  25. Q. Zhou et al., "Cough Recognition Based on MelSpectrogram and Convolutional Neural Network," Front. Robot. AI, vol. 8, no. May, pp. 1-7, 2021, doi: 10.3389/frobt.2021.580080.
  26. T. Xia et al., "COVID-19 Sounds: A Large-Scale Audio Dataset for Digital Respiratory Screening," Thirty-fifth Conf. Neural Inf. Process. Syst. Datasets Benchmarks Track (Round 2), pp. 1-13, 2021, [Online]. Available: https://covid19.who.int/.
  27. D. Jurafsky and J. H. Martin, Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition. 2009.
  28. N. K. Manaswi, Deep Learning with Applications Using Python Chatbots and Face, Object, and Speech Recognition With TensorFlow and Keras. 2018.
  29. A. Solanki and S. Pandey, "Music instrument recognition using deep convolutional neural networks," Int. J. Inf. Technol., 2019, doi: 10.1007/s41870-019-00285-y.
  30. B. McFee et al., "librosa: Audio and Music Signal Analysis in Python," Proc. 14th Python Sci. Conf., no. Scipy, pp. 18-24, 2015, doi: 10.25080/majora7b98e3ed-003.
  31. K. Simonyan and A. Zisserman, "Very deep convolutional networks for large-scale image recognition," 3rd Int. Conf. Learn. Represent. ICLR 2015 - Conf. Track Proc., pp. 1-14, 2015.
  32. M. Elgendy, Deep Learning for Vision Systems. 2020.
  33. A. Geron, Hands-on machine learning with Scikit-Learn, Keras, and TensorFlow: Concepts, tools, and techniques to build intelligent systems. 2019.
  34. I. H. Witten and E. Frank, Data Mining Practical Machine Learning Tools and Techniques. 2005.
  35. H. Luu, Machine Learning with Spark. 2021.
  36. S. Kaur and S. Kalra, "Disease prediction using hybrid K-means and support vector machine," India Int. Conf. Inf. Process. IICIP 2016 - Proc., 2017, doi: 10.1109/IICIP.2016.7975367.
  37. B. Zheng, S. W. Yoon, and S. S. Lam, "Breast cancer diagnosis based on feature extraction by hybrid of Kmeans and extreme learning machine algorithms," Expert Syst. Appl., vol. 11, no. 7, pp. 4581-4586, 2014.
  38. M. A. Khan, "An automated and fast system to identify COVID-19 from X-ray radiograph of the chest using image processing and machine learning," Int. J. Imaging Syst. Technol., vol. 31, no. 2, pp. 499-508, 2021, doi: 10.1002/ima.22564.
  39. C. Cortes and V. Vapnik, "Support-Vector Networks," vol. 297, pp. 273-297, 1995.
  40. T. Agrawal, Hyperparameter Optimization in Machine Learning: Make Your Machine Learning and Deep Learning Models More Efficient. 2021.
  41. A. U. Haq et al., "Feature Selection Based on L1-Norm Support Vector Machine and Effective Recognition System for Parkinson's Disease Using Voice Recordings," IEEE Access, vol. 7, no. March, pp. 37718-37734, 2019, doi: 10.1109/ACCESS.2019.2906350.
  42. E. Wirsansky, Hands-On Genetic Algorithms with Python. 2020.
  43. J. N. Mandrekar, "Receiver operating characteristic curve in diagnostic test assessment," J. Thorac. Oncol., vol. 5, no. 9, pp. 1315-1316, 2010, doi: 10.1097/JTO.0b013e3181ec173d.