COVID-19 Prediction model using Machine Learning

  • Jadi, Amr (Department of Computer Science and Information, College of Computer Science and Engineering, University of Ha'il)
  • Received : 2021.08.05
  • Published : 2021.08.30

Abstract

The outbreak of the deadly virus COVID-19 is reported to have infected about 173 million (17.3 crore) people around the globe since 2019. The outbreak continues to infect new people to this day, although it is said to be largely under control. Vaccines introduced around the world can help mitigate the risk of the virus. Alongside the work of medical professionals, prediction models can help estimate the risk of infection from available datasets. This paper presents a machine learning approach that uses regression models to predict the output from a dataset whose indicators are grouped into active, tested, recovered, and critical cases, along with regions and cities, with most of the data coming from Dubai. The number of active cases is therefore predicted from the other indicators and attributes. The coefficient of determination (R²) is 0.96, which is considered promising. This model can be used as a framework, among others, to predict the resources related to the outbreak.
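
The abstract gives neither the model specification nor the dataset, so the following is only a minimal sketch of the general idea: fit a multiple linear regression that predicts active cases from tested, recovered, and critical counts, then score it with the coefficient of determination on held-out rows. The data are synthetic, and the coefficients used to generate them, the noise level, the 150/50 split, and all function names are assumptions made for illustration, not values or methods from the paper.

```python
import random


def fit_ols(rows, targets):
    """Fit ordinary least squares with an intercept via the normal equations."""
    # Prepend a constant 1.0 so the first coefficient is the intercept.
    X = [[1.0] + list(r) for r in rows]
    k = len(X[0])
    # Build the augmented system [X^T X | X^T y].
    A = [[sum(x[i] * x[j] for x in X) for j in range(k)]
         + [sum(x[i] * y for x, y in zip(X, targets))]
         for i in range(k)]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(k):
        pivot = max(range(col, k), key=lambda r: abs(A[r][col]))
        A[col], A[pivot] = A[pivot], A[col]
        for r in range(k):
            if r != col:
                factor = A[r][col] / A[col][col]
                A[r] = [a - factor * b for a, b in zip(A[r], A[col])]
    return [A[i][k] / A[i][i] for i in range(k)]


def predict(coefs, row):
    """Apply the fitted intercept and slopes to one feature row."""
    return coefs[0] + sum(c * v for c, v in zip(coefs[1:], row))


def r_squared(actual, predicted):
    """R^2 = 1 - SS_res / SS_tot."""
    mean = sum(actual) / len(actual)
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_tot = sum((a - mean) ** 2 for a in actual)
    return 1.0 - ss_res / ss_tot


# Synthetic stand-in for the dataset (hypothetical; NOT the paper's data).
rng = random.Random(0)
features, active = [], []
for _ in range(200):
    tested = rng.uniform(1000, 5000)
    recovered = rng.uniform(0, 500)
    critical = rng.uniform(0, 50)
    features.append((tested, recovered, critical))
    # Assumed linear relationship plus Gaussian noise, for illustration only.
    active.append(0.04 * tested - 0.3 * recovered + 2.0 * critical
                  + rng.gauss(0, 5))

# Hold out the last 50 rows so the score reflects unseen data.
train_X, test_X = features[:150], features[150:]
train_y, test_y = active[:150], active[150:]

coefs = fit_ols(train_X, train_y)
r2 = r_squared(test_y, [predict(coefs, row) for row in test_X])
print("intercept and slopes:", coefs)
print("held-out R^2:", r2)
```

Scoring on held-out rows rather than on the training rows is a design choice of this sketch; the abstract does not say how its reported value of 0.96 was computed.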
