DOI QR코드

DOI QR Code

A Hybrid Multi-Level Feature Selection Framework for prediction of Chronic Disease

  • G.S. Raghavendra (RVR & JC College of Engineering, Atria Institute of technology) ;
  • Shanthi Mahesh (RVR & JC College of Engineering, Atria Institute of technology) ;
  • M.V.P. Chandrasekhara Rao (RVR & JC College of Engineering, Atria Institute of technology)
  • Received : 2023.12.05
  • Published : 2023.12.30

Abstract

Chronic illnesses are among the most common serious problems affecting human health. Early diagnosis of chronic diseases can assist to avoid or mitigate their consequences, potentially decreasing mortality rates. Using machine learning algorithms to identify risk factors is an exciting strategy. The issue with existing feature selection approaches is that each method provides a distinct set of properties that affect model correctness, and present methods cannot perform well on huge multidimensional datasets. We would like to introduce a novel model that contains a feature selection approach that selects optimal characteristics from big multidimensional data sets to provide reliable predictions of chronic illnesses without sacrificing data uniqueness.[1] To ensure the success of our proposed model, we employed balanced classes by employing hybrid balanced class sampling methods on the original dataset, as well as methods for data pre-processing and data transformation, to provide credible data for the training model. We ran and assessed our model on datasets with binary and multivalued classifications. We have used multiple datasets (Parkinson, arrythmia, breast cancer, kidney, diabetes). Suitable features are selected by using the Hybrid feature model consists of Lassocv, decision tree, random forest, gradient boosting,Adaboost, stochastic gradient descent and done voting of attributes which are common output from these methods.Accuracy of original dataset before applying framework is recorded and evaluated against reduced data set of attributes accuracy. The results are shown separately to provide comparisons. Based on the result analysis, we can conclude that our proposed model produced the highest accuracy on multi valued class datasets than on binary class attributes.[1]

Keywords

References

  1. Pronab Ghosh1, Sami Azam 2, Mirjam Jonkman2, (Member, Ieee),Asif Karim 2, F. M. Javed Mehedi Shamrat3, Eva Ignatious 2,Shahana Shultana1, Abhijith Reddy Beeravolu2,And Friso De Boer2 , "Efficient Prediction of Cardiovascular DiseaseUsing Machine Learning Algorithms With Reliefand LASSO Feature Selection Techniques",IEEE Access , Machine Learning . 
  2. Helen C. S. C. Lima 1, Fernando E. B. Otero 2, Luiz H. C. Merschmann 3,And Marcone J. F. Souza,"A novel hybrid feature selectionalgorithm for hierarchical classification",10.1109/ACCESS.2021.3112396, IEEE Access
  3. Naif Almusallam; Zahir Tari; Jeffrey Chan; Adil Fahad; AbdulatifAlabdulatif; Mohammed Al-Naeem ,"Towards an Unsupervised Feature Selection Method for Effective Dynamic Features", IEEE Access ( Volume: 9) 
  4. Amira S. Ashour; Majid Kamal A. Nour; Kemal Polat; Yanhui Guo; Wafaa Alsaggaf; Amira E,"A Novel Framework of Two Successive Feature Selection Levels Using Weight-Based Procedure for Voice-Loss Detection in Parkinson's Disease", IEEE Access
  5. Peng Zhou; Peipei Li; Shu Zhao; Xindong Wu,"Feature Interaction for Streaming Feature Selection", IEEE Transactions on Neural Networks and Learning Systems ( Volume: 32, Issue: 10, Oct. 2021) 
  6. Zigeng Wang; Xia Xiao; SanguthevarRajasekaran,"Novel and efficient randomized algorithms for feature selection",Big Data Mining and Analytics ( Volume: 3, Issue: 3, Sept. 2020) 
  7. Yazhou Li; Wei Dai; Weifang Zhang,"Bearing Fault Feature Selection Method Based on Weighted Multidimensional Feature Fusion",IEEE Access ( Volume: 8) 
  8. Wanfu Gao; Juncheng Hu; Yonghao Li; Ping Zhang,"Feature Redundancy Based on Interaction Information for Multi-Label Feature Selection", IEEE Access ( Volume: 8)