DOI QR코드

DOI QR Code

A Feature Selection-based Ensemble Method for Arrhythmia Classification

  • Namsrai, Erdenetuya (Database/Bioinformatics Laboratory, Chungbuk National University) ;
  • Munkhdalai, Tsendsuren (Database/Bioinformatics Laboratory, Chungbuk National University) ;
  • Li, Meijing (Database/Bioinformatics Laboratory, Chungbuk National University) ;
  • Shin, Jung-Hoon (Dept. of Software Engineering, Chonbuk National University) ;
  • Namsrai, Oyun-Erdene (Dept. of Information Technology, Mongolian National University) ;
  • Ryu, Keun Ho (Database/Bioinformatics Laboratory, Chungbuk National University)
  • Received : 2012.02.17
  • Accepted : 2012.11.22
  • Published : 2013.03.31

Abstract

In this paper, a novel method is proposed to build an ensemble of classifiers by using a feature selection schema. The feature selection schema identifies the best feature sets that affect the arrhythmia classification. Firstly, a number of feature subsets are extracted by applying the feature selection schema to the original dataset. Then classification models are built by using the each feature subset. Finally, we combine the classification models by adopting a voting approach to form a classification ensemble. The voting approach in our method involves both classification error rate and feature selection rate to calculate the score of the each classifier in the ensemble. In our method, the feature selection rate depends on the extracting order of the feature subsets. In the experiment, we applied our method to arrhythmia dataset and generated three top disjointed feature sets. We then built three classifiers based on the top-three feature subsets and formed the classifier ensemble by using the voting approach. Our method can improve the classification accuracy in high dimensional dataset. The performance of each classifier and the performance of their ensemble were higher than the performance of the classifier that was based on whole feature space of the dataset. The classification performance was improved and a more stable classification model could be constructed with the proposed approach.

Keywords

References

  1. Schapire RE, Freund Y, Bartlett P, Lee WS, "Boosting the margin: A new explanation for the effectiveness of voting methods", Statistics, 1998, pp.1651-1686.
  2. Alexey Tsymbal, Seppo Puuronen, David Patterson, "Feature selection for Simple Bayesian Classifiers", ISSU, 2002, pp.592-600.
  3. Eugene Tuv, Alexander Borisov, George Runger, "Feature selection with Ensembles, Artificial variables, and Redundancy elimination", Journal of Machine Learning Research, Vol.10, 2009, pp.1341-1366.
  4. Ching Wei Bang, "New Ensemble Machine Learning Method for Classification and Prediction on Gene Expression Data", IEEE, 2006, pp.3478-3481.
  5. Mohamed Ezzeldin A.Bashir, Dong Gyu Lee, Keun Ho Ryu, "Nested Ensemble Technique for Excellence Real time cardiac Health Monitoring", BIOCOMP, 2010.
  6. S.Oh, M.S Lee, B. Zhang, "Ensemble learning with active example selection for imbalanced biomedical data classification", IEEE/ACM transactions on computational biology and bioinformatics, Vol.8, No.2, 2011.
  7. Thara Soman, Patrick O.Bobbie, "Classification of Arrhythmia Using Machine Learning Techniques", WSEAS Transactions on computers, Vol.4, June, 2005, pp.548-552.
  8. Mohd Fauzi bin Othman, Thomas Moh Shan Yau, "Comparison of Different Classification Techniques Using WEKA for Breast cancer", IFMBE Proceedings Vol.15, 2007, pp.520-523.
  9. Asha Gowda Karegowda, M.A.Jayaram, A.S. Manjunath, "Feature Subset Selection Problem using Wrapper Approach in Supervised learning", International journal of Computer applications, Vol.1, No.7, 2010, pp.13-17.
  10. Pengy Yang, Yee Hwa Yang, Bing B.Zhou, "A Review of Ensemble Methods in Bioinformatics", Current Bioinformatics, Vol.5, No.4, 2010, pp.296-308. https://doi.org/10.2174/157489310794072508
  11. Ching Wei Bang, "New Ensemble Machine Learning Method for Classification and Prediction on Gene Expression Data", IEEE, 2006, pp.3478-3481.
  12. UCI Machine Learning Repository, http://archive.ics.edu/ml/datasets/Arrhythmia
  13. Weka web site, http://www.cs.waikato.ac.nz/ml/WEKA/
  14. F.Yaghouby, A.Ayatollahi, R.Soleimani, "Classification of Cardiac Abnormalities Using Reduced Features of Heart Rate Variability Signal", World Applied Sciences Journal, Vol.6, 2009, pp.1547-1554.
  15. M.R.Homaeinezhad, E.Tavakkoli, M.Habibi, "Combination of Different Classifiers for cardiac Arrhythmia Recognition", World Academy of Science, Engiinering and technology, 2011, pp.1189-1200.
  16. Zhi Hua Zhou, "Ensemble Learning", Encyclopedia of Bioinformatics, 2009, pp.270-273.
  17. Ho Sun Shon, Kyung-Sook Yang, Keun Ho Ryu, "Feature Selection Method using WF-LASSO for Gene Expression Data analysis", ACM-BCM, 2011, pp.522-525.
  18. Jon Atli Benediktsson, Havier Ceamanos Garcia, Bjorn Waske, Jocelyn Chanussor, "Ensemble Methods for Classification of Hyperspectral Data", IEEE, 2008, pp.62-65.
  19. Dymitr Ruta, Bogdan Gabrys, "Classifier selection for majority voting", Information fusion, Vol.6, 2005, pp.63-81. https://doi.org/10.1016/j.inffus.2004.04.008
  20. Guangrong Li, Xiaohua Hu, Xiajiong Shen, Xin Chen, "A Novel Unsupervised feature Selection Method for Bioinformatics Data Sets through Feature Clustering", IEEE, 2008, pp.41-47.
  21. Lior Rokach, Barak Chizi, "A Methodology for Improving the Performance of Non-ranker Feature Selection Filters", International Journal Pattern Recognition and Artificial Intelligence, 2007, pp.1-20.

Cited by

  1. Missing Values and Optimal Selection of an Imputation Method and Classification Algorithm to Improve the Accuracy of Ubiquitous Computing Applications vol.2015, 2015, https://doi.org/10.1155/2015/538613
  2. A hybrid ensemble for classification in multiclass datasets: An application to oilseed disease dataset vol.124, 2016, https://doi.org/10.1016/j.compag.2016.03.026
  3. A smartphone based real-time daily activity monitoring system vol.17, pp.3, 2014, https://doi.org/10.1007/s10586-013-0335-y
  4. Reliable Fault Classification of Induction Motors Using Texture Feature Extraction and a Multiclass Support Vector Machine vol.2014, 2014, https://doi.org/10.1155/2014/814593
  5. Discriminant Feature Distribution Analysis-Based Hybrid Feature Selection for Online Bearing Fault Diagnosis in Induction Motors vol.2016, 2016, https://doi.org/10.1155/2016/7145715
  6. Watermelon lycopene β-cyclase: promoter characterization leads to the development of a PCR marker for allelic selection vol.200, pp.3, 2014, https://doi.org/10.1007/s10681-014-1158-5
  7. Output Effect Evaluation Based on Input Features in Neural Incremental Attribute Learning for Better Classification Performance vol.7, pp.1, 2015, https://doi.org/10.3390/sym7010053
  8. Feature Selection and Parameter Optimization of Support Vector Machines Based on Modified Artificial Fish Swarm Algorithms vol.2015, 2015, https://doi.org/10.1155/2015/604108
  9. The Development of a Tourism Attraction Model by Using Fuzzy Theory vol.2015, 2015, https://doi.org/10.1155/2015/643842
  10. Multiclass Classification of Cardiac Arrhythmia Using Improved Feature Selection and SVM Invariants vol.2018, pp.1748-6718, 2018, https://doi.org/10.1155/2018/7310496