Performance Comparison of Decision Trees of J48 and Reduced-Error Pruning

  • Jin, Hoon (Dept. of Computer Engineering, Sungkyunkwan University) ;
  • Jung, Yong Gyu (Department of Medical IT Marketing, Eulji University)
  • Received : 2015.11.17
  • Reviewed : 2016.01.23
  • Published : 2016.03.31

Abstract

With the advent of big data, data mining is increasingly used in decision-making fields to extract hidden, meaningful information from large amounts of data. As demand for uncovering the meaning hidden in data grows exponentially, choosing which data mining algorithm to use, and how to use it, becomes ever more important. Several data mining algorithms are widely used in biology and clinical practice, including logistic regression, neural networks, support vector machines, and a variety of statistical techniques. This paper compares the classification performance of two representative machine learning algorithms, J48 and REPTree. The performance comparison identifies which of the two classifies more accurately, and the more accurate algorithm allows more accurate prediction for the goal of the experiment. Based on this, the approach is expected to support detailed classification and distinction that are relatively difficult to make visually.
