A Comparative Study of Medical Data Classification Methods Based on Decision Tree and System Reconstruction Analysis

  • Tang, Tzung-I (Department of Information & Electronic Commerce Kainan University) ;
  • Zheng, Gang (Department of Computer Science Tianjin University of Technology) ;
  • Huang, Yalou (Computer Science Institute Nankai University) ;
  • Shu, Guangfu (Institute of Systems Science Chinese Academy of Science) ;
  • Wang, Pengtao (Department of Computer Science Tianjin University of Technology)
  • Published : 2005.06.30

Abstract

This paper studies medical data classification methods, comparing decision tree and system reconstruction analysis as applied to heart disease medical data mining. The data we study is collected from patients with coronary heart disease. It has 1,723 records of 71 attributes each. We use the system-reconstruction method to weight it. We use decision tree algorithms, such as induction of decision trees (ID3), classification and regression tree (C4.5), classification and regression tree (CART), Chi-square automatic interaction detector (CHAID), and exhausted CHAID. We use the results to compare the correction rate, leaf number, and tree depth of different decision-tree algorithms. According to the experiments, we know that weighted data can improve the correction rate of coronary heart disease data but has little effect on the tree depth and leaf number.