Proceedings of the Korean Society for Bioinformatics Conference (한국생물정보학회:학술대회논문집)
- 2005.09a
- /
- Pages.183-187
- /
- 2005
Development of Correlation Based Feature Selection Method by Predicting the Markov Blanket for Gene Selection Analysis
- Adi, Made (Bioinformatics Research Centre (BIRC)) ;
- Yun, Zhen (Nanyang Technological University) ;
- Keong, Kwoh-Chee (Nanyang Technological University)
- Published : 2005.09.22
Abstract
In this paper, we propose a heuristic method to select features using a Two-Phase Markov Blanket-based (TPMB) algorithm. The first phase, filtering phase, of TPMB algorithm works by filtering the obviously redundant features. A non-linear correlation method based on Information theory is used as a metric to measure the redundancy of a feature [1]. In second phase, approximating phase, the Markov Blanket (MB) of a system is estimated by employing the concept of cross entropy to identify the MB. We perform experiments on microarray data and report two popular dataset, AML-ALL [3] and colon tumor [4], in this paper. The experimental results show that the TPMB algorithm can significantly reduce the number of features while maintaining the accuracy of the classifiers.
Keywords