특징 선택을 위한 혼합형 유전 알고리즘과 분류 성능 비교

Hybrid Genetic Algorithms for Feature Selection and Classification Performance Comparisons

  • 오일석 (전북대학교 전자정보공학부) ;
  • 이진선 (우석대학교 컴퓨터공학) ;
  • 문병로 (서울대학교 컴퓨터공학부)
  • 발행 : 2004.08.01

초록

이 논문은 특징 선택을 위한 새로운 혼합형 유전 알고리즘을 제안한다. 탐색을 미세 조정하기 위한 지역 연산을 고안하였고, 이들 연산을 유전 알고리즘에 삽입하였다. 연산의 미세 조정 강도를 조절할 수 있는 매개 변수를 설정하였으며, 이 변수에 따른 효과를 측정하였다. 다양한 표준 데이타 집합에 대해 실험한 결과, 제안한 혼합형 유전 알고리즘이 단순 유전 알고리즘과 순차 탐색 알고리즘에 비해 우수함을 확인하였다.

This paper proposes a novel hybrid genetic algorithm for the feature selection. Local search operations are devised and embedded in hybrid GAs to fine-tune the search. The operations are parameterized in terms of the fine-tuning power, and their effectiveness and timing requirement are analyzed and compared. Experimentations performed with various standard datasets revealed that the proposed hybrid GA is superior to a simple GA and sequential search algorithms.

키워드

참고문헌

  1. J. Kittler, 'Feature selection and extraction,' in Handbook of Pattern Recognition and Image Processing, Academic Press (Edited by T.Y. Young and K.S. Fu), pp. 59-83, 1986
  2. W. Siedlecki and J. Sklansky, 'On automatic feature selection,'On automatic feature selection,' International Journal of Pattern Recognition and Artificial Intelligence, Vol2. No.2, pp.197-220, 1988 https://doi.org/10.1142/S0218001488000145
  3. F.J. Ferri, P. Pudil, M. Hatef, and J. Kittler, 'Comparative study of techniques for large-scale feature selection,' in Pattern Recognition in Practice IV (Edited by E.S. Gelsema and L.N. Kanal), Elsevier Science, pp.403-413, 1994
  4. A. Jain and D. Zongker, 'Feature selection: evaluation, application, and small sample performance,' IEEE Tr. PAMI, Vol.19, No.2, pp.153-158 https://doi.org/10.1109/34.574797
  5. M. Kudo and J. Sklansky, 'Comparison of algorithms that select features for pattern recognition,' Pattern Recognition, Vol.33, No.1, pp.25-41, 2000 https://doi.org/10.1016/S0031-3203(99)00041-2
  6. J. Holland, Adaptation in Nature and Artificial Systems, MIT Press, 1992
  7. W. Siedlecki and J. Sklansky, 'A note on genetic algorithms for large-scale feature selection,' Pattern Recognition Letters, Vol.10, pp.335-347, 1989 https://doi.org/10.1016/0167-8655(89)90037-8
  8. F.Z. Brill, D.E. Brown, and W.N. Martin, 'Fast genetic selection of features for neural network classifiers,' IEEE Tr. Neural Networks, Vol.3, No.2, pp.324-328, March 1992 https://doi.org/10.1109/72.125874
  9. J.H. Yang and V. Honavar, 'Feature subset selection using a genetic algorithm,' IEEE Intelligent Systems, Vol.13, No2, pp.44-49, 1998 https://doi.org/10.1109/5254.671091
  10. L.I. Kuncheva and L.C. Jain, 'Nearest neighborclassifier: simultaneous editing and feature selection,' Pattern Recognition Letters, Vol.20, pp.1149-1156, 1999 https://doi.org/10.1016/S0167-8655(99)00082-3
  11. M.L. Raymer, W.F. Punch, E.D. Goodman, L.A. Kuhn, and A.K. Jain, 'Dimensionality reduction using genetic algorithms,' IEEE Tr. Evolutionary Computation, Vol.4, No.2, pp.164-171, July 2000 https://doi.org/10.1109/4235.850656
  12. P. Pudil, J. Novovicova, and J. Kittler, 'Floating search methods in feature selection,' Pattern Recognition Letters, Vol.15, pp.1119-1125, 1994 https://doi.org/10.1016/0167-8655(94)90127-9
  13. P. Jog J. Suh and D. Gucht, 'The effect of population size, heuristic crossover and local improvement on a genetic algorithm for the traveling salesman problem,' Proc. of International Conference on Genetic Algorithms, pp.110-115, 1989
  14. T.N. Bui and B.R. Moon, 'Genetic algorithm and graph partitioning,' IEEE Tr. Computers, Vol45, No.7, pp.841-855, July 1996 https://doi.org/10.1109/12.508322
  15. X. Zheng B.A. Julstrom, and W. Cheng, 'Design of vector quantization codebooks using a genetic algorithm,' Proc. of IEEE International Conf. on Evolutionary Computation, pp.525-529, 1997 https://doi.org/10.1109/ICEC.1997.592366
  16. P.M. Murphy and D.W. Aha, 'UCI repository for machine learning databases(http://www.ics.uci.edu/~mlearn/MLRepository.html,' Irvine, CA: University of California, Department of Information and Computer Science, 1994