An Example-based Korean Standard Industrial and Occupational Code Classification

예제기반 한국어 표준 산업/직업 코드 분류

  • 임희석 (한신대학교 컴퓨터정보소프트웨어학부)
  • Published : 2006.08.01

Abstract

Coding of occupational and industrial codes is a major operation in census survey of Korean statistics bureau. The coding process has been done manually. Such manual work is very labor and cost intensive and it usually causes inconsistent results. This paper proposes an automatic coding system based on example-based learning. The system converts natural language input into corresponding numeric codes using code generation system trained by example-based teaming after applying manually built rules. As experimental results performed with training data consisted of 400,000 records and 260 manual rules, the proposed system showed about 76.69% and 99.68% accuracy for occupational code classification and industrial code classification, respectively.

Keywords

Occupational code classification;Industrial code classification;Example-based learning