Key-word Error Correction System using Syllable Restoration Algorithm

Ahn, Chan-Shik;Oh, Sang-Yeob;

doi:10.9708/jksci.2010.15.10.165

Journal of the Korea Society of Computer and Information (한국컴퓨터정보학회논문지)

Volume 15, Issue 10 / Pages 165-172 / 2010 / pISSN 1598-849X / eISSN 2383-9945

Korean Society of Computer Information (한국컴퓨터정보학회)


음절 복원 알고리즘을 이용한 핵심어 오류 보정 시스템

Ahn, Chan-Shik (Department of Computer Engineering, Kwangwoon University);
Oh, Sang-Yeob (Computer Software, College of IT, Kyungwon University)

Received : 2010.05.17
Accepted : 2010.08.17
Published : 2010.10.31

https://doi.org/10.9708/jksci.2010.15.10.165


Abstract

There are two approaches to error correction in vocabulary recognition systems: one based on error pattern matching and one based on vocabulary semantic patterns. Both fail to analyze key-words semantically for error correction. To address this, this paper proposes a key-word error correction system using a syllable restoration algorithm. The system performs semantic analysis on the recognized phoneme sequence to determine the meaning the phonemes carry, and the syllable restoration algorithm restores the string to its form before phonological changes were applied. This allows key-words to be analyzed clearly and reduces misrecognition. The error correction rate was obtained using phoneme likelihood and confidence, and error correction was performed on vocabulary judged erroneous during vocabulary recognition. In a performance comparison with the method using error pattern learning, the error pattern matching based method, and the vocabulary semantic pattern based method, the system showed a recognition improvement of 2.3%.

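(Translated from the Korean abstract) Error correction methods for vocabulary recognition systems include the error pattern matching based method and the vocabulary semantic pattern based method, and neither can analyze key-words semantically for error correction. To improve on this, this paper proposes a key-word error correction system using a syllable restoration algorithm. The recognized phoneme sequence goes through semantic analysis to identify the meaning the phonemes carry, and the syllable restoration algorithm restores it to the string as it was before phonological changes were applied, so key-words can be analyzed clearly and misrecognition reduced. For system analysis, the error correction rate was obtained using phoneme similarity and confidence, and error correction was performed on vocabulary judged erroneous during vocabulary recognition. Performance evaluation against the method using error pattern learning, the error pattern matching based method, and the vocabulary semantic pattern based method showed a recognition improvement of 3.0%.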
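
The abstract does not give the restoration rules or the likelihood and confidence formulas, so the following Python sketch only illustrates the general idea and is not the authors' algorithm. It assumes a single phonological change, Korean liaison (a final consonant pronounced as the initial of the next syllable, as in 음악이 heard as 으마기), and it uses `difflib.SequenceMatcher` as a stand-in similarity score. The names `restore_candidates`, `correct_keyword`, the lexicon, and the threshold of 0.7 are all hypothetical.

```python
from difflib import SequenceMatcher
from itertools import product

HANGUL_BASE = 0xAC00  # code point of the first precomposed syllable
CHO = "ㄱㄲㄴㄷㄸㄹㅁㅂㅃㅅㅆㅇㅈㅉㅊㅋㅌㅍㅎ"  # 19 initial consonants
JONG = ["", "ㄱ", "ㄲ", "ㄳ", "ㄴ", "ㄵ", "ㄶ", "ㄷ", "ㄹ", "ㄺ", "ㄻ", "ㄼ", "ㄽ",
        "ㄾ", "ㄿ", "ㅀ", "ㅁ", "ㅂ", "ㅄ", "ㅅ", "ㅆ", "ㅇ", "ㅈ", "ㅊ", "ㅋ", "ㅌ",
        "ㅍ", "ㅎ"]  # 28 finals, index 0 means "no final"
IEUNG = CHO.index("ㅇ")  # silent initial


def decompose(syllable):
    """Split one precomposed Hangul syllable into (initial, vowel, final) indices."""
    offset = ord(syllable) - HANGUL_BASE
    return offset // 588, (offset % 588) // 28, offset % 28


def compose(cho, jung, jong):
    """Inverse of decompose."""
    return chr(HANGUL_BASE + (cho * 21 + jung) * 28 + jong)


def restore_candidates(pronounced):
    """Return spellings that liaison could have turned into `pronounced`.

    Assumes the input contains only precomposed Hangul syllables.
    The unchanged input is always the first candidate.
    """
    syllables = [decompose(s) for s in pronounced]
    # Boundaries where the next initial may originally have been a final.
    slots = [
        i for i in range(len(syllables) - 1)
        if syllables[i][2] == 0
        and syllables[i + 1][0] != IEUNG
        and CHO[syllables[i + 1][0]] in JONG
    ]
    results = []
    for choice in product([False, True], repeat=len(slots)):
        work = [list(s) for s in syllables]
        for use, i in zip(choice, slots):
            if use:
                # Move the consonant back to the previous syllable's final.
                work[i][2] = JONG.index(CHO[work[i + 1][0]])
                work[i + 1][0] = IEUNG
        results.append("".join(compose(*s) for s in work))
    return results


def correct_keyword(pronounced, lexicon, threshold=0.7):
    """Pick the lexicon key-word most similar to any restored candidate.

    Returns (word, score), with word set to None when the best score
    falls below the (hypothetical) acceptance threshold.
    """
    best_word, best_score = None, 0.0
    for candidate in restore_candidates(pronounced):
        for word in lexicon:
            score = SequenceMatcher(None, candidate, word).ratio()
            if score > best_score:
                best_word, best_score = word, score
    if best_score < threshold:
        return None, best_score
    return best_word, best_score


if __name__ == "__main__":
    print(restore_candidates("으마기"))
    print(correct_keyword("으마기", ["음악", "날씨"]))
```

Generating every candidate is exponential in the number of eligible syllable boundaries, which is acceptable for short key-words but would need pruning for whole utterances. A real system would presumably replace the string similarity with acoustic phoneme likelihoods and a confidence measure, as the abstract describes.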


