DOI QR코드

DOI QR Code

Implementation of Korean Vowel 'ㅏ' Recognition based on Common Feature Extraction of Waveform Sequence

파형 시퀀스의 공통 특징 추출 기반 모음 'ㅏ' 인식 구현

  • 노원빈 (숙명여자대학교 멀티미디어과학과) ;
  • 이종우 (숙명여자대학교 멀티미디어과학과)
  • Received : 2014.02.07
  • Accepted : 2014.09.26
  • Published : 2014.11.15

Abstract

In recent years, computing and networking technologies have been developed, and the communication equipments have become smaller and the mobility has increased. In addition, the demand for easily-operated speech recognition has increased. This paper proposes method of recognizing the Korean phoneme 'ㅏ'. A phoneme is the smallest unit of sound, and it plays a significant role in speech recognition. However, the precise recognition of the phonemes has many obstacles since it has many variations in its pronunciation. This paper proposes a simple and efficient method that can be used to recognize a Korean vowel 'ㅏ'. The proposed method is based on the common features that are extracted from the 'ㅏ' waveform sequences, and this is simpler than when using the previous complex methods. The experimental results indicate that this method has a more than 90 percent accuracy in recognizing 'ㅏ'.

최근 네트워크와 컴퓨팅 기술의 발달로 정보기기가 소형화되고 이동성이 중요시되면서 간편하게 제어할 수 있는 음성 인식에 대한 수요가 증가하고 있다. 본 논문은 음성 인식 시스템의 일부로써 한국어 음소 중 모음 'ㅏ' 인식에 대한 연구 결과를 제시한다. 음소는 음성을 구성하고 있는 최소단위로서 음성을 인식하는데 매우 중요한 역할을 한다. 그러나 각각의 음소들을 정확하게 인식하려면 발음의 다양성 등으로 인해 많은 어려움이 존재한다. 본 논문에서는 한국어 음소 중 모음 'ㅏ'를 인식하기 위한 간단하고도 새로운 방식을 제안한다. 제안된 'ㅏ' 인식 휴리스틱은 파형 시퀀스의 공통 특징 추출을 기반으로 이루어졌으며, 이는 기존의 복잡한 방법에 비해 간단하면서도 실험 결과 90% 이상의 성공률로 'ㅏ'를 인식하는 것을 확인하였다.

Keywords

Acknowledgement

Supported by : 한국연구재단

References

  1. KOCCA, Culture Technology(CT) in-depth Report, Nov. 2011
  2. D. R. REDDY, "Speech recognition by machine: A review," IEEE, Vol. 64, pp. 501-531, 1976. https://doi.org/10.1109/PROC.1976.10158
  3. Y. J. Kim, H. L. Kim, and J. H. Jung, "A Study on the Korean Syllable As Recognition Unit," Journal of Acoustical Society of Korea, Vol. 16, No. 3, pp. 64-72, 1997. (in Korean)
  4. Y. Y., Seo, J. D. Song, and J. H. Lee, "Phoneme Segmentation in Consideration of Speech feature in Korean Speech Recognition," Journal of Korean Society for Internet Information, Vol. 2, No. 1, pp. 31-38, 2001. (in Korean)
  5. M. Y. Nam, J. J. Lee, J. H. Park, and S. Y. No, "Recognition of Korean Fricatives and Affricates Using Modified Teager Energy Measurement Method," Proc. of the IEEK Conference 1993, Vol. 15, No. 1, pp. 23-26, 1993. (in Korean)
  6. Y. K. Lee, "Speech Interface Technology and Service Trend under the Smart Phone Environment," Information & Communications Magazine, Vol. 29, No. 4, pp. 3-9, 2012. (in Korean)
  7. H. S. Baek, S. H. Cho, and D. S. Yook, "Connected Korean Digit Speech Recognition Using Syllablebased Recognition Units," Proc. of the KMMS Conference 2010, pp. 514-515, 2010. (in Korean)
  8. Rabiner, L, "A tutorial on hidden Markov models and selected applications in speech recognition," Proc. of the IEEE, Vol. 77, pp. 257-286, 1989. https://doi.org/10.1109/5.18626
  9. Lang, K. J. and A. Waibel, "A Time-Delay Neural Network Architecture for Isolated Word Recognition," Neural Networks, Vol. 3, pp. 23-43, 1990. https://doi.org/10.1016/0893-6080(90)90044-L
  10. H. Jung, Korean Speech Recognition Using Neural Networks, Korean Institute of Information Scientists and Engineers, pp. 63-82, 1993.
  11. Y. S. Joe, K. J. Lee, K. T. Kim, and J. K. Hong, "Korean Phoneme Recognition using HMM," IEEK, Vol. 16, No. 1, pp. 81-84, 1994.
  12. D, K. Kim, C. G. Jeong, and H. Jeong, "Hierarchy and Modulatity in Time-Delay Neural Networks for Korean Phoneme Recognition," Korean Institute of Information Scientists and Engineers, Vol. 18, No. 1, pp. 565-568, 1991.
  13. J. H. Lee, J. W. Lee, and J. W. Lee, "Korean Phonemes 'ㅅ', 'ㅈ', 'ㅊ' Recognition based on Sign Distribution Volatility," Communications of the Korean Institute of Information Scientists and Engineers, Vol. 19, pp. 377-382, 2013. (in Korean)
  14. S. K. Choi, J. W. Lee, and J. W. Lee, "Korean Vowel 'ㅏ' Recognition based on Wave Sequence Detection," Proc. of the Digital Contents Society Joint Conference 2013, Vol. 14, pp. 577-579, 2013. (in Korean)

Cited by

  1. Speech Recognition for the Korean Vowel 'ㅣ' based on Waveform-feature Extraction and Neural-network Learning vol.22, pp.2, 2016, https://doi.org/10.5626/KTCP.2016.22.2.69
  2. Speech Recognition of the Korean Vowel 'ㅐ', Based on Time Domain Sequence Patterns vol.21, pp.11, 2015, https://doi.org/10.5626/KTCP.2015.21.11.713