DOI QR코드

DOI QR Code

Vocabulary Recognition Performance Improvement using k-means Algorithm for GMM Support

GMM 지원을 위해 k-means 알고리즘을 이용한 어휘 인식 성능 개선

  • Received : 2014.11.06
  • Accepted : 2015.02.20
  • Published : 2015.02.28

Abstract

General CHMM vocabulary recognition system is model observation probability for vocabulary recognition of recognition rate's low. Used as the limiting unit is applied only to some problem in the phoneme model. Also, they have a problem that does not conform to the needs of the search range to meaning of the words in the vocabulary. Performs a phoneme recognition using GMM to improve these problems. We solve the problem according to the limited search words characterized by an improved k-means algorithm. Measure the effectiveness represented by the accuracy and reproducibility as compared to conventional system performance experiments. Performance test results accuracy is 83%p, and recall is 67%p.

일반적인 CHMM 어휘 인식 시스템은 어휘 인식에 대한 모델들의 관측 확률 인식률이 낮고, 일부 단위 음소 모델에만 적용되어 제한적으로 사용되는 문제점이 있다. 또한, 어휘 탐색에서 어휘의 의미가 다양하여 탐색된 어휘가 사용자의 요구에 부합되지 않는 문제점을 가진다. 이러한 문제를 개선하기 위해 GMM(Gaussian Mixture Model)을 이용한 음소인식을 수행하고, 개선된 k-means 알고리즘을 이용하여 어휘 특성에 따른 제한적인 탐색 문제점을 해결하였다. 성능 실험은 기존의 시스템과 비교하여 정확도와 재현율로 대변되는 효과성을 측정하였으며, 성능 실험 결과 정확도는 83%, 재현율은 67%로 나타났다.

Keywords

References

  1. Sang-Yeob Oh. Selective Speech Feature Extraction using Channel Similarity in CHMM Vocabulary Recognition. The Journal of digital policy and management. Vol. 11, No. 10, pp. 453-458, 2013.
  2. Chan-Shik Ahn, Sang-Yeob Oh. Vocabulary Recognition Retrieval Optimized System using MLHF Model. Journal of the Korea Society of Computer and Information. Vol. 14, No. 10, pp. 217-223, 2009.
  3. Chan-Shik Ahn, Sang-Yeob Oh. Echo Noise Robust HMM Learning Model using Average Estimator LMS Algorithm. The Journal of Digital Policy and Management. Vol. 10, No. 10, pp. 277-282, 2012.
  4. A. Srinivasan, Speech Recognition Using Hidden Markov Model, Applied Mathematical Sciences, vol. 5, no. 79, pp. 3943-3948, 2011.
  5. Campbell, W. M., Sturim, D. E., Reynolds, D. A., Solomonoff, A. SVM based speaker verification using a GMM supervector kernel and NAP variability compensation. Proc. ICASSP, No. 1, pp. 97-100, 2006.
  6. Chan-Shik Ahn, Sang-Yeob Oh. CHMM Modeling using LMS Algorithm for Continuous Speech Recognition Improvement. The Journal of digital policy and management. Vol. 10, No. 11, pp. 377-382, 2012.
  7. Zhang, Y., Xu, J., Yan, Z. J., & Huo, Q. An i-vector based approach to training data clustering for improved speech recognition. Proc. Interspeech, pp. 1247-1250. 2011.
  8. Beaufays, F., Vanhoucke, V., & Strope, B. Unsupervised discovery and training of maximally dissimilar cluster models. Proc. Interspeech, pp. 66-69, 2010.
  9. Sang-Yeob Oh. Improving Phoneme Recognition based on Gaussian Model using Bhattacharyya Distance Measurement Method. Journal of Korea Multimedia Society. Vol. 14, No. 1, pp. 85-93, 2011. https://doi.org/10.9717/kmms.2011.14.1.085
  10. Chan-Shik Ahn, Sang-Yeob Oh. Gaussian Model Optimization using Configuration Thread Control In CHMM Vocabulary Recognition. The Journal of Digital Policy and Management. Vol. 10, No. 7, pp. 167-172, 2012.
  11. Caban, A. Dolinska, B. Budzinski, G. Oczkowicz, G. Ostrozka-Cieslik, A. Cierpka, L. Ryszka, F. The Effect of HTK Solution Modification by Addition of Thyrotropin and Corticotropin on Biochemical Indices Reflecting Ischemic Damage to Porcine Kidney. Transplantation proceedings. Vol. 45, No. 5, pp. 1720-1722, 2013 https://doi.org/10.1016/j.transproceed.2013.01.094
  12. Chan-Shik Ahn, Sang-Yeob Oh. User's Individuality Preference Recommendation System using Improved k-means Algorithm. Journal of the Korea society of computer and information. Vol. 15 No. 8, pp. 141-148, 2010. https://doi.org/10.9708/jksci.2010.15.8.141
  13. Myoung-hwan Ahn, Joon-hee Kwon. Ontology based Context-Aware Recommendation System using Concept Hierarchy. Journal of Korean Society for Internet Information. Vol. 8, No. 5, pp. 81-89, 2007.
  14. Sung-Hwa Hong, Suk-Yong Jung.The Study for the Image Quality Measurement in IPTV. Journal of the Korea Convergence Society. Vol. 2, No. 3, pp. 39-43, 2011.
  15. Nam-Hoon Kim, Tong-Queue Lee, Suk-Yong Jung, Hae-Yong Park. A Study on Integrated Billing System for Multi-language. Journal of the Korea Convergence Society. Vol. 3, No. 3, pp. 1-5, 2012.