A Study for Complexity Improvement of Automatic Speaker Verification in PDA Environment

PDA 환경에서 자동화자 확인의 계산량 개선을 위한 연구

  • Published : 2009.07.30

Abstract

In this paper, we propose real time automatic speaker verification (ASV) system to protect personal information on personal digital assistant (PDA) device. Recently, the capacity of PDA has extended and been popular, especially for mobile environment such as mobile commerce (M-commerce). However, there still exist lots of difficulties for practical application of ASV utility to PDA device because it requires too much computational complexity. To solve this problem, we apply the method to relieve the computational burden by performing the preprocessing such as spectral subtraction and speech detection during the speech utterance. Also by applying the hidden Markov model (HMM) optimal state alignment and the sequential probability ratio test (SPRT), we can get much faster processing results. The whole system implementation is simple and compact enough to fit well with PDA device's limited memory and low CPU speed.

본 논문은 PDA 디바이스에서 개인정보를 보호하기 위한 자동화자확인 시스템을 제안한다. 최근 M-커머스와 같은 모바일 환경을 위한 PDA의 용량이 확장되고 사용이 증가되고 있다. 그러나 너무 많은 계산량 때문에 PDA 디바이스에서 자동화자확인의 실질적인 응용은 여전히 많은 어려움이 존재한다. 본 논문에서는 이러한 문제점을 해결하기 위해서 음성발성 동안 스펙트럼 차감법과 음성 검출과 같은 전처리를 수행함으로써 계산량을 줄일 수 있는 방법을 적용하였다. 또한 빠른 처리 결과를 얻기 위한 은닉마코프모델의 최적 상태 정합과 시퀀스 확률비 테스트를 적용하였다. 전체적인 시스템은 PDA디바이스의 제한된 메모리와 낮은 CPU 속도에 적합하도록 간결하게 구현하였다.

Keywords

References

  1. X. Huang, A. Acero, C. Chelba, etc., "MiPad : a Multimodel Interaction Prototype", Proc. of ICASSP2001, pp. 9-12, 2001.
  2. R. Isotani, K. Yamabana, S. Ando, etc., "An Automatic Speech Translation System on PDAs for Travel Conversation", Proc. of ICMJ'02, 2002.
  3. D. Yong-gang, X. Bo, and H. Tai-yi, "Speech Recognition Algorithm Design and Implementation on Palm PC", Journal of Computer Research & Development, vol. 37, no. 8, pp. 937-941, 2000.
  4. T. Iizuka, A. Lau and T. Suda, "A design of local resource access control for mobile agent in PDA", IEICE Trans. Commun. vol. E83-B, No. 11, 2001.
  5. Y. Yang, Z. Wu, Z. Ma, "Real time user authentication system for PDA", Proc. of the IEEE Int. Conf. on Systems, Man and Cybernetics, pp. 439-443, 2005.
  6. P.C. Joseph, "Speaker Recognition: A Tutorial", Proc. of the IEEE, vol. 85, no. 9, pp. 1437-1462, 1997.
  7. Q. Li, "A Detection approach to Search-Space Reduction for HMM Sate Alignment in Speaker Verification", IEEE Trans. SAP, vol. 9, no. 5, pp. 569-578, 2001.
  8. Q. Li, "A fast decoding algorithm based on sequential detection of the changes in distribution", Proc. Int. Conf. Spoken Language Processing, 1998.
  9. M.A. Lund and C.C. Lee, "A robust sequential test for text-independent speaker verification", J. Acoust. Soc. Am., vol. 99, no. 1, pp. 609-621, 1996. https://doi.org/10.1121/1.414516
  10. H. Noda, K. Harada, E. Kawaguchi, "A context-dependent Sequential decision for speaker verification", IEICE Trans. INF. & syst. vol. E82-D, no. 10 , pp. 1433-1436, 1999.
  11. S. F. Boll, "Suppressing of acoustic noise in speech using spectral subtraction", IEEE Trans. ASSP, vol. 27, no. 2, pp. 113-119, 1979. https://doi.org/10.1109/TASSP.1979.1163209
  12. S. Furui, "Cepstral analysis technique for automatic speaker verification", IEEE Trans. On Acoust, Speech, Signal Processing, pp. 254-272, vol. 10, no. 4, 1981.
  13. S. Parthasarathy and A. E. Rosenberg, "General phrase speaker verification using sub-word background models and likelihood-ratio scoring", ICSLP, vol. 4 ,pp. 2403 - 2406, 1996.
  14. D. Naik, "Pole-filtered cepstral mean subtraction", in Proc. ICASSP-1995, vol. 1, pp. 157-160, 1995.
  15. J. K. Chen and F. K. Soong, "An n-best candidates-based discriminative training for speech-recognition applications", IEEE Trans. Speech Audio Processing, vol. 2, pp. 206-216, 1994.