DOI QR코드

DOI QR Code

Development of Autonomous Mobile Robot with Speech Teaching Command Recognition System Based on Hidden Markov Model

HMM을 기반으로 한 자율이동로봇의 음성명령 인식시스템의 개발

  • 조현수 (부산대학교 지능기계공학과) ;
  • 박민규 (부산대학교 기계기술연구소) ;
  • 이현정 (부산대학교 지능기계공학과) ;
  • 이민철 (부산대학교 기계공학부)
  • Published : 2007.08.01

Abstract

Generally, a mobile robot is moved by original input programs. However, it is very hard for a non-expert to change the program generating the moving path of a mobile robot, because he doesn't know almost the teaching command and operating method for driving the robot. Therefore, the teaching method with speech command for a handicapped person without hands or a non-expert without an expert knowledge to generate the path is required gradually. In this study, for easily teaching the moving path of the autonomous mobile robot, the autonomous mobile robot with the function of speech recognition is developed. The use of human voice as the teaching method provides more convenient user-interface for mobile robot. To implement the teaching function, the designed robot system is composed of three separated control modules, which are speech preprocessing module, DC servo motor control module, and main control module. In this study, we design and implement a speaker dependent isolated word recognition system for creating moving path of an autonomous mobile robot in the unknown environment. The system uses word-level Hidden Markov Models(HMM) for designated command vocabularies to control a mobile robot, and it has postprocessing by neural network according to the condition based on confidence score. As the spectral analysis method, we use a filter-bank analysis model to extract of features of the voice. The proposed word recognition system is tested using 33 Korean words for control of the mobile robot navigation, and we also evaluate the performance of navigation of a mobile robot using only voice command.

Keywords

References

  1. 성경학, 김진오, 김성권, '공장자동화를 위한 지능 로봇 시스템,' 제어 . 자동화 . 시스템 공학회지, 제 2 권 제 3 호, pp. 16-24, 1997
  2. 윤지섭, '원자력용 매니퓰레이터의 세계적 개발 현황,' 제어.자동화. 시스템 공학회지, 제 2 권 제 5 호, pp. 17-25. 1997
  3. P. J. McKerrow, Introduction to Robotics, Addison Wesley, 1993
  4. 구병완, '음성인식기술 현황과 전망,' 전자공학회지, 제 20 권 제 5 호, pp. 548-555, 1993
  5. D. B. Roe and J. G. Wilpon, 'Whither speech recognition: The next 25 Years,' IEEE Communications Magazine, vol. 31, no. 11, pp. 54-62, November 1993
  6. X. D. Huang, Y. Ariki, and M. A. Jack, hidden markov models for speech Recognition, Edinburgh University Press, Edinburgh, England, 1990
  7. C. S. Myers and L. R Rabiner, 'A comparative study of several dynamic time-warping algorithms for connected-word recognition,' Bell system Tech. J., vol. 60, no. 7, pp. 1389-1409, September 1981 https://doi.org/10.1002/j.1538-7305.1981.tb00272.x
  8. H. F. Siverman and D. P. Morgan, 'The application of dynamic programming to connected speech recognition,' IEEE Transactions on Acoustics, Speech, and Signal Processing Magazine, vol. 7, pp. 6-25, July 1990
  9. L. R. Rabiner, 'A tutorial on hidden markov models and selected application in speech recognition,' Proc. of the IEEE, vol. 77, no. 2, pp. 257-285, February 1989 https://doi.org/10.1109/5.18626
  10. L. R Rabinar and B. H. Juang, 'An introduction to hidden markov models,' IEEE ASSP Magazine, vol. 3, no. 1, pp. 4-16, January 1986 https://doi.org/10.1109/MASSP.1986.1165351
  11. R. P. Lippmann, 'An introduction to computing with neural nets,' IEEE Transactions on Acoustics, Speech, and Signal processing Magazine, vol. 4, pp. 4-22, Apr. 1987
  12. Jang, Sun and Mizutani, Neuro-Fuzzy and Soft Computing, Prentice-Hall, 1997
  13. J. G. Wilpon and L. R. Rabiner, 'A modified K-means clustering algorithm for use in isolated word recognition,' IEEE Trans, on ASSP, vol. 33, no. 3, pp. 587-594, 1985 https://doi.org/10.1109/TASSP.1985.1164581
  14. J. Makhoul, S. Roucos, and H. Gish, 'Vector quantization in speech coding,' Proc. of IEEE, vol. 73, no. 11, pp. 1551-1558, 1985 https://doi.org/10.1109/PROC.1985.13340
  15. K. F. Lee, Automatic Speech Recognition, Kluwer Academic Publshers, Norwell, M.A. U.S.A, 1976
  16. L. Rabiner and B.-H. Juang, Fundamentals of Speech Recognition, New Jersey: Prentice Hall, 1993
  17. J. A. Freeman and D. M. Skapura, Neural Networks: Algorithms, Applications, and Programming Techniques, Addison Wesley, 1991
  18. K. J. Lang and Waibel A., 'A time-delay neural network architecture for isolated work recognition,' Neural Networks, vol. 3, pp. 23-43, 1990 https://doi.org/10.1016/0893-6080(90)90044-L