Emotion recognition in speech using hidden Markov model

;;

Journal of the Institute of Convergence Signal Processing (융합신호처리학회논문지)

Volume 3 Issue 3
/
Pages.21-26
/
2002
/
2765-1134(pISSN)

The Korea Institute of Convergence Signal Processing (한국융합신호처리학회)

Emotion recognition in speech using hidden Markov model

은닉 마르코프 모델을 이용한 음성에서의 감정인식

김성일 (중국 청화대학 음성기술센타) ;
정현열 (영남대학교 공과대학 정보통신공학과)

Published : 2002.07.01

PDF

Download PDF

⟨ Previous Next ⟩

Abstract

This paper presents the new approach of identifying human emotional states such as anger, happiness, normal, sadness, or surprise. This is accomplished by using discrete duration continuous hidden Markov models(DDCHMM). For this, the emotional feature parameters are first defined from input speech signals. In this study, we used prosodic parameters such as pitch signals, energy, and their each derivative, which were then trained by HMM for recognition. Speaker adapted emotional models based on maximum a posteriori(MAP) estimation were also considered for speaker adaptation. As results, the simulation performance showed that the recognition rates of vocal emotion gradually increased with an increase of adaptation sample number.

본 논문은 분노, 행복, 평정, 슬픔, 놀람 등과 같은 인간의 감정상태를 인식하는 새로운 접근에 대해 설명한다. 이러한 시도는 이산길이를 포함하는 연속 은닉 마르코프 모델(HMM)을 사용함으로써 이루어진다. 이를 위해, 우선 입력음성신호로부터 감정의 특징 파라메타를 정의한다. 본 연구에서는 피치 신호, 에너지, 그리고 각각의 미분계수 등의 운율 파라메타를 사용하고, HMM으로 훈련과정을 거친다. 또한, 화자적응을 위해서 최대 사후확률(MAP) 추정에 기초한 감정 모델이 이용된다. 실험 결과로서, 음성에서의 감정 인식률은 적응 샘플수의 증가에 따라 점차적으로 증가함을 보여준다.

Keywords

References

Proc. of the ICSLP'96 Recognizing Emotion in Speech F. Dellaert;T. Polzin;A. Waibel
Proc. of International Conferenceon Multimedia Computing and Systems(ICMCS'99) Emotion Recognition and Synthesis System on Speech T. Moriyama;S. Ozawa
Proc. of the 2nd International Conference on Automatic Face and Gesture Recognition Spoken Affect Classification and Analysis D. Roy;A. Pentland
The 2002 Intel International Science and Engineering Fair Computer Recognition of Emotion in Speech Y. Yu;E. Chang;C. Li
Book Digital Processing of Speech Signal L.R. Rabiner;R.W. Schafer
Book Speech Recognition: Theory and C++ Implementation C. Becchetti;L.P. Riotti
Proc. Int. Symposium on Spoken Dialogue Speech recognition and understanding of spoken dialogue S. Nakagawa;A. Kai;T. Itoh;S. Kogure
MIT EECS Thesis for M.Sc. degree in Electrical Engineerign and Computer Science Stochastic Modeling of Physiological Signals with Hidden Markov Models: A Step Toward Frustration detection in Human- Computer Interfaces R. Fernandez
Doctoral Thesis Prosody and Speech Recognition Waibel A
Master's Thesis A Text-to-Speech System based on (NET)talk C. Turek
Speech Coding and Synthesis A robust algorithm for pitch tracking (RAPT) David Talkin
Journal of the Acoustical Society of America v.99 no.6 The processing of duration and intensity cues to prominence Alice E. Turk;James R. Sawusch
Developmental Psychology v.64 Approval and disapproval: Infant responsiveness to vocal affect in familiar and unfamiliar languages A. Fernald
Affective Computing Rosalind W. Picard
proc. of International Conference on Multimedia Computing and Systems(ICMCS'99) Emotion Recognition and Synthesis System on Speech T. Moriyama;S. Ozawa
Mechanical Engineer's Degree Thesis Recognition of Emotional and Cognitive States Using Physiological Data E. Vyzas
Automatic Speech Recognition: The Development of SPHINX System K.F. Lee
Prentice Hall Signal Processing Series Fundamentals of Speech Recognition L. Rabiner;B.H. Juang
Proc. of ICSLP'94 An Unsupervised Speaker Adaptation Method for Continuous Parameter HMM by Maximum a Posteriori Probability Estimation Y. Tsurumi;S. Nakagawa

Journal of the Institute of Convergence Signal Processing (융합신호처리학회논문지)

Emotion recognition in speech using hidden Markov model

은닉 마르코프 모델을 이용한 음성에서의 감정인식

Abstract

Keywords

References

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)