Speaker-Dependent Emotion Recognition For Audio Document Indexing

Hung LE Xuan;QUENOT Georges;CASTELLI Eric;

대한전자공학회:학술대회논문집 (Proceedings of the IEEK Conference)

대한전자공학회 2004년도 ICEIC The International Conference on Electronics Informations and Communications
/
Pages.92-96
/
2004

대한전자공학회 (The Institute of Electronics and Information Engineers)

Speaker-Dependent Emotion Recognition For Audio Document Indexing

Hung LE Xuan (International Research Center MICA) ;
QUENOT Georges (International Research Center MICA) ;
CASTELLI Eric (International Research Center MICA)

발행 : 2004.08.01

PDF

PDF 다운로드

⟨ 이전 논문 다음 논문 ⟩

초록

The researches of the emotions are currently great interest in speech processing as well as in human-machine interaction domain. In the recent years, more and more of researches relating to emotion synthesis or emotion recognition are developed for the different purposes. Each approach uses its methods and its various parameters measured on the speech signal. In this paper, we proposed using a short-time parameter: MFCC coefficients (MelFrequency Cepstrum Coefficients) and a simple but efficient classifying method: Vector Quantification (VQ) for speaker-dependent emotion recognition. Many other features: energy, pitch, zero crossing, phonetic rate, LPC... and their derivatives are also tested and combined with MFCC coefficients in order to find the best combination. The other models: GMM and HMM (Discrete and Continuous Hidden Markov Model) are studied as well in the hope that the usage of continuous distribution and the temporal behaviour of this set of features will improve the quality of emotion recognition. The maximum accuracy recognizing five different emotions exceeds $88\%$ by using only MFCC coefficients with VQ model. This is a simple but efficient approach, the result is even much better than those obtained with the same database in human evaluation by listening and judging without returning permission nor comparison between sentences [8]; And this result is positively comparable with the other approaches.

대한전자공학회:학술대회논문집 (Proceedings of the IEEK Conference)

Speaker-Dependent Emotion Recognition For Audio Document Indexing

초록

키워드

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)