Estimation of speech feature vectors and enhancement of speech recognition performance using lip information

Min So-Hee;Kim Jin-Young;Choi Seung-Ho;

대한음성학회지:말소리 (MALSORI)

제44호
/
Pages.83-92
/
2002
/
1226-1173(pISSN)

대한음성학회 (The Korean Society Of Phonetic Sciences And Speech Technology)

입술정보를 이용한 음성 특징 파라미터 추정 및 음성인식 성능향상

Estimation of speech feature vectors and enhancement of speech recognition performance using lip information

민소희 (전남대) ;
김진영 (전남대) ;
최승호 (동신대)

발행 : 2002.12.01

PDF

PDF 다운로드

⟨ 이전 논문 다음 논문 ⟩

초록

Speech recognition performance is severly degraded under noisy envrionments. One approach to cope with this problem is audio-visual speech recognition. In this paper, we discuss the experiment results of bimodal speech recongition based on enhanced speech feature vectors using lip information. We try various kinds of speech features as like linear predicion coefficient, cepstrum, log area ratio and etc for transforming lip information into speech parameters. The experimental results show that the cepstrum parameter is the best feature in the point of reconition rate. Also, we present the desirable weighting values of audio and visual informations depending on signal-to-noiso ratio.

대한음성학회지:말소리 (MALSORI)

입술정보를 이용한 음성 특징 파라미터 추정 및 음성인식 성능향상

Estimation of speech feature vectors and enhancement of speech recognition performance using lip information

초록

키워드

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)