A Study on Speaker Adaptation of Large Continuous Spoken Language Using back-off bigram

;

한국통신학회논문지 (The Journal of Korean Institute of Communications and Information Sciences)

제28권9C호
/
Pages.884-890
/
2003
/
1226-4717(pISSN)
/
2287-3880(eISSN)

한국통신학회 (The Korean Institute of Commucations and Information Sciences)

Back-off bigram을 이랑한 대용량 연속어의 화자적응에 관한 연구

A Study on Speaker Adaptation of Large Continuous Spoken Language Using back-off bigram

최학윤 (김포대학 전자정보계열)

발행 : 2003.09.01

PDF KSCI

PDF 다운로드

⟨ 이전 논문 다음 논문 ⟩

초록

본 논문에서는 화자 독립 시스템에서 필요한 화자 적응 방법에 관해 연구하였다. 훈련에 참여하지 않은 새로운 화자에 대해서 bigram과 back-off bigram, MAP와 MLLR의 결과를 비교해 보았다. back-off bigram은 훈련중 나타나지 않은 bigram 확률을 unigram과 back-off 가중치를 적용하므로 bigram 확률 값에 약간의 가중치를 더하는 효과를 가져온다. 음성의 특징 파라미터로는 12차의 MFCC와 log energy, 1차 미분, 2차 미분을 사용하여 총 39차의 특징 벡터를 사용하였다. 인식 실험을 위해 CHMM, 삼중음소(tri-phones)의 인식 단위, 그리고 bigram과 back-off bigram의 언어 모델을 사용한 시스템을 구성하였다.

In this paper, we studied the speaker adaptation methods that improve the speaker independent recognition system. For the independent speakers, we compared the results between bigram and back-off bigram, MAP and MLLR. Cause back-off bigram applys unigram and back-off weighted value as bigram probability value, it has the effect adding little weighted value to bigram probability value. We did an experiment using total 39-feature vectors as featuring voice parameter with 12-MFCC, log energy and their delta and delta-delta parameter. For this recognition experiment, We constructed a system made by CHMM and tri-phones recognition unit and bigram and back-off bigrams language model.

키워드

참고문헌

Proc. ICASSP Speaker adaptation in continuous speech recognition via estimation of correlated mean vectors W.A.Rozzi;R.M.Stern
Proc. ICASSP On speaker independent, speaker dependent, and speaker adaptive speech recognition X.D.Huang;K.F.Lee
IEEE Trans. speech Audio Processing v.2 no.3 An acoustic-phonetic-based speaker adaptation technique for improving speaker independent continuous speech recognition Y.Zhao
Proc. ICASSP A bayesian approach to speaker adaptation for the stochastic segment model B.F.Necioglu;M.Ostendorf;J.R.Rohlicek
Fundamentals of Speech Recognition L.R.Rabiner;B.Juang
Digital Signal Processing Oppenheim A.V.;Shafer R.W.
Digital Coding of Waveforms Jayant N.O.S;Noll P.
IEEE Tran. on ASSP v.29 no.2 Cepstral analysis techniques for automatic speaker verification Fruri S.
Digital Processing of Speech Signal Rabiner L.R.;Shafer R.W.
Computer Speech and Language v.8 On structuring Probabilstic dependencies in stochastic language modeling Ney H.;Essen U;Kneser R
Technical Report LWMS-79 , Brown Unibersity Modelling and learning in speech recognition : the relationship between stochastic pattern chassigiera and neural networks Niles L.T.
Handbook of Standards and Resources for Spoken Language Systems, Mouton de Gruyter Language models Ney H.
The HTK Book Steve Young(et al.)
Spoken Language Processing Xueding Huang;Alejandere Acero;Hsiao Wuen Hon
Speech Recognition Claudio Becchetti;Lucio Prina Ricotti
Proc. Eur. Conf. Speech Communication Technology Maximum a Posterior linear regression with elliptically symmetric matrix variate priors W.Chou

한국통신학회논문지 (The Journal of Korean Institute of Communications and Information Sciences)

Back-off bigram을 이랑한 대용량 연속어의 화자적응에 관한 연구

A Study on Speaker Adaptation of Large Continuous Spoken Language Using back-off bigram

초록

키워드

참고문헌

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)