Detection and Synthesis of Transition Parts of The Speech Signal

Kim, Moo-Young;

The Journal of Korean Institute of Communications and Information Sciences (한국통신학회논문지)

Volume 33 Issue 3C
/
Pages.234-239
/
2008
/
1226-4717(pISSN)
/
2287-3880(eISSN)

The Korean Institute of Commucations and Information Sciences (한국통신학회)

Detection and Synthesis of Transition Parts of The Speech Signal

Kim, Moo-Young (Information and Communications Eng., Sejong University)

Published : 2008.03.31

PDF KSCI

Download PDF

⟨ Previous Next ⟩

Abstract

For the efficient coding and transmission, the speech signal can be classified into three distinctive classes: voiced, unvoiced, and transition classes. At low bit rate coding below 4 kbit/s, conventional sinusoidal transform coders synthesize speech of high quality for the purely voiced and unvoiced classes, whereas not for the transition class. The transition class including plosive sound and abrupt voiced-onset has the lack of periodicity, thus it is often classified and synthesized as the unvoiced class. In this paper, the efficient algorithm for the transition class detection is proposed, which demonstrates superior detection performance not only for clean speech but for noisy speech. For the detected transition frame, phase information is transmitted instead of magnitude information for speech synthesis. From the listening test, it was shown that the proposed algorithm produces better speech quality than the conventional one.

Keywords

References

L. R. Rabiner and R. W. Schafer, Digital Processing of Speech Signals. Upper Saddle River, NJ: Prentice Hall, 1978
T. F. Quatieri, Discrete-Time Speech Signal Processing: Principles and Practices. Upper Saddle River, NJ: Prentice Hall, 2002
DVSI, APCO project 25: Vocoder Description, Version 1.3. July, 1993
Y. D. Cho, M. Y. Kim, and S. R. Kim, 'A spectrally mixed excitation (SMX) vocoder with robust parameters determination,' in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, pp. 601-604, Seattle, WA, USA, 1998
C. Li and V. Cuperman, 'Enhanced Harmonic Coding of Speech with Frequency Domain Transition Modeling,' in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, pp. 581-584, Seattle, WA, USA, 1998
W. B. Kleijn and J. Haagen, Speech Coding and Synthesis. Amsterdam, The Netherlands: Elsevier, 1995
T. Unno, T. P. barnwell III, and K. Truong, 'An Improved Mixed Excitation Linear Prediction (MELP) Coder,' in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, pp. 245-248, Phoenix, Arizona, USA, 1999
D. S. Kim and M. Y. Kim, 'On the perceptual weighting function for phase quantization of speech,' in Proc. IEEE Workshop on Speech Coding, pp.62-64, Finland, 2000

The Journal of Korean Institute of Communications and Information Sciences (한국통신학회논문지)

Detection and Synthesis of Transition Parts of The Speech Signal

Abstract

Keywords

References

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)