Design and Implementation of the low power and high quality audio encoder/decoder for voice synthesis

Park, Nho-Kyung;Park, Sang-Bong;Heo, Jeong-Hwa;

doi:10.7236/JIIBC.2013.13.6.55

The Journal of the Institute of Internet, Broadcasting and Communication (한국인터넷방송통신학회논문지)

Volume 13 Issue 6
/
Pages.55-61
/
2013
/
2289-0238(pISSN)
/
2289-0246(eISSN)

The Institute of Internet, Broadcasting and Communication (한국인터넷방송통신학회)

DOI QR Code

Design and Implementation of the low power and high quality audio encoder/decoder for voice synthesis

음성 합성용 저전력 고음질 부호기/복호기 설계 및 구현

Park, Nho-Kyung (Dept. Information Communication Engineering, Hoseo University) ;
Park, Sang-Bong ;
Heo, Jeong-Hwa

박노경 (호서대학교 정보통신공학과) ;
박상봉 (세명대학교 정보통신학과) ;
허정화 (세명대학교 정보통신학과)

Received : 2013.10.08
Accepted : 2013.12.13
Published : 2013.12.31

https://doi.org/10.7236/JIIBC.2013.13.6.55 Citation PDF KSCI

Download PDF

⟨ Previous Next ⟩

Abstract

In this paper, we describe design and implementation of audio encoder/decoder for voice synthesis. It uses the encoding of difference value of successive samples instead of the original sample value. and has the compression ratio of 4. The function is verified by using FPGA and the performance is measured by the fabricated chip using $0.35{\mu}m$ standard CMOS process. The system clock is 16.384MHz. The measured THD+n is from -40dB to -80dB with frequency variation and the power consumption is about 80mW. It is suited for the mobile application of high audio quality and low power consumption.

본 논문은 음성합성에서 사용되는 오디오 부호기/복호기 설계 및 구현을 기술한다. 설계된 회로는 원래 음성 샘플대신에 연속되는 음성 샘플의 차를 부호화하는 방식으로 압축율은 4:1 이다. FPGA를 이용해서 각각의 기능을 검증하고, $0.35{\mu}m$ 표준 CMOS 공정을 이용하여 칩으로 제작해서 성능을 측정하였다. 시스템 클럭 주파수는 16.384MHz를 사용한다. THD(Total Harmonic Distortion)+n은 주파수에 따라서 -40dB에서 -80dB 값을 지니고, 전력 소모는 전원 전압 3.3V에서 80mW로써, 고음질과 저전력 소모를 요구하는 모바일 응용에 적합하다.

Keywords

References

H. Han, "Variable Quad Rate ADPCM for Efficient Speech Transmission and Real Time Implementation on DSP", Journal of Korean Institute of illuminating and Electrical Installation Engineers, vol. 18, No 1, pp. 129-136, January 2004. https://doi.org/10.5207/JIEIE.2004.18.1.129
S. Y. Min, D. S. Na, "A Study on Implementation of Emotional Speech Synthesis System using Variable Prosody Model", Journal of the Korea Academia-Industrial cooperation Society, v.14, no.8, pp.3992-3998, 2013. https://doi.org/10.5762/KAIS.2013.14.8.3992
Bluetooth Audio Video Working Croup, "Bluetooth Specification: Advanced Audio Distribution Profile, Bluetooth SIG Inc. 2002 Hoc Networks", IEEE Computer, Feb. 2004
D.Hermann, R.L. Brenna, H.Sheikhzad, E.Cornu, "Low-Power implementation of the bluetooth subband audio codec", IEEE, 2004.
J. S. Choi, "Speech Synthesis Algorithm Applied to Methods of Multi-Cepstrum Extraction and Root Mean Square Amplitude", Journal of Korean Institute of Information Technology, vol. 11, issue 6, pp. 157-162, June 2013.
K. H. Han, "Coding Method of Variable Threshold Dual Rate ADPCM Speech Considering the Background Noise", Journal of Korean Institute of illuminating and Electrical Installation Engineers, vol. 17, No 6, pp. 154-159, November 2003. https://doi.org/10.5207/JIEIE.2003.17.6.154

The Journal of the Institute of Internet, Broadcasting and Communication (한국인터넷방송통신학회논문지)

Design and Implementation of the low power and high quality audio encoder/decoder for voice synthesis

음성 합성용 저전력 고음질 부호기/복호기 설계 및 구현

Abstract

Keywords

References

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)