Search | Korea Science

A Study on TSIUVC Approximate-Synthesis Method using Least Mean Square and Frequency Division (주파수 분할 및 최소 자승법을 이용한 TSIUVC 근사합성법에 관한 연구)

이시우
- Journal of Korea Multimedia Society / v.6 no.3 / pp.462-468 / 2003
In a speech coding system that uses voiced and unvoiced excitation sources, speech quality is distorted when voiced sounds and unvoiced consonants coexist in a single frame. To keep voiced sounds and unvoiced consonants from coexisting in a frame, I propose a method for searching and extracting the TSIUVC (Transition Segment Including Unvoiced Consonant). This paper presents a new TSIUVC approximate-synthesis method that uses least mean square and frequency band division. The method obtains high-quality approximate-synthesis waveforms within the TSIUVC by using frequency information below 0.547 kHz and above 2.813 kHz. Importantly, the approximate-synthesis waveform within the TSIUVC shows low distortion even where the error signal is at its maximum. The method can be applied to a new Voiced/Silence/TSIUVC speech coding scheme, as well as to speech analysis and speech synthesis.

Improved Speech Enhancement Algorithm employing Multi-band Power Subtraction and Wavelet Packets Decomposition (Multi-band Power Subtraction과 Wavelet Packets Decomposition을 이용한 개선된 음성 향상 방법)

Lee Yoon-Chang;Kwak Jeong-Hoon;Ahn Sang-Sik
- The Journal of Korean Institute of Communications and Information Sciences / v.31 no.6C / pp.589-602 / 2006
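Because noise is a major factor limiting the performance of speech-related systems, research on speech enhancement has continued steadily. Conventional speech enhancement methods do not distinguish unvoiced speech from noise, so unvoiced speech is removed along with the noise. Conventional wavelet-based denoising methods use the same threshold in every band, so their performance degrades in time-varying environments. To address these drawbacks, we propose an improved wavelet-based speech enhancement method that uses multi-band power subtraction and perceptual wavelet packet decomposition. As a preprocessing step, multi-band power subtraction removes wideband noise and reduces musical noise. The signal is then decomposed with perceptual wavelet packets based on a psycho-acoustic model, and unvoiced speech, voiced speech, and noise are distinguished using the entropy ratio of each wavelet node together with voice activity detection. Depending on the class, wavelet shrinkage is applied with a threshold set for each wavelet node, which removes noise while minimizing the erroneous removal of unvoiced speech or low-power voiced speech. In addition, the forgetting factor is selected adaptively during noise power estimation to minimize the noise power estimation error.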
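
The per-node wavelet shrinkage step mentioned in the abstract can be illustrated with a minimal Python sketch. It assumes soft thresholding as the shrinkage rule and takes the node coefficients and thresholds as given; the abstract does not state which shrinkage rule or threshold values the authors use, so `soft_threshold`, `shrink_nodes`, and the node names are hypothetical.

```python
def soft_threshold(coeffs, threshold):
    """Shrink each coefficient toward zero by `threshold` (soft shrinkage)."""
    out = []
    for c in coeffs:
        magnitude = abs(c) - threshold
        if magnitude <= 0.0:
            # Coefficients at or below the threshold are treated as noise.
            out.append(0.0)
        else:
            out.append(magnitude if c > 0 else -magnitude)
    return out


def shrink_nodes(nodes, thresholds):
    """Apply a separate (assumed, precomputed) threshold to each wavelet node."""
    return {name: soft_threshold(values, thresholds[name])
            for name, values in nodes.items()}


# Hypothetical node names and values, for illustration only.
nodes = {"node_a": [3.0, -0.5, -2.0], "node_b": [0.2, 0.9]}
thresholds = {"node_a": 1.0, "node_b": 0.5}
denoised = shrink_nodes(nodes, thresholds)
```

Using a dictionary keyed by node keeps the threshold choice separate from the shrinkage itself, so a class-dependent threshold (unvoiced, voiced, or noise) could be substituted without changing `soft_threshold`.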

A Study on Extracting Valid Speech Sounds by the Discrete Wavelet Transform (이산 웨이브렛 변환을 이용한 유효 음성 추출에 관한 연구)

Kim, Jin-Ok;Hwang, Dae-Jun;Baek, Han-Uk;Jeong, Jin-Hyeon
- The KIPS Transactions:PartB / v.9B no.2 / pp.231-236 / 2002
The classification of speech-sound blocks relies on the multi-resolution analysis property of the discrete wavelet transform, which is used to reduce the computation time of speech recognition pre-processing. A merging algorithm is proposed to extract valid speech sounds in terms of position and frequency range. It performs unvoiced/voiced classification and denoising. Because the merging algorithm can determine the processing parameters from the voice alone and is independent of system noise, it is useful for extracting valid speech sounds. The merging algorithm adapts to arbitrary system noise, achieves an excellent signal-to-noise ratio after denoising, and allows useful system tuning for implementation.
https://doi.org/10.3745/KIPSTB.2002.9B.2.231
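
As a rough illustration of how a wavelet decomposition can separate unvoiced-like from voiced-like blocks, the Python sketch below compares the energy of the detail (high-frequency) and approximation (low-frequency) bands of a one-level Haar transform. This is a simplified stand-in, not the paper's merging algorithm; the Haar wavelet, the single decomposition level, and `ratio_threshold` are all assumptions.

```python
import math


def haar_dwt(signal):
    """One-level Haar DWT; returns (approximation, detail) coefficients."""
    scale = 1.0 / math.sqrt(2.0)
    approx = [(signal[i] + signal[i + 1]) * scale
              for i in range(0, len(signal) - 1, 2)]
    detail = [(signal[i] - signal[i + 1]) * scale
              for i in range(0, len(signal) - 1, 2)]
    return approx, detail


def classify_block(signal, ratio_threshold=1.0):
    """Label a block by comparing high-band and low-band energy (assumed rule)."""
    approx, detail = haar_dwt(signal)
    low_energy = sum(c * c for c in approx)
    high_energy = sum(c * c for c in detail)
    return "unvoiced" if high_energy > ratio_threshold * low_energy else "voiced"


# A slowly varying sine stands in for a voiced block, and a rapidly
# alternating sequence stands in for a noise-like unvoiced block.
voiced_like = [math.sin(2.0 * math.pi * i / 64.0) for i in range(128)]
unvoiced_like = [1.0, -1.0] * 64
```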

Enhancement Voiced/Unvoiced Sounds Classification for 3GPP2 SMV Employing GMM (3GPP2 SMV의 실시간 유/무성음 분류 성능 향상을 위한 Gaussian Mixture Model 기반 연구)

Song, Ji-Hyun;Chang, Joon-Hyuk
- Journal of the Institute of Electronics Engineers of Korea SP / v.45 no.5 / pp.111-117 / 2008
In this paper, we propose an approach to improve the performance of the voiced/unvoiced (V/UV) decision under background noise for the selectable mode vocoder (SMV) of 3GPP2. We first analyze the features and the classification method adopted in the SMV. Feature vectors for the GMM are then selected from the relevant SMV parameters to classify voiced and unvoiced speech efficiently. To evaluate the proposed algorithm, experiments were carried out under various noise environments; the proposed algorithm yields better results than the conventional SMV scheme.
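
A GMM-based V/UV decision of the kind described above can be sketched in Python as a log-likelihood ratio test between a voiced model and an unvoiced model. The diagonal covariances, the two-dimensional feature vector, and every parameter value below are illustrative assumptions; the abstract does not list the SMV parameters or the trained model values.

```python
import math


def log_gaussian_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at feature vector x."""
    return sum(-0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
               for xi, m, v in zip(x, mean, var))


def gmm_log_likelihood(x, gmm):
    """Log-likelihood of x under a GMM given as (weight, mean, var) tuples."""
    terms = [math.log(w) + log_gaussian_diag(x, m, v) for w, m, v in gmm]
    peak = max(terms)  # log-sum-exp for numerical stability
    return peak + math.log(sum(math.exp(t - peak) for t in terms))


def classify_frame(x, voiced_gmm, unvoiced_gmm, bias=0.0):
    """Decide voiced/unvoiced by comparing the log-likelihood ratio to a bias."""
    llr = gmm_log_likelihood(x, voiced_gmm) - gmm_log_likelihood(x, unvoiced_gmm)
    return "voiced" if llr > bias else "unvoiced"


# Hypothetical models over two made-up features (e.g. a normalized energy
# and a zero-crossing rate); real models would be trained on labeled frames.
voiced_gmm = [(0.6, [0.8, 0.1], [0.05, 0.02]), (0.4, [0.6, 0.2], [0.05, 0.02])]
unvoiced_gmm = [(1.0, [0.1, 0.6], [0.05, 0.05])]
```

The `bias` term is a convenient place to trade missed voiced frames against false alarms once the models are trained.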

Design of a Low Bit-rate Speech Coder Based on Mixed Multi-band Excitation Model (혼합 다중대역 여기모델에 기반한 저 전송률 음성 부호화기의 설계)

한우진;오영환
- The Journal of the Acoustical Society of Korea / v.21 no.6 / pp.510-521 / 2002
An MBE (multi-band excitation) coder can achieve high-quality synthetic speech below 4.0 kbps. There are, however, significant differences in fine structure between the original spectrum and the synthetic spectrum. They are mainly due to the exclusive partition of voiced and unvoiced regions in the frequency domain and to a decision procedure based on an experimental threshold. This paper proposes an MMBE (mixed multi-band excitation) speech model to overcome the drawbacks of the MBE coder. In addition, two analysis methods are presented that do not need any threshold-based decision procedure. In the MMBE speech model, voiced and unvoiced components can be mixed over the entire frequency axis. To illustrate the potential of the proposed speech model, we develop a 2.6 kbps MMBE coder and compare it with a 2.9 kbps MBE coder by both objective and subjective methods. The results show that the proposed coder performs better than the MBE coder even at a lower bit rate.

An Efficient Pitch Estimation for IMBE (Improved Multi-band Excitation) Speech Coder (개량형 다중대역 여기 (IMBE: Improved Multi-band Excitation) 음성 부호기의 피치 예측 개선)

Na, Hoon;Jeong, Dae-Gwon
- The Journal of the Acoustical Society of Korea / v.20 no.3 / pp.34-41 / 2001
In an IMBE (Improved Multi-band Excitation) speech coder, initial pitch estimation takes up most of the coder's total computing time because of its complex cost function and exhaustive search over candidate pitches. Its use of future frames also causes unavoidable delay, which makes a real-time coder difficult to implement. Furthermore, pitch estimation is performed unnecessarily on unvoiced frames just as on voiced frames. In this paper, each frame is classified as voiced or unvoiced by the Dyadic Wavelet Transform (DyWT), and initial pitch estimation is then performed only for voiced frames. Voiced and unvoiced frames are thus handled by different pitch estimation algorithms, which reduces the delay at the transmitter and receiver. Simulation results show that the relative complexity of initial pitch estimation is reduced by 23%, and the processing time decreases to 1/10 to 1/11 of that of the IMBE coder, while speech quality is almost maintained.

STUDIES ON KOREAN PHONOLOGY (PART II) - PHYSIOLOGICAL PRODUCTION MECHANISMS OF KOREAN STOP CONSONANTS (Summarized Version) - (한국파열자음발음시의 생리기전)

Kim, Byoung-Wook
- The Journal of the Korean dental association / v.10 no.9 / pp.605-625 / 1972
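The three-way distinction among the Korean plosive (stop) consonants ㄱ, ㄲ, ㅋ; ㅂ, ㅃ, ㅍ; and ㄷ, ㄸ, ㅌ has been regarded as a feature unique to Korean that is not found in other languages. It has been of great interest not only to Korean scholars but also to foreign scholars. One of the main reasons is that, from the standpoint of the physiological mechanism of stop consonants, whether the distinction between p and b or between k and g is simply a voiceless versus voiced difference or a difference in muscular force has long been debated internationally. A second reason is that, curiously, Korean stop consonants are all voiceless, so the voiced versus voiceless contrast seen in other languages is absent, and it has been thought that there may instead be only a contrast in muscular force. Until now, however, no study had clarified the differences in the physiological mechanisms involved in producing these three types of Korean stop consonants. With this in mind, the author used highly advanced, state-of-the-art research equipment at the speech pathology and physiology laboratory of the University of Wisconsin in the United States to clarify these differences. Details of the introduction, methods, results, and conclusions are given in the English abstract.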

Implementation of MPEG-4 HVXC decoder with VHDL (VHDL을 이용한 MPEG-4 HVXC 복호화기 구현)

김구용;임강희;차형태
- Proceedings of the IEEK Conference / 2001.09a / pp.465-468 / 2001
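Among the MPEG-4 parametric coding tools, HVXC (Harmonic Vector eXcitation Coding) encodes speech signals at low bit rates. We implemented its decoding modules, namely the LSP synthesis filter, the unvoiced synthesis unit, and the voiced synthesis unit, in VHDL. In MPEG-4 HVXC decoding, the LSP coefficients, the VXC signal, and the spectral envelope are decoded using codebooks, pass through the LSP inverse filter and the unvoiced and voiced synthesis stages to be converted into LPC coefficients and unvoiced and voiced excitation signals, and then go through LPC synthesis filtering to produce the final speech output. For the cosine values used in the LSP inverse filter, a table-based approximation performs accurate, high-speed cosine computation using only a small number of table entries. In VXC decoding, the Hidden Address in LSH method, which removes signal redundancy, reduces the codebook size. In the voiced synthesis stage, an IFFT module increases computation speed. Finally, the implemented system was verified in software through simulation.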
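
The table-based cosine approximation mentioned in the abstract can be illustrated in software with a small lookup table. The Python sketch below is not the authors' VHDL design: the quarter-wave symmetry, the linear interpolation between entries, and the table size `TABLE_SIZE` are all assumptions chosen to show how a small table can still give an accurate result.

```python
import math

TABLE_SIZE = 64  # assumed number of intervals over [0, pi/2]
QUARTER = math.pi / 2.0
# Only a quarter wave is stored; the rest follows from symmetry.
COS_TABLE = [math.cos(i * QUARTER / TABLE_SIZE) for i in range(TABLE_SIZE + 1)]


def table_cos(x):
    """Approximate cos(x) from a quarter-wave table with linear interpolation."""
    x = math.fmod(abs(x), 2.0 * math.pi)  # cos is even and 2*pi periodic
    if x > math.pi:
        x = 2.0 * math.pi - x             # cos(2*pi - x) = cos(x)
    sign = 1.0
    if x > QUARTER:
        x = math.pi - x                   # cos(pi - x) = -cos(x)
        sign = -1.0
    pos = x / QUARTER * TABLE_SIZE
    i = min(int(pos), TABLE_SIZE - 1)
    frac = pos - i
    return sign * (COS_TABLE[i] + frac * (COS_TABLE[i + 1] - COS_TABLE[i]))
```

With 64 intervals the linear-interpolation error stays below roughly 1e-4; halving the table size roughly quadruples that error, which is the kind of accuracy versus storage trade-off a hardware design would have to settle.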

Program development for calculating the operating costs of silent discharge ozone generator (무성방전식 오존발생기의 운영비 산정을 위한 프로그램 개발)

You, Jung-Ho;Lee, Gyung;Shim, Hyeon-Sung;Lee, Gyung-Hyeok
- Proceedings of the KIEE Conference / 2011.07a / pp.1576-1577 / 2011
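Among the oxidants used in water treatment processes, ozone has the strongest oxidizing power after chlorine. Accordingly, its use is expanding for the effective removal of pollutants in water supply and sewage, livestock wastewater, pesticides, and similar sources. It is also being applied in an increasingly wide range of fields because it leaves no secondary pollutants. Various technologies are being developed to generate ozone; among them, the silent discharge ozone generator, developed by Siemens in 1857, is currently the most widely used. In this study, we wrote a simulation in Java to derive the optimal operating point of a silent discharge ozone generator and applied it at an actual operating site to obtain results.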

Improvement of Dynamic Time Warping Algorithm by Using Voiced/Unvoiced/Silence Information (유성/무성/묵음 정보를 이용한 동적 시간 정합 알고리즘 개선)

Choi Min Seok;Han Hyun Bae;Hahn Min Soo
- Proceedings of the Acoustical Society of Korea Conference / spring / pp.40-43 / 1999
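This study proposes a method to reduce the computational load of the DTW (Dynamic Time Warping) algorithm used in isolated-word recognition systems. DTW is generally known to give the best recognition rate for isolated-word recognition, but as the vocabulary grows, the computation increases proportionally and the recognition rate falls, so it is generally used only for vocabularies of 200 words or fewer. To reduce computation by shrinking the candidate vocabulary, this paper builds codewords from voiced/unvoiced/silence (V/U/S) information, extracts the words that share the same codeword, and restricts comparison to those words, which reduces the number of words to which DTW is applied and improves computation speed. In addition, when computing the accumulated distance between the input word and a candidate word, using V/U/S boundary information as well as endpoint information to implement piecewise DTW narrows the search region and further reduces computation. With these techniques, a large-vocabulary isolated-word speech recognizer based on DTW should be feasible even on a PC.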
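
The candidate-pruning idea in this abstract can be sketched in Python as follows. The sketch uses textbook DTW on scalar features and forms the codeword by collapsing runs of identical V/U/S labels; the abstract does not say how the authors build their codewords, so `vus_codeword`, the fallback to the full vocabulary, and the toy templates are assumptions. Piecewise DTW is not shown.

```python
def dtw_distance(a, b):
    """Classic DTW accumulated distance between two scalar sequences."""
    inf = float("inf")
    n, m = len(a), len(b)
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]


def vus_codeword(labels):
    """Collapse runs of frame labels, e.g. 'SSUUVVVSS' -> 'SUVS' (assumed scheme)."""
    out = []
    for label in labels:
        if not out or out[-1] != label:
            out.append(label)
    return "".join(out)


def recognize(query_feats, query_labels, templates):
    """Run DTW only against templates whose V/U/S codeword matches the query."""
    code = vus_codeword(query_labels)
    candidates = [t for t in templates if vus_codeword(t[2]) == code]
    if not candidates:
        candidates = templates  # assumed fallback: search the full vocabulary
    return min(candidates, key=lambda t: dtw_distance(query_feats, t[1]))[0]


# Toy templates: (word, scalar features, per-frame V/U/S labels).
templates = [("one", [1.0, 2.0, 3.0], "SVVS"), ("two", [5.0, 5.0, 5.0], "SUVS")]
best = recognize([1.0, 2.0, 2.0, 3.0], "SSVVVS", templates)
```

Because the codeword comparison is a cheap string match, the quadratic-cost DTW is run only on the words that survive the filter.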
