Search | Korea Science

Performance Analysis of LSP Vector Quantization and New Improved LSP Vector Quantization Method (LSP 벡터 양자화의 성능 분석과 성능이 향상된 새로운 LSP 벡터 양자화 방법)

박호종
- The Journal of the Acoustical Society of Korea
- /
- v.17 no.3
- /
- pp.59-64
- /
- 1998
본 논문에서는 LSP 벡터 양자화의 성능에 대하여 분석하고 성능이 향상된 새로운 LSP 벡터 양자화 방법을 제안한다. 먼저, 10msec프레임 구조를 가지고 Moving Average 예측 필터를 사용한 LSP Split 벡터 양자화의 성능을 여러 훈련 방법과 벡터 Split 방법 및 Bit 할당 방법에 따라 비교한다. 다음, Split 벡터 양자화의 문제점을 해결하기 위하여 새로 운 Split 벡터 양자화 검색 방법을 제안한다. 스펙트럼 왜곡지수를 이용한 양자화 성능 측정 결과 새로 제안된 방법이 기존의 방법보다 우수한 양자화 성능을 보인다.
PDF

Pattern Optimization Using a Fine Search- (미세탐색을 이용한 빔패턴의 최적화)

Chang Byong-Kun;Jeon Chang-Dae
- Proceedings of the Acoustical Society of Korea Conference
- /
- autumn
- /
- pp.285-288
- /
- 2000
본 논문은 시간영역에서 희소어레이의 패턴을 원하는 패턴과 실제 희소어레이의 패턴간의 오차의 계수적 자승치를 Taylor 급수 근사와 미세탐색을 이용하여 최소화하여 빔패턴을 최적화하는 방법을 제시한다. 센서의 간격이 어레이 중심에 관하여 비대칭인 경우에 대하여 어레이 공간의 주어진 영역의 오차함수에 성능 향상을 위하여 계수를 적용한다. 미세탐색을 이용함으로써 계수 최소 방법의 성능이 주 빔 부근의 측면롭에 관련하여 향상되는 것이 판명되었다.
PDF

Clarity Enhancement for Virtual Sound (가상 음향 공간에서의 음의 선명도 향상)

Cho Yong-Choon;Oh Yoonhark;Kim Sunmin;laryguine Serguei;Jang Seongcheol
- Proceedings of the Acoustical Society of Korea Conference
- /
- autumn
- /
- pp.257-260
- /
- 2004
HRTF 를 사용하여 가상적인 음향 효과를 내는 시스템에서의 음의 왜곡은 필수적이다. 하지만 음성과 같은 사람이 민감하게 느끼는 부분에서의 음의 선명도 저하는 다른 대역의 왜곡보다 더욱 크게 느껴진다. 그리고 모노 및 두 채널의 주파수 특성이 비슷한 스테레오에서는 음질을 보상할 수 없다는 단점이 있다. 본 논문에서는 이러한 음의 선명도의 저하를 유발하게 하는 가상 음향 발생 필터의 특징을 분석하고, 간단한 필터 설계에 의해서 본래의 가상 음향을 그대로 유지하면서 선명도를 높일 수 있는 방법을 제시한다. 제시한 방법은 특히 모노 및 뉴스모드와 같이 음성이 많이 들어 있는 부분에서 뛰어난 성능을 보인다.
PDF

Implementation Realtime 3D-Sound Transport System (공간음향의 실시간 전달을 위한 시스템 구현)

Kim, Pung-Hyeok;Oh, Yong-Taek;Choi, Deok-Jai
- Proceedings of the Korean Information Science Society Conference
- /
- 2006.10d
- /
- pp.778-781
- /
- 2006
네트워크 기반 실시간 멀티미디어 응용 프로그램의 발전이 점점 가속화하는 가운데, 보다 높은 질을 갖는 멀티미디어 데이터에 대한 요구도 점점 커지고 있다. 입체음향 기능은 음향 부분의 가상현실성을 향상시킬 수 있는 방법이며, 공간음향은 입체음향 효과를 구현할 수 있는 소프트웨어적인 인터페이스를 제공해 준다. 본 논문에서는 공간음향 기능을 네트워크 기반 어플리케이션에서 활용하는데 필요한 요소들과 처리 방법에 대해 논의하고 이를 실제로 구현한다. 아울러 구현 과정에서 발견한 문제들에 대해 논의한다.
PDF

Acoustic Echo Cancellation Using Independent Component Analysis (독립성분분석을 이용한 음향 반향 제거)

김대성;배현덕
- The Journal of the Acoustical Society of Korea
- /
- v.22 no.5
- /
- pp.351-359
- /
- 2003
In this paper, we proposed a method for acoustic echo cancellation based on independent component analysis. When the large acoustic noise is picked up by the microphone, the performance of echo cancellation decreased. We used two microphones that received echo signal which is linearly mixed with the noise, then separated the echo signals from the received signals with independent component analysis algorithm. The separated echo signal is used for the reference signal of adaptive algorithm which leads to better performance of the echo cancellation. Computer simulation results show the validity of the proposed method.
PDF KSCI

A Robust Acoustic Echo Canceler with Stepsize Predictor for Environment Noise (주변 노이즈에 강건한 Stepsize 예측기를 갖는 음향 반향 제거기)

Lee, Se-Won;Kang, Hee-Hoon;Lee, Won-Seok
- Journal of the Institute of Electronics Engineers of Korea TE
- /
- v.39 no.2
- /
- pp.44-50
- /
- 2002
Conventional acoustic echo cancelers using ES(Exponentially weighted Stepsize) algorithm have simple operational configuration and fast convergence speed batter then NLMS algorithm, but they are very weak in external noise because ES algorithm updates filter taps using an average energy reduction rate of room impulse response in specific acoustical condition. So, a new configuration of acoustic echo canceler with stepsize generator and selector is proposed in this thesis. The proposed stepsize generator and selector improve conventional acoustic echo canceler's weakness in external noise and improve the system robustness. The stepsize generator generates additional stepsize value using moving averager, which is the residual noise energy of error signal multiplied by constant ${\gamma}$. The stepsize selector selects the stepsize value that has better performance in an acoustic echo canceler using a coefficient decision factor ${\Delta}_{differ}$ The simulation results show that the proposed algorithm reduces residual error by 5[dB] to 10[dB], improves misadjustment regardless of external noise's SNR.
PDF KSCI

A Study on Construction of Acoustical Phoneme Models Using Hidden Markov Network (Hidden Markov Network를 이용한 음향학적 음소모델 작성에 관한 검토)

Oh Se-Jin;Lim Young-Choon;Hwang Cheol-Jun;Kim Bum-Koog;Chung Hyun-Yeol
- Proceedings of the Acoustical Society of Korea Conference
- /
- autumn
- /
- pp.29-32
- /
- 2000
본 논문에서는 음성인식 시스템의 음향모델 개선을 위한 기초적 연구로서, 문맥적인 요소를 필요로 하는 SSS(Successive State Splitting)와 필요로 하지 않는 SSS-free 알고리즘을 이용한 HMnet(Hidden Markov Network) 음향모델 작성방법에 대해 검토하고 작성한 음향모델을 한국어에 적용하여 그 유효성을 확인하였다. HMnet을 이용한 음소모델의 작성방법은 전체 학습 데이터에 대해서 각각 2개의 상태를 가지는 초기 모델을 작성한 후, 이를 시간과 문맥방향으로의 최대 분포를 가지는 상태를 재분할한 후 임의의 상태수가 될 때까지 상태분할을 계속적으로 수행케 하여 각 음소모델을 작성하게 된다. 작성한 HMnet 음향모델의 유효성을 확인하기 위해 ETRI 445 단어의 3인에 대한 화자종속 음소인식 실험을 수행하였다. 인식실험 결과, SSS 알고리즘을 이용한 화자종속실험의 경우 상태수 520에서 평균 $62.8\%$의 인식률을, SSS-free 알고리즘의 경우 상태수 420에서 평균 $64.2\%$의 인식률을 얻었다. 이 결과는 HMM을 이용한 경우(약$43.4\%$)보다 $20\%$이상의 인식률 향상을 보여 이 알고리즘의 유효성을 확인할 수 있었다. SSS와 SSS-free를 비교한 경우, SSS-free가 SSS보다 낮은 상태수에서 평균 $1.4\% 향상된 인식률을 보였다.
PDF

A Study on Improved MDL Technique for Optimization of Acoustic Model (향상된 MDL 기법에 의한 음향모델의 최적화 연구)

Cho, Hoon-Young;Kim, Sang-Hun
- The Journal of the Acoustical Society of Korea
- /
- v.29 no.1
- /
- pp.56-61
- /
- 2010
This paper describes optimization methods of acoustic models in HMM-based continuous speech recognition. Most of the conventional speech recognition systems use the same number of Gaussian mixture components for each HMM state. However, since the number of data samples available for each state is different from each other, it is possible to reduce the overall number of model parameters and the computational cost at the decoding step by optimizing the number of Gaussian mixture components. In this study, we introduced the Gaussian mixture weight term at the merging stage of Gaussian components in the minimum description length (MDL) based acoustic modeling optimization. Experimental results showed that the proposed method can obtain better ASR accuracy than the previous optimization method which does not consider the Gaussian mixture weight term.
https://doi.org/10.7776/ASK.2010.29.1.056 인용 PDF KSCI

Global Soft Decision Using Probabilistic Outputs of Support Vector Machine for Speech Enhancement (SVM의 확률 출력을 이용한 새로운 Global Soft Decision 기반의 음성 향상 기법)

Jo, Q-Haing;Chang, Joon-Hyuk
- The Journal of the Acoustical Society of Korea
- /
- v.27 no.2
- /
- pp.75-79
- /
- 2008
In this paper, we propose a novel speech enhancement technique using global soft decision (GSD) based on the probabilistic outputs of support vector machine (SVM). Generally, speech enhancement algorithms applied soft decision gain modification and noise power estimation have bettor performance than those employing hard decision. Especially, global speech absence probability (GSAP), which is known as an effective measure of the speech absence in each frame, has been adopted to SD-based speech enhancement methods. For this reason, we introduce a new GSAP estimated from the probabilistic output of SVM using sigmoid function. The performance of the proposed algorithm is evaluated by the PESQ and MOS test under various noise environments and yields better results compared with the conventional GSD scheme.
https://doi.org/10.7776/ASK.2008.27.2.075 인용 PDF KSCI

Language Models constructed by Iterative Learning and Variation of the Acoustical Parameters (음향학적 파라미터의 변화 및 반복학습으로 작성한 언어모델에 대한 고찰)

Oh Se-Jin;Hwang Cheol-Jun;Kim Bum-Koog;Jung Ho-Youl;Chung Hyun-Yeol
- Proceedings of the Acoustical Society of Korea Conference
- /
- spring
- /
- pp.35-38
- /
- 2000
본 연구에서는 연속음성인식 시스템의 성능 향상을 위한 기초 연구로서 시스템에 적합한 음향모델과 언어모델을 작성하고 항공편 예약 태스크를 대상으로 인식실험을 실시한 결과 그 유효성을 확인하였다. 이를 위하여 먼저 HMM의 출력확률분포의 mixture와 파라미터의 차원에 대한 정확한 분석을 통한 음향모델을 작성하였다. 또한 반복학습법으로 특정 태스크를 대상으로 N-gram 언어모델을 적용하여 인식 시스템에 적합한 모델을 작성하였다. 인식실험에 있어서는 3인의 화자가 발성한 200문장에 대해 파라미터 차원 및 mixture의 변화에 따른 음향모델과 반복학습에 의해 작성한 언어모델에 대해 multi-pass 탐색 알고리즘을 이용하였다. 그 결과, 25차원에 대한 mixture 수가 9인 음향모델과 10회 반복 학습한 언어모델을 이용한 경우 평균 $81.0\%$의 인식률을 얻었으며, 38차원에 대한 mixture 수가 9인 음향모델과 10회 반복 학습한 언어모델을 이용한 경우 평균 $90.2\%$의 인식률을 보여 인식률 제고를 위해서는 38차원에 대한 mixture 수가 9인 음향모델과 10회 반복학습으로 작성한 언어모델을 이용한 경우가 매우 효과적임을 알 수 있었다.
PDF

Search Result 1,161, Processing Time 0.024 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)