Search | Korea Science

Efficient Speaker Verification in Noise Environment with Noise-added Speaker Model Composition (잡음 첨가된 화자 모델 구성에 의한 잡음 환경의 효과적인 화자확인)

안성주;강선미;고한석
- Proceedings of the Korean Information Science Society Conference
- /
- 1999.10b
- /
- pp.542-544
- /
- 1999
본 논문에서는 다수의 화자 모델을 구성함으로써 잡음에 강인한 화자확인 방법을 제안한다. Non-stationary한 잡음을 가진 입력음성의 SNR을 측정하는 것은 어렵기 때문에, 각 화자에 대해 잡음이 없을 때의 화자모델에 여러 SNR에 대한 잡음 모델을 결합시킴으로써 여러 개의 잡음 첨가된 화자 모델을 구성한다. 그리고, 화자확인에서는 이렇게 구한 각 모델에 대한 입력 음성의 likelihood를 구해 그 중 가장 큰 likelihood만을 선택한다. 이 값을 이용하여 화자확인을 수행한다. 실험 결과, 제안한 방법은 입력음성의 SNR을 모르는 잡음환경에서 일반적으로 하나의 모델을 사용하는 것보다 훨씬 좋은 성능을 보였다.
PDF

Robust Distributed Speech Recognition under noise environment using MESS and EH-VAD (멀티밴드 스펙트럼 차감법과 엔트로피 하모닉을 이용한 잡음환경에 강인한 분산음성인식)

Choi, Gab-Keun;Kim, Soon-Hyob
- Journal of the Institute of Electronics Engineers of Korea CI
- /
- v.48 no.1
- /
- pp.101-107
- /
- 2011
The background noises and distortions by channel are major factors that disturb the practical use of speech recognition. Usually, noise reduce the performance of speech recognition system DSR(Distributed Speech Recognition) based speech recognition also bas difficulty of improving performance for this reason. Therefore, to improve DSR-based speech recognition under noisy environment, this paper proposes a method which detects accurate speech region to extract accurate features. The proposed method distinguish speech and noise by using entropy and detection of spectral energy of speech. The speech detection by the spectral energy of speech shows good performance under relatively high SNR(SNR 15dB). But when the noise environment varies, the threshold between speech and noise also varies, and speech detection performance reduces under low SNR(SNR 0dB) environment. The proposed method uses the spectral entropy and harmonics of speech for better speech detection. Also, the performance of AFE is increased by precise speech detections. According to the result of experiment, the proposed method shows better recognition performance under noise environment.
PDF KSCI

Reduction Algorithm of Environmental Noise by Multi-band Filter (멀티밴드필터에 의한 환경잡음억압 알고리즘)

Choi, Jae-Seung
- Journal of the Korea Society of Computer and Information
- /
- v.17 no.8
- /
- pp.91-97
- /
- 2012
This paper first proposes the speech recognition algorithm by detection of the speech and noise sections at each frame, then proposes the reduction algorithm of environmental noise by multi-band filter which removes the background noises at each frame according to detection of the speech and noise sections. The proposed algorithm reduces the background noises using filter bank sub-band domain after extracting the features from the speech data. In this experiment, experimental results of the proposed noise reduction algorithm by the multi-band filter demonstrate using the speech and noise data, at each frame. Based on measuring the spectral distortion, experiments confirm that the proposed algorithm is effective for the speech by corrupted the noise.
https://doi.org/10.9708/jksci.2012.17.8.091 인용 PDF KSCI

Signal Detection Using Wavelet Transform in Fractional Brownian Motion (Fractional Brownian Motion 잡음환경 하에서 웨이브렛 변환을 이용한 신호의 검출)

김명진
- Proceedings of the Korea Institute of Convergence Signal Processing
- /
- 2000.08a
- /
- pp.21-24
- /
- 2000
Fractional Brownian motion(fBm)은 long-term persistence 특성을 가진 자연 현상, 1/f 잡음, 깊이가 낮은 해저에서의 배경음향잡음 등을 모델링하는데 많이 사용된다. 이 fBm은 nonstationary 유색잡음이다. 이러한 유색잡음 환경 하에서 신호를 검출하기 위한 한 방법은 Fredholm 적분방정식의 해를 구하는 것이다. 이 방정식을 이산화 하면 잡음의 공분산 행렬의 역행렬이 포함되어 계산량이 많다 본 논문에서는 fBm 잡음의 공분산 행렬을 웨이브렛 변환하여 얻어지는 행렬, 즉 fBm의 멀티스케일 성분들의 공분산행렬은 밴드화된 블록들로 근사화할 수 있다는 성질을 이용하여 적은 계산량으로 신호를 검출하는 알고리즘을 제안한다.
PDF

Speech Enhancement the Neural Network Filer (신경망필처를 이용한 음질향상)

김종우;공성근
- Journal of the Korean Institute of Intelligent Systems
- /
- v.10 no.4
- /
- pp.324-329
- /
- 2000
본 논문에서는 잡음환경에서의 음질향상(Speed Ehnacement) 시스템 구현을 목적으로 한다. 이를 위한 적응필터로서 LSM(Least Mean square)알고리즘 FIR필터를 적용한다. 또 정밀 필터로서 다충신경망(MLP, Multi-Layer Perceptorn) 필터를 적용한다. 잡음환경에서의 음성신호 복원 및 음질향상 시스템은 잡음에 의해 왜곡된 음성신호에서 잡음성분만을 제거함으로써 음성신호를 복원하는 시스템이다. 신경망 필터는 오차 역전과 학습 알고리즘에 의해 오차를 최소화 하는 방향으로 필터의 피라미터를 수정한다. 제안한 필터로 잡음환경에서의 음성신호복원 시스템을 구서오하고, 실험을 필터의 성능을 확인한다.
PDF

Robust Kalman based time varying spectral estimation in bursty noise environment (충격성 잡음환경에 강인한 Kanlman 시변 주파수 추정기법)

김한수
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1996.06a
- /
- pp.29-32
- /
- 1996
본 논문에서는 시변 주파수를 추정하기 위한 방법으로 기존의 시간 가중 칼만 추정기법에 변형된 Huber함수를 적용하여 충격성 잡음환경 하에서도 강인한 칼만추정기법을 제안하였다. 기존의 시간 가중 칼만 추정기법은 오차가 정규분포를 가진다고 가정된 상태에서는 적합한 파라메타 추정을 할 수 있지만 충격성 잡음이 존재하는 경우에는 수렴속도나 시변적응능력에서의 성능저하가 나타난다. 제안된 알고리듬은 영향함수 측면에서 충격성 잡음에 의해 생기는 오차의 크기를 제한함으로써 기포나 인위적인 충격성 잡음환경 하에서도 시변 주파수 추정을 할 수 있으며 알고리듬의 타당성은 모의실험을 통해 보였다.
PDF

Adaptive Threshold for Speech Enhancement in Nonstationary Noisy Environments (비정상 잡음환경에서 음질향상을 위한 적응 임계 치 알고리즘)

Lee, Soo-Jeong;Kim, Sun-Hyob
- The Journal of the Acoustical Society of Korea
- /
- v.27 no.7
- /
- pp.386-393
- /
- 2008
This paper proposes a new approach for speech enhancement in highly nonstationary noisy environments. The spectral subtraction (SS) is a well known technique for speech enhancement in stationary noisy environments. However, in real world, noise is mostly nonstationary. The proposed method uses an auto control parameter for an adaptive threshold to work well in highly nonstationary noisy environments. Especially, the auto control parameter is affected by a linear function associated with an a posteriori signal to noise ratio (SNR) according to the increase or the decrease of the noise level. The proposed algorithm is combined with spectral subtraction (SS) using a hangover scheme (HO) for speech enhancement. The performances of the proposed method are evaluated ITU-T P.835 signal distortion (SIG) and the segment signal to-noise ratio (SNR) in various and highly nonstationary noisy environments and is superior to that of conventional spectral subtraction (SS) using a hangover (HO) and SS using a minimum statistics (MS) methods.
https://doi.org/10.7776/ASK.2008.27.7.386 인용 PDF KSCI

PN Code Acquisition at Low Signal-to-Noise Ratio Based on Seed Accumulating Sequential Estimation (시드 누적 순차적 추정 기법을 이용한 낮은 신호대잡음비 환경에서의 의사 잡음 부호 획득)

윤석호;김선용
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.28 no.9A
- /
- pp.678-683
- /
- 2003
The pseudo-noise (PN) code acquisition based on the sequential estimation (SE) proposed by Ward performs well only at relatively high chip signal-to-noise ratios (SNRs). In this paper, a seed accumulating sequential estimation (SASE) method and a PN code acquisition system based on it are proposed, which perform well at low chip SNR (of practical interest) also. Then, the mean acquisition time performance of the proposed system is investigated. Numerical results show that the system based on the SASE performs dramatically better than that based on the SE at low chip SNR, and the improvement becomes larger as the period of PN code increases.
PDF KSCI

Adaptation of Classification Model for Improving Speech Intelligibility in Noise (음성 명료도 향상을 위한 분류 모델의 잡음 환경 적응)

Jung, Junyoung;Kim, Gibak
- Journal of Broadcast Engineering
- /
- v.23 no.4
- /
- pp.511-518
- /
- 2018
This paper deals with improving speech intelligibility by applying binary mask to time-frequency units of speech in noise. The binary mask is set to "0" or "1" according to whether speech is dominant or noise is dominant by comparing signal-to-noise ratio with pre-defined threshold. Bayesian classifier trained with Gaussian mixture model is used to estimate the binary mask of each time-frequency signal. The binary mask based noise suppressor improves speech intelligibility only in noise condition which is included in the training data. In this paper, speaker adaptation techniques for speech recognition are applied to adapt the Gaussian mixture model to a new noise environment. Experiments with noise-corrupted speech are conducted to demonstrate the improvement of speech intelligibility by employing adaption techniques in a new noise environment.
https://doi.org/10.5909/JBE.2018.23.4.511 인용 PDF KSCI KPUBS

An Analysis on Phone-Like Units for Korean Continuous Speech Recognition in Noisy Environments (잡음환경하의 연속 음성인식을 위한 유사음소단위 분석)

Shen Guang-Hu;Lim Soo-Ho;Seo Jun-Bae;Kim Joo-Gon;Jung Ho-Youl;Chung Hyun-Yeol
- Proceedings of the Acoustical Society of Korea Conference
- /
- autumn
- /
- pp.123-126
- /
- 2004
본 논문은 잡음환경 하에서의 효율적인 문맥의존 음향 모델 구성에 대한 기초연구로서 잡음환경 하에서의 유사 음소단위 수에 따른 연속 음성인식 성능을 비교, 평가한 결과에 대한 보고이다. 기존의 연구[1,2]로부터 연속음성 인식의 경우 문맥종속모델은 변이음을 고려한 39유사음소를 이용한 경우가 48유사음소를 이용하는 것보다 더 좋은 인식성능을 나타냄을 알 수 있었다. 이 연구 결과를 바탕으로 본 연구에서는 잡음환경에서도 효율적인 문맥 의존 음향모델을 구성하기 위한 기초 연구를 수행하였다. 다양한 잡음환경을 고려하기 위해 White, Pink, LAB 잡음을 신호 대 잡음비(Signal to Noise Ratio) 5dB, 10dB, 15dB 레벨로 음성에 부가한 후 각 유사음소단위 수에 따른 연속음성인식 실험을 수행하였다. 그 결과, 39유사음소를 이용한 경우가 48유사음소를 이용한 경우보다 clear 환경인 경우에 약 $7\%$와 $17\%$ 향상된 단어인식률과 문장 인식률을 얻을 수 있었으며, 각 잡음환경에서도 39유사음소를 이용한 경우가 48유사음소를 이용한 경우보다 평균 적으로 $17\%$와 $28\%$ 향상된 단어인식률과 문장인식률을 얻을 수 있어 39유사음소 단위가 한국어 연속음성인식에 더 적합하고 잡음환경에서도 유효함을 확인할 수 있었다.
PDF

Search Result 1,907, Processing Time 0.027 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)