Integrated Search | Korea Science

A Study of Speech Enhancement through Wavelet Analysis Using Auditory Mechanism

이준석;길세기;홍준표;홍승홍
- Proceedings of the IEEK Conference / IEEK 2002 Summer Conference Proceedings (4) / pp.397-400 / 2002
This paper studies a speech enhancement method for noisy environments. To this end, we draw on the human auditory mechanism, regarded as an ideal system, and apply the wavelet transform. The multi-resolution property of the wavelet transform enables multiband spectrum analysis similar to that of the human ear. The method was verified to be very effective for enhancing noisy speech.

Eigenvoice Adaptation of Classification Model for Binary Mask Estimation

김기백
- Journal of Broadcast Engineering / Vol. 20, No. 1 / pp.164-170 / 2015
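This paper addresses the adaptation of a binary mask classification model, which is used to remove noise from speech signals acquired in noisy environments. Previous studies have shown that applying the binary mask technique to noisy data can improve speech intelligibility. A drawback, however, is that training the binary mask classification model requires data from the test environment. To adapt the binary mask classification model to a new noise environment, this paper applies the eigenvoice method, a speaker adaptation technique widely used in speech recognition. In the experiments, performance was evaluated in terms of hit rate and false alarm rate as a function of the amount of adaptation data. The results confirm that performance improves as more data from the new noise environment is used to adapt the model.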
https://doi.org/10.5909/JBE.2015.20.1.164
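
The hit rate and false alarm rate mentioned in the abstract above can be illustrated with a minimal sketch. It assumes the common definition of an ideal binary mask (a time-frequency unit is 1 when its local SNR exceeds a criterion) and does not reproduce the paper's classifier or its eigenvoice adaptation. The function names, the `lc_db` parameter, and the toy data are all hypothetical.

```python
def ideal_binary_mask(speech_power, noise_power, lc_db=0.0):
    """Mark a time-frequency unit 1 when its local SNR exceeds lc_db, else 0."""
    ratio = 10.0 ** (lc_db / 10.0)  # dB criterion converted to a power ratio
    return [[1 if s > ratio * n else 0 for s, n in zip(s_row, n_row)]
            for s_row, n_row in zip(speech_power, noise_power)]


def hit_and_false_alarm(estimated, ideal):
    """Return (hit rate, false alarm rate) of an estimated mask against the ideal one."""
    hits = false_alarms = ones = zeros = 0
    for e_row, i_row in zip(estimated, ideal):
        for e, i in zip(e_row, i_row):
            if i == 1:
                ones += 1
                hits += e          # estimated 1 where the ideal mask is 1
            else:
                zeros += 1
                false_alarms += e  # estimated 1 where the ideal mask is 0
    hit_rate = hits / ones if ones else 0.0
    fa_rate = false_alarms / zeros if zeros else 0.0
    return hit_rate, fa_rate


# Toy 2x2 power spectrograms (rows: frames, columns: frequency bands).
speech = [[4.0, 1.0], [9.0, 0.5]]
noise = [[1.0, 2.0], [1.0, 1.0]]
ideal = ideal_binary_mask(speech, noise)
estimated = [[1, 1], [0, 0]]  # hypothetical classifier output
hit, fa = hit_and_false_alarm(estimated, ideal)
```

A better adapted classifier would raise `hit` while lowering `fa`, which is the trade-off the abstract reports against the amount of adaptation data.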

Psychological Reduction Effect of Road Traffic Noise Perception by the Visual Information of Landscape Components

국찬;장길수;신용규
- KIEAE Journal / Vol. 3, No. 2 / pp.33-36 / 2003
Visual information can considerably influence sound perception. Furthermore, when sound perception extends beyond loudness to noisiness or annoyance, it depends even more on the form of the visual information. This paper aims to estimate the influence of several kinds of visual information on the perception of road traffic noise by means of a psycho-acoustic test. The findings on the influence of visual information on subjective noise perception are summarized as follows. Presenting visual images of mild and comfortable scenery reduced the noise perception response in less noisy environments not exceeding 65 dB(A). In highly noisy environments exceeding 65 dB(A), however, noise perception could be reduced by a strong image of a waterfall. Even eliminating the road traffic image may be helpful. The visual image of a waterfall reduced noise perception at all levels. It is inferred that road traffic noise perception can be effectively ameliorated by presenting strong and real landscape images in any noisy environment.

Adaptive Threshold for Speech Enhancement in Nonstationary Noisy Environments

이수정;김순협
- The Journal of the Acoustical Society of Korea / Vol. 27, No. 7 / pp.386-393 / 2008
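This paper proposes a new method for speech enhancement in nonstationary noisy environments. Spectral subtraction is a well-known noise reduction method for speech enhancement in stationary noise. Real noise environments, however, are mostly nonstationary. The proposed method uses an automatic control parameter for an adaptive threshold so that it works well across various noise types and nonstationary conditions. Specifically, the automatic control parameter applies a linear function of the a posteriori SNR to control the adaptive threshold as the noise level rises and falls. For speech enhancement, the proposed algorithm is combined with spectral subtraction using hangover (HO). Performance was evaluated in various noise environments with the ITU-T P.835 signal distortion (SIG) rating and the segmental signal-to-noise ratio (SNR), and the results were superior to those of voice activity detection with HO and of the minimum statistics (MS) method.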
https://doi.org/10.7776/ASK.2008.27.7.386
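
The idea of steering a noise-update threshold with a linear function of the a posteriori SNR, then applying spectral subtraction, can be sketched roughly as follows. This is an illustration under stated assumptions, not the paper's algorithm: the coefficients (`intercept`, `slope`), the clamping range, the smoothing factor, and the spectral floor are hypothetical placeholders, and the hangover scheme is omitted.

```python
def adaptive_threshold(gamma, intercept=1.5, slope=0.1, lo=1.0, hi=3.0):
    """Hypothetical linear rule in the a posteriori SNR, clamped to [lo, hi]."""
    return min(hi, max(lo, intercept + slope * gamma))


def spectral_subtract(noisy_power, noise_power, floor=0.01):
    """Power spectral subtraction with a spectral floor to avoid negative power."""
    return [max(y - n, floor * y) for y, n in zip(noisy_power, noise_power)]


def process_frame(noisy_power, noise_power, smoothing=0.9):
    """Update the noise estimate on noise-like frames, then subtract it."""
    # Frame-level a posteriori SNR: noisy power over estimated noise power.
    gamma = sum(noisy_power) / sum(noise_power)
    if gamma < adaptive_threshold(gamma):
        # Frame looks like noise only: track it with recursive averaging.
        noise_power = [smoothing * n + (1.0 - smoothing) * y
                       for y, n in zip(noisy_power, noise_power)]
    return spectral_subtract(noisy_power, noise_power), noise_power


noise = [1.0, 1.0]
speech_out, noise = process_frame([10.0, 6.0], noise)  # gamma = 8.0, above threshold 2.3
quiet_out, noise = process_frame([1.2, 0.8], noise)    # gamma = 1.0, below threshold 1.6
```

In the first call the frame is treated as speech and the noise estimate is left alone. In the second the estimate is nudged toward the quiet frame before subtraction.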

Syllable-Type-Based Phoneme Weighting Techniques for Listening Intelligibility in Noisy Environments

이영호;주종한;최승호
- Phonetics and Speech Sciences / Vol. 6, No. 3 / pp.165-169 / 2014
The intelligibility of speech transmitted to listeners can be significantly degraded by ambient noise in noisy environments such as auditoriums and train stations. Noise-masked speech is hard for listeners to recognize. Among conventional methods for improving speech intelligibility, the consonant-vowel intensity ratio (CVR) approach reinforces the power of all consonants. However, an excessively reinforced consonant does not help recognition, and only some consonants are improved by the CVR approach. In this paper, we propose the corrective weighting (CW) approach, which reinforces the power of consonants differently according to the Korean syllable type, namely consonant-vowel-consonant (CVC), consonant-vowel (CV), and vowel-consonant (VC), taking the listeners' level of recognition into account. Evaluated with a subjective test, the Comparison Category Rating (CCR) test of ITU-T P.800, the proposed CW approach performed better, scoring 0.18 and 0.24 higher than the unprocessed speech and the CVR approach, respectively.
https://doi.org/10.13064/KSSS.2014.6.3.165

Noisy Data Aggregation with Independent Sensors: Insights and Open Problems

Murayama, Tatsuto;Davis, Peter
- Journal of Multimedia Information System / Vol. 3, No. 2 / pp.21-26 / 2016
Our networked world has been growing exponentially fast. The explosion in the volume of machine-to-machine (M2M) transactions threatens to exceed the transport capacity of the networks that link them. It is therefore essential to reconsider the tradeoff between using many data sets and using good data sets. We focus on this tradeoff in the context of the quality of information aggregated from many sensors in a noisy environment. We start with a basic theoretical model considered in the famous "CEO problem" in information theory. From the viewpoint of large deviations, we find a simple statement of the optimal strategies under a limited network capacity condition. Moreover, we propose an open problem for a sensor network scenario and report a numerical result.
https://doi.org/10.9717/JMIS.2016.3.2.21

Selective Pole Filtering Based Feature Normalization for Performance Improvement of Short Utterance Recognition in Noisy Environments

최보경;반성민;김형순
- Phonetics and Speech Sciences / Vol. 9, No. 2 / pp.103-110 / 2017
The pole filtering concept has been successfully applied to cepstral feature normalization techniques for noise-robust speech recognition. In this paper, it is proposed to apply the pole filtering selectively only to the speech intervals, in order to further improve the recognition performance for short utterances in noisy environments. Experimental results on AURORA 2 task with clean-condition training show that the proposed selectively pole-filtered cepstral mean normalization (SPFCMN) and selectively pole-filtered cepstral mean and variance normalization (SPFCMVN) yield error rate reduction of 38.6% and 45.8%, respectively, compared to the baseline system.
https://doi.org/10.13064/KSSS.2017.9.2.103
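
For readers unfamiliar with the normalization that the abstract above builds on, here is a minimal sketch of plain cepstral mean normalization (CMN) and cepstral mean and variance normalization (CMVN) over one utterance. It is only the standard baseline: the pole filtering and the selective restriction to speech intervals proposed in the paper are not implemented, and the function name and toy features are hypothetical.

```python
import math


def cmvn(frames, normalize_variance=True):
    """Normalize each cepstral dimension to zero mean (and unit variance) per utterance."""
    n = len(frames)
    dim = len(frames[0])
    means = [sum(f[d] for f in frames) / n for d in range(dim)]
    if not normalize_variance:
        # Plain CMN: subtract the per-dimension mean only.
        return [[f[d] - means[d] for d in range(dim)] for f in frames]
    # Population standard deviation per dimension; fall back to 1.0 when it is zero.
    stds = [math.sqrt(sum((f[d] - means[d]) ** 2 for f in frames) / n) or 1.0
            for d in range(dim)]
    return [[(f[d] - means[d]) / stds[d] for d in range(dim)] for f in frames]


# Toy utterance: two frames of two-dimensional cepstral features.
features = [[0.0, 2.0], [4.0, 2.0]]
cmn_out = cmvn(features, normalize_variance=False)
cmvn_out = cmvn(features)
```

With short utterances the mean and variance estimates come from very few frames, which is the weakness the selective pole-filtered variants in the abstract aim to reduce.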

Implementation of Variable Threshold Dual Rate ADPCM Speech CODEC Considering the Background Noise

양재석;한경호
- Proceedings of the KIEE Conference / KIEE 2000 Summer Conference Proceedings D / pp.3166-3168 / 2000
This paper proposes a variable threshold dual rate ADPCM coding method, modified from the standard ADPCM of ITU G.726, to improve speech quality. By using the zero crossing rate (ZCR), variable threshold dual rate ADPCM achieves better speech quality than single rate ADPCM in noisy environments without increasing complexity. The ZCR is used to divide input signal samples into two categories, noisy and speech: samples with a higher ZCR are categorized as the noisy region, and samples with a lower ZCR as the speech region. The noisy region uses a higher threshold value and is compressed at 16 kbps to reduce the bit rate, while the speech region uses a lower threshold value and is compressed at 40 kbps to improve speech quality. Compared with conventional ADPCM, which adopts a fixed coding rate, the proposed method improves the noise characteristics without increasing the bit rate. For real-time applications, ZCR calculation was considered as a simple method of obtaining background noise information as a preprocessing step for speech analysis such as the FFT, and the experiment showed that this simple calculation can be used without increasing complexity. Dual rate ADPCM can efficiently decrease the amount of transferred data without increasing complexity or reducing speech quality. The results of this paper can therefore be applied to real-time speech applications such as Internet phones or VoIP.
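
The ZCR-based rate selection described above can be sketched in a few lines. This is a minimal illustration, assuming frame-wise processing and a hypothetical ZCR threshold of 0.3; only the 16 kbps and 40 kbps rates and the higher-ZCR-means-noisy rule come from the abstract. The ADPCM coder itself is not implemented.

```python
def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    if len(frame) < 2:
        return 0.0
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)


def select_bit_rate(frame, zcr_threshold=0.3):
    """Return 16 (kbps) for a noisy-looking frame, 40 (kbps) for a speech-looking one."""
    # Higher ZCR is taken as the noisy region, lower ZCR as the speech region.
    return 16 if zero_crossing_rate(frame) > zcr_threshold else 40


noisy_frame = [1, -1, 1, -1, 1]   # sign flips on every sample
speech_frame = [1, 2, 3, 2, 1]    # no sign changes
rates = [select_bit_rate(noisy_frame), select_bit_rate(speech_frame)]
```

Because the ZCR needs only sign comparisons and a count, it adds little cost per frame, which matches the abstract's point about avoiding extra complexity.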

A Study of the Response of Teachers and Students to Road Traffic Noise

김증호;이경종;문영한;노재훈;윤명조
- Journal of Preventive Medicine and Public Health / Vol. 28, No. 4 / pp.773-782 / 1995
The purpose of this study is to reveal how road traffic noise influences the responses of teachers and students in terms of disturbance of conversation, studying, and relaxation, and physical disturbance. The research method was a self-administered questionnaire. The survey sample comprised 420 persons (114 teachers and 306 students) from two junior high schools exposed to traffic noise below 65 dB(A), and 410 persons (140 teachers and 270 students) from two noisy junior high schools where road traffic noise exceeded 65 dB(A). Teachers and students in the noisy schools (above 65 dB) reported disturbance of conversation, studying, and relaxation, and physical disturbance, at much higher rates than those in the less noisy schools (p<0.01). With respect to time of day and season, the subjects answered that traffic noise caused the most trouble and stress in the afternoon (12:00 - 17:00) and in summer, respectively. Comprehensive and fundamental governmental measures are needed to improve noisy school environments.

Application of Resampling Method based on Statistical Hypothesis Test for Improving the Performance of Particle Swarm Optimization in a Noisy Environment

최선한
- Journal of the Korea Society for Simulation / Vol. 28, No. 4 / pp.21-32 / 2019
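Particle swarm optimization, inspired by models of the social behavior of swarms, is a representative metaheuristic optimization algorithm used for tasks ranging from solving complex optimization problems to training artificial neural networks. However, because the algorithm was originally developed for deterministic environments without stochastic noise, it has often been difficult to apply to real problems in which stochastic noise is present. To address this, this paper applies a resampling method based on statistical hypothesis testing, defined as an uncertainty evaluation method. With this method, the global best of the particles, which has the greatest influence on the performance of particle swarm optimization, is identified accurately, so that the particles converge to the optimal solution more accurately and quickly in noisy environments. Comparative experiments against existing algorithms on various benchmark problems demonstrate the improved performance of the proposed algorithm, and the results of a case study underline the need for this work. We expect the results to be effectively applicable to simulation-based system optimization, for example through digital twins, in the era of the Fourth Industrial Revolution.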
https://doi.org/10.9709/JKSS.2019.28.4.021
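
The general idea of guarding the global-best update with resampling and a hypothesis test can be sketched as below. This is not the paper's uncertainty evaluation method: it swaps in a generic Welch t-statistic with a fixed critical value, assumes minimization, and uses hypothetical names (`welch_t`, `update_global_best`, `noisy_sphere`) and a hypothetical noise level.

```python
import math
import random
import statistics


def welch_t(a, b):
    """Welch's t-statistic for the difference of means of samples a and b."""
    diff = statistics.mean(a) - statistics.mean(b)
    se = math.sqrt(statistics.variance(a) / len(a) + statistics.variance(b) / len(b))
    if se == 0.0:
        # No sampling noise at all: the sign of the difference decides.
        return math.copysign(math.inf, diff) if diff else 0.0
    return diff / se


def update_global_best(best, candidate, evaluate, n_samples=10, t_crit=2.0):
    """Replace the global best only if the candidate is significantly better (lower)."""
    best_samples = [evaluate(best) for _ in range(n_samples)]
    cand_samples = [evaluate(candidate) for _ in range(n_samples)]
    return candidate if welch_t(cand_samples, best_samples) < -t_crit else best


rng = random.Random(0)


def noisy_sphere(x):
    """One-dimensional sphere function with additive Gaussian noise (hypothetical)."""
    return x * x + rng.gauss(0.0, 0.1)


accepted = update_global_best(2.0, 0.5, noisy_sphere)  # clearly better candidate
rejected = update_global_best(0.5, 2.0, noisy_sphere)  # clearly worse candidate
```

A single noisy evaluation could let a worse particle displace the global best by chance. Comparing resampled means with a test makes that replacement require statistical evidence, at the cost of extra evaluations.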

Search results: 390 items; processing time: 0.026 s