Search | Korea Science

Signal Detection Using Wavelet Transform in Fractional Brownian Motion (Fractional Brownian Motion 잡음환경 하에서 웨이브렛 변환을 이용한 신호의 검출)

김명진
- Proceedings of the Korea Institute of Convergence Signal Processing
- /
- 2000.08a
- /
- pp.21-24
- /
- 2000
Fractional Brownian motion(fBm)은 long-term persistence 특성을 가진 자연 현상, 1/f 잡음, 깊이가 낮은 해저에서의 배경음향잡음 등을 모델링하는데 많이 사용된다. 이 fBm은 nonstationary 유색잡음이다. 이러한 유색잡음 환경 하에서 신호를 검출하기 위한 한 방법은 Fredholm 적분방정식의 해를 구하는 것이다. 이 방정식을 이산화 하면 잡음의 공분산 행렬의 역행렬이 포함되어 계산량이 많다 본 논문에서는 fBm 잡음의 공분산 행렬을 웨이브렛 변환하여 얻어지는 행렬, 즉 fBm의 멀티스케일 성분들의 공분산행렬은 밴드화된 블록들로 근사화할 수 있다는 성질을 이용하여 적은 계산량으로 신호를 검출하는 알고리즘을 제안한다.
PDF

Region Extraction of License Plates in Noise Environment Using YUV Color Space Convert (YUV컬러 공간변환에 의한 잡음환경의 차량번호판 영역추출)

Kim Jae-Nam;Choi Tae-Il;Kim Byung-Ki
- The KIPS Transactions:PartD
- /
- v.13D no.1 s.104
- /
- pp.125-132
- /
- 2006
The existing recognition system of license plates cannot get the satisfactory result in noise environments. The purpose of this paper is to propose an algorithm that can recognize the region of license plates accurately in a noise environment. The algorithm is formulated by reorganizing the U- and V-channels of YUV color space as YUV is insensitive to light and carries less data than RGB color information. The region of license plates has been extracted by the geometric characteristics, sizes, and places of labeling images. The proposed algorithm was found to improve the process of extracting the region of license plates in various noise environments.
https://doi.org/10.3745/KIPSTD.2006.13D.1.125 인용 PDF KSCI

Speech Recognition in Time-varying Noisy Environments using the Histogram Technique (히스토그램 처리방법을 이용한 시변 잡음환경에서의 음성인식)

권영욱;김형순
- The Journal of the Acoustical Society of Korea
- /
- v.17 no.3
- /
- pp.47-51
- /
- 1998
잡음 환경에서의 음성인식을 위해서는 일반적으로 전처리 과정에서 잡음의 스펙트 럼을 잘 추정할 필요가 있다. 본 논문에서는 시변잡음 환경에서 히스토그램 처리방법에 의 해 잡음의 스펙트럼을 추정하고 이를 제거하는 방법으로 스펙트럼 차감법을 사용하였다. 히 스토그램 처리방법은 음성/비음성 구간의 구분을 할 필요가 없으며 서서히 변화하는 잡음의 스펙트럼도 추정할 수 있다는 점에서 기존 방식에 비해 장점을 지닌다. 다양한 SNR 조건하 에서 시간에 따라 에너지, 그리고 주파수가 변화하는 유색 가우시안 잡음을 부가시킨 음성 에 대해, 화자독립 고립단어 인식실험을 수행하였다. 실험결과, 히스토그램 처리방법에 기반 을 둔 스펙트럼 차감법을 적용할 경우가 기존의 잡음 스펙트럼 추정방법에 비해 인식성능이 우수하였다.
PDF

Automatic Syllable Segmentation Algorithm in Noise Additional Continuous Speech (잡음이 첨가된 연속음성에서의 자동 음절분할 알고리즘)

Kim, Young-Sub;Cha, Young-Dong;Kim, Chang-Keun;Lee, Kwang-Seok;Hur, Kang-In
- Proceedings of the Korea Institute of Convergence Signal Processing
- /
- 2006.06a
- /
- pp.17-20
- /
- 2006
본 논문에서는 잡음이 첨가된 연속음성에서의 자동 음절분할을 위해 기존에 사용되고 있는 특징 파라미터인 단구간 에너지 이외에 잡음에 강인한 특성을 가지고 있는 새로운 특징인 스펙트럼 밀도비교척도와 의사역행렬을 이용한 선형판별함수를 제안한다. 기존에 사용되는 단구간 에너지는 잡음이 없는 환경에서는 좋은 성능을 나타내지만 잡음환경에서는 그렇지 못하다. 반면에 논문에서 제안한 척도들은 반대의 성능을 가지므로 주변잡음의 크기에 따라 각각의 파라미터를 적절한 가중치로 조합하는 음절구간 결정함수와 유한상태 머신을 추가로 사용면 무 잡음 환경뿐만 아니라, 잡음이 첨가된 연속음성에서도 일정수준 이상의 음절구간을 분리해 낼 수 있다.
PDF

Using speech enhancement parameter for ASR (잡음환경의 ASR 성능개선을 위한 음성강조 파라미터)

Cha, Young-Dong;Kim, Young-Sub;Hur, Kang-In
- Proceedings of the Korea Institute of Convergence Signal Processing
- /
- 2006.06a
- /
- pp.63-66
- /
- 2006
음성인식시스템은 사람이 별도의 장비 없이 음성만으로 시스템의 사용이 가능한 편리한 장점을 지니고 있으나 여러 가지 기술적인 어려움과 실제 환경의 낮은 인식률로 폭넓게 사용되지 못한 상황이다. 그 중 배경잡음은 음성인식의 인식률을 저하시키는 원인으로 지적 받고 있다. 이러한 잡음환경에 있는 ASR(Automatic Speech Recognition)의 성능 향상을 위해 외측억제 기능 이 추가된 파라미터를 제안한다. ASR 에서 널리 사용되는 파라미터인 MFCC을 본 논문에서 제안한 파라미터와 HMM를 이용하여 인식률을 비교하여 성능을 비교하였다. 실험결과를 통해 제안된 파라미터의 사용을 통해 잡음환경에 있는 ASR의 성능 향상을 확인할 수 있었다.
PDF

Implementation of Speaker Independent Speech Recognizer in Noise Environment based on DSP (DSP기반의 잡음환경에 강인한 화자 독립 음성 인식기 구현)

박진영;권호민;박정원;김창근;허강인
- Proceedings of the IEEK Conference
- /
- 2003.11a
- /
- pp.69-72
- /
- 2003
본 논문에서는 범용 DSP를 이용한 잡음환경에 강인한 음성인식 시스템을 구현하였다. 구현된 시스템은 TI사의 범용 DSP인 TMS320C32를 이용하였고, 실시간 음성 입력을 위한 음성 Codec과 외부 인터페이스를 확장하여 인식결과를 출력하도록 구성하였다. 또한, 기존의 음성 인식 시스템에 사용한 파라메터에 대한 고찰과 ICA를 이용하여 잡음 환경에 강인한 음성 특징 파라메터를 제안하고 성능 비교 실험을 하였다. 제안된 ICA 파라메터를 적용하여 음성인식 시스템을 구현하였다. 그리고, 독립적으로 동작 가능한 음성인식 시스템의 응용 예로 무선자동차에 적용시켜 실험했다.
PDF

Speech Recognition by Integrating Audio, Visual and Contextual Features Based on Neural Networks (신경망 기반 음성, 영상 및 문맥 통합 음성인식)

김명원;한문성;이순신;류정우
- Journal of the Institute of Electronics Engineers of Korea CI
- /
- v.41 no.3
- /
- pp.67-77
- /
- 2004
The recent research has been focused on fusion of audio and visual features for reliable speech recognition in noisy environments. In this paper, we propose a neural network based model of robust speech recognition by integrating audio, visual, and contextual information. Bimodal Neural Network(BMNN) is a multi-layer perception of 4 layers, each of which performs a certain level of abstraction of input features. In BMNN the third layer combines audio md visual features of speech to compensate loss of audio information caused by noise. In order to improve the accuracy of speech recognition in noisy environments, we also propose a post-processing based on contextual information which are sequential patterns of words spoken by a user. Our experimental results show that our model outperforms any single mode models. Particularly, when we use the contextual information, we can obtain over 90% recognition accuracy even in noisy environments, which is a significant improvement compared with the state of art in speech recognition. Our research demonstrates that diverse sources of information need to be integrated to improve the accuracy of speech recognition particularly in noisy environments.
PDF KSCI

PN Code Acquisition at Low Signal-to-Noise Ratio Based on Seed Accumulating Sequential Estimation (시드 누적 순차적 추정 기법을 이용한 낮은 신호대잡음비 환경에서의 의사 잡음 부호 획득)

윤석호;김선용
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.28 no.9A
- /
- pp.678-683
- /
- 2003
The pseudo-noise (PN) code acquisition based on the sequential estimation (SE) proposed by Ward performs well only at relatively high chip signal-to-noise ratios (SNRs). In this paper, a seed accumulating sequential estimation (SASE) method and a PN code acquisition system based on it are proposed, which perform well at low chip SNR (of practical interest) also. Then, the mean acquisition time performance of the proposed system is investigated. Numerical results show that the system based on the SASE performs dramatically better than that based on the SE at low chip SNR, and the improvement becomes larger as the period of PN code increases.
PDF KSCI

A Robust Speech/Non-Speech Decision Using Voiced Characteristics of Speech (음성의 유성음 특성을 이용한 음성/비음성 판별 방법)

Lee, Sung-Joo;Jung, Ho-Young;Lee, Yun-Keun;Kim, Hyung-Soon
- Proceedings of the Korea Information Processing Society Conference
- /
- 2007.05a
- /
- pp.411-412
- /
- 2007
자동음성인식 시스템을 이용하는 사용자 입장에서 보면 음성인식시스템을 사용하기 위하여 음성을 입력할 때마다 버튼을 눌러야 하는 Push-To-Talk (PTT) 방식은 여간 번거로운 일이 아닐 수 없다. 그리고 사용자가 원거리에서 음성을 입력하는 경우처럼 PTT 방식 자체가 용이하지 못 한 음성인식 응용분야에서는 Non-Push-To-Talk (NON-PTT) 방식의 필요성이 대두되게 된다. NON-PTT 방식의 음성 전처리를 위해서는 입력신호로부터 음성신호만을 구분해내는 음성판별기술이 필수적이다. 하지만 일상적인 잡음환경에서 음성신호만을 구분해내는 일은 매우 어려운 일이 아닐 수 없다. 본 논문에서는 일상적인 가정잡음환경에 강인한 음성판별방식을 제안한다. 여기서는 음성판별을 위해서 음성의 유성음 특성을 이용하였다. 즉, 일정구간 이상의 음성신호에는 일정구간이상의 유성음 구간이 존재하며 만약 잡음환경에서도 유성음 구간을 잘 검출할 수 있다면 이러한 음성의 특성을 이용하여 검출된 신호가 음성인지 아닌지를 판별할 수 있다. 이를 위하여 여기서는 가정잡음환경에서도 유성음을 잘 검출할 수 있도록 11 가지 유성음 특징들과 이를 이용한 음성판별방법을 제안하였다. 제안된 방법의 성능 평가를 위하여 음성의 끝점검출방법과 통합하여 음성/비음성 판별 테스트를 수행하였으며 테스트 수행결과 열악한 잡음환경에서 80%이상의 비음성을 거절하는 성능을 보였다.
PDF

Adaptation of Classification Model for Improving Speech Intelligibility in Noise (음성 명료도 향상을 위한 분류 모델의 잡음 환경 적응)

Jung, Junyoung;Kim, Gibak
- Journal of Broadcast Engineering
- /
- v.23 no.4
- /
- pp.511-518
- /
- 2018
This paper deals with improving speech intelligibility by applying binary mask to time-frequency units of speech in noise. The binary mask is set to "0" or "1" according to whether speech is dominant or noise is dominant by comparing signal-to-noise ratio with pre-defined threshold. Bayesian classifier trained with Gaussian mixture model is used to estimate the binary mask of each time-frequency signal. The binary mask based noise suppressor improves speech intelligibility only in noise condition which is included in the training data. In this paper, speaker adaptation techniques for speech recognition are applied to adapt the Gaussian mixture model to a new noise environment. Experiments with noise-corrupted speech are conducted to demonstrate the improvement of speech intelligibility by employing adaption techniques in a new noise environment.
https://doi.org/10.5909/JBE.2018.23.4.511 인용 PDF KSCI KPUBS

Search Result 1,911, Processing Time 0.027 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)