통합 검색 | Korea Science

청각 및 시가 정보를 이용한 강인한 음성 인식 시스템의 구현 (Constructing a Noise-Robust Speech Recognition System using Acoustic and Visual Information)

이종석;박철훈
- 제어로봇시스템학회논문지
- /
- 제13권8호
- /
- pp.719-725
- /
- 2007
In this paper, we present an audio-visual speech recognition system for noise-robust human-computer interaction. Unlike usual speech recognition systems, our system utilizes the visual signal containing speakers' lip movements along with the acoustic signal to obtain robust speech recognition performance against environmental noise. The procedures of acoustic speech processing, visual speech processing, and audio-visual integration are described in detail. Experimental results demonstrate the constructed system significantly enhances the recognition performance in noisy circumstances compared to acoustic-only recognition by using the complementary nature of the two signals.
https://doi.org/10.5302/J.ICROS.2007.13.8.719 인용 PDF KSCI

Acoustic releaser 제어를 위한 강인한 수중음향신호 인식 알고리즘의 개발 (A Development of Robust Underwater Sound Signal Recognition Algorithm for Acoustic Releaser)

김영진;허경무
- 전자공학회논문지SC
- /
- 제41권3호
- /
- pp.33-38
- /
- 2004
본 논문에서는 해저의 환경변화에 따른 외란 요소에 영향을 받지 않고, 안정적으로 음파신호를 인식할 수 있는 수중음향 신호인식 알고리즘을 제안하였다. 제안하는 알고리즘은 배터리에 의존하여 장시간 운용되어야하는 시스템에 적합한 저소비전력형으로서, 신속하게 음파신호를 인식할 수 있고 해양환경 변화에 대한 안정성을 확보할 수 있으며, 이의 효율성을 수학적모델에 따른 수치시험과 회로실험을 통하여 확인하였다.
PDF KSCI

AE 신호 형상 인식법에 의한 회전체의 신호 검출 및 분류 연구 (Detection and Classification of Defect Signals from Rotator by AE Signal Pattern Recognition)

김구영;이강용;김희수;이현
- 한국철도학회논문집
- /
- 제4권3호
- /
- pp.79-86
- /
- 2001
The signal pattern recognition method by acoustic emission signal is applied to detect and classify the defects of a journal bearing in a power plant. AE signals of main defects such as overheating, wear and corrosion are obtained from a small scale model. To detect and classify the defects, AE signal pattern recognition program is developed. As the classification methods, the wavelet transformation analysis, the frequency domain analysis and time domain analysis are used. Among three analyses, the wavelet transformation analysis is most effective to detect and classify the defects of the journal bearing..
PDF

패턴인식기법을 이용한 공구마멸상태의 분류 (The Classification of Tool Wear States Using Pattern Recognition Technique)

이종항;이상조
- 대한기계학회논문집
- /
- 제17권7호
- /
- pp.1783-1793
- /
- 1993
Pattern recognition technique using fuzzy c-means algorithm and multilayer perceptron was applied to classify tool wear states in turning. The tool wear states were categorized into the three regions 'Initial', 'Normal', 'Severe' wear. The root mean square(RMS) value of acoustic emission(AE) and current signal was used for the classification of tool wear states. The simulation results showed that a fuzzy c-means algorithm was better than the conventional pattern recognition techniques for classifying ambiguous informations. And normalized RMS signal can provide good results for classifying tool wear. In addition, a fuzzy c-means algorithm(success rate for tool wear classification : 87%) is more efficient than the multilayer perceptron(success rate for tool wear classification : 70%).
https://doi.org/10.22634/KSME.1993.17.7.1783 인용 PDF

심해저용 원격 착탈 시스템 제어를 위한 수중음향신호 인식 알고리즘의 개발 (A Development of Underwater Sound Signal Recognition Algorithm for Acoustic Releaser in the Seafloor)

김영진;우종식;조영준;허경무
- 제어로봇시스템학회논문지
- /
- 제10권5호
- /
- pp.421-427
- /
- 2004
In order to exploit underwater resources successfully, the first step would be a marine environmental research and exploration in the seafloor. Generally one sets up a long-term underwater experimental unit in the seafloor and retrieves the unit later after a certain period time. Essential to these applications is the reliable teleoperation and telemetering of the unit. In this paper we presents a robust underwater sound recognition algorithm by which we can identify the sound signal without the influence of disturbances due to underwater environmental changes. The proposed method provides a means suitable for the acoustic releaser which requires low power dissipation and long-time underwater operation. We demonstrate its ability of securing stability and fast sound recognition through simulation methods.
https://doi.org/10.5302/J.ICROS.2004.10.5.421 인용 PDF KSCI

감마톤 특징 추출 음향 모델을 이용한 음성 인식 성능 향상 (Speech Recognition Performance Improvement using Gamma-tone Feature Extraction Acoustic Model)

안찬식;최기호
- 디지털융복합연구
- /
- 제11권7호
- /
- pp.209-214
- /
- 2013
음성 인식 시스템에서는 인식 성능 향상을 위한 방법으로 인간의 청취 능력을 인식 시스템에 접목하였으며 잡음 환경에서 음성 신호와 잡음을 분리하여 원하는 음성 신호만을 선택할 수 있도록 구성되었다. 하지만 실용적 측면에서 음성 인식 시스템의 성능 저하 요인으로 인식 환경 변화에 따른 잡음으로 인한 음성 검출이 정확하지 못하여 일어나는 것과 학습 모델이 일치하지 않는 것을 들 수 있다. 따라서 본 논문에서는 음성 인식 향상을 위해 감마톤을 이용하여 특징을 추출하고 음향 모델을 이용한 학습 모델을 제안하였다. 제안한 방법은 청각 장면 분석을 이용한 특징을 추출을 통해 인간의 청각 인지 능력을 반영하였으며 인식을 위한 학습 모델 과정에서 음향 모델을 이용하여 인식 성능을 향상시켰다. 성능 평가를 위해 잡음 환경의 -10dB, -5dB 신호에서 잡음 제거를 수행하여 SNR을 측정한 결과 3.12dB, 2.04dB의 성능이 향상됨을 확인하였다.
https://doi.org/10.14400/JDPM.2013.11.7.209 인용 PDF

음향학적 및 언어적 탐색을 이용한 어휘 인식 최적화 (The Vocabulary Recognition Optimize using Acoustic and Lexical Search)

안찬식;오상엽
- 한국멀티미디어학회논문지
- /
- 제13권4호
- /
- pp.496-503
- /
- 2010
어휘인식 시스템은 스탠드 얼론(Standalone)으로 개발되어 지고 있으며 휴대용 단말기에서 사용하였을 경우 메모리 공간의 제약과 오디오 압축으로 인해 인식률이 낮게 나타난다. 본 연구에서는 휴대용 단말기의 성능과 인식률 향상을 위하여 음향학적 탐색과 언어적 탐색을 분리하여 어휘 인식 속도를 개선한 시스템을 제안하였다. 음향학적 탐색은 휴대용 단말기에서 수행하고 보다 복잡한 언어적 탐색은 서버에서 처리하는 시스템으로 음성신호로부터 특징벡터를 추출하여 GMM을 이용한 음소인식을 수행하고, 인식된 음소 열을 서버로 전송하여 렉시컬 트리 탐색 알고리즘을 사용하여 언어적 탐색 단계에서 어휘 인식을 수행하였다. 시스템 성능 평가 결과 어휘 종속 인식률은 98.01%, 어휘 독립 인식률은 97.71%의 인식률을 나타냈으며 인식속도는 1.58초로 나타내었다.
PDF KSCI

딕셔너리 러닝을 이용한 음파 신호 분류기 설계 (Acoustic Signal Classifier Design using Dictionary Learning)

박성민;사성진;오광명;이희승
- 자동차안전학회지
- /
- 제8권1호
- /
- pp.19-25
- /
- 2016
As new car technology is developing, temporal interaction is needed in automotive. Rhythmic pattern is one of the practical examples of temporal interaction in vehicle. To recognize rhythmic pattern and its input medium, dictionary learning is applicable algorithm. In this paper, performance and memory requirement of the learning algorithm is tested and is sufficiently good for use this acoustic sound.
https://doi.org/10.22680/kasa.2016.8.1.019 인용 PDF

MLHF 모델을 적용한 어휘 인식 탐색 최적화 시스템 (Vocabulary Recognition Retrieval Optimized System using MLHF Model)

안찬식;오상엽
- 한국컴퓨터정보학회논문지
- /
- 제14권10호
- /
- pp.217-223
- /
- 2009
모바일 단말기의 어휘 인식 시스템에서는 통계적 방법에 의한 어휘인식을 수행하고 N-gram을 이용한 통계적 문법 인식 시스템을 사용한다. 인식 대상이 되는 어휘의 수가 증가하면 어휘 인식 알고리즘이 복잡해지고 대규모의 탐색공간을 필요로 하게 되며 처리시간이 길어지므로 제한된 연산처리 능력과 메모리로는 처리하기가 불가능하다. 따라서 본 논문에서는 이러한 단점을 개선하고 어휘 인식을 최적화하기 위하여 MLHF 시스템을 제안한다. MLHF는 FLaVoR의 구조를 이용하여 음향학적 탐색과 언어적 탐색을 분리하여 음향학적 탐색에서는 HMM을 사용하고 언어적 탐색 단계에서는 Levenshtein distance 알고리즘을 사용한다. 시스템 성능 평가 결과 어휘 종속 인식률은 98.63%, 어휘 독립 인식률은 97.91%의 인식률을 나타냈으며 인식속도는 1.61초로 나타내었다.
https://doi.org/10.9708/jksci.2009.14.10.217 인용 PDF

음성인식 성능 개선을 위한 다중작업 오토인코더와 와설스타인식 생성적 적대 신경망의 결합 (Combining multi-task autoencoder with Wasserstein generative adversarial networks for improving speech recognition performance)

고조원;고한석
- 한국음향학회지
- /
- 제38권6호
- /
- pp.670-677
- /
- 2019
음성 또는 음향 이벤트 신호에서 발생하는 배경 잡음은 인식기의 성능을 저하시키는 원인이 되며, 잡음에 강인한 특징을 찾는데 많은 노력을 필요로 한다. 본 논문에서는 딥러닝을 기반으로 다중작업 오토인코더(Multi-Task AutoEncoder, MTAE) 와 와설스타인식 생성적 적대 신경망(Wasserstein GAN, WGAN)의 장점을 결합하여, 잡음이 섞인 음향신호에서 잡음과 음성신호를 추정하는 네트워크를 제안한다. 본 논문에서 제안하는 MTAE-WGAN는 구조는 구배 페널티(Gradient Penalty) 및 누설 Leaky Rectified Linear Unit (LReLU) 모수 Parametric ReLU (PReLU)를 활용한 변수 초기화 작업을 통해 음성과 잡음 성분을 추정한다. 직교 구배 페널티와 파라미터 초기화 방법이 적용된 MTAE-WGAN 구조를 통해 잡음에 강인한 음성특징 생성 및 기존 방법 대비 음소 오인식률(Phoneme Error Rate, PER)이 크게 감소하는 성능을 보여준다.
https://doi.org/10.7776/ASK.2019.38.6.670 인용 PDF KSCI

검색결과 71건 처리시간 0.024초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)