Search | Korea Science

An Adaptive Speech Enhancement System Using Lateral Inhibition and Time-Delay Neural Network (상호억제와 시간지연 신경회로망을 사용한 적응적인 음성강조시스템)

Choi, Jae-Seung
- Journal of the Institute of Electronics Engineers of Korea SP / v.45 no.2 / pp.95-102 / 2008
This paper proposes an adaptive speech enhancement system, modeled on the auditory system, that enhances speech degraded by various background noises. The proposed system detects voiced and unvoiced sections, adaptively adjusts the coefficients for both the lateral inhibition and the amplitude component according to the detected section for each input frame, and then reduces the noise signal using a time-delay neural network. Signal-to-noise ratio measurements in the experiments confirm that the proposed system is effective for speech degraded by various noises.
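The abstract evaluates the system by signal-to-noise ratio but does not give the formula. A minimal sketch, assuming the common definition (clean-signal power over residual-error power, in dB); the function name `snr_db` and the sample values are illustrative and not taken from the paper:

```python
import math


def snr_db(clean, processed):
    """SNR in dB, treating (processed - clean) as the residual noise."""
    signal_power = sum(s * s for s in clean)
    noise_power = sum((p - s) ** 2 for s, p in zip(clean, processed))
    if noise_power == 0:
        return math.inf  # processed output matches the clean signal exactly
    return 10.0 * math.log10(signal_power / noise_power)


# Hypothetical samples: a clean signal and an enhanced output with a small offset.
clean = [1.0, -1.0, 1.0, -1.0]
enhanced = [1.1, -0.9, 1.1, -0.9]
print(snr_db(clean, enhanced))
```

Comparing this value before and after enhancement gives the improvement that such experiments typically report.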

I/O device of Minicomputer Using the Audio Cassette Deck (음성 Cassette Deck를 이용한 Minicomputer의 I/O 장치)

이주근;박찬곤
- Journal of the Korean Institute of Telematics and Electronics / v.12 no.3 / pp.1-7 / 1975
This paper describes a method of writing and reproducing high-density data with an ordinary audio cassette deck. For writing, the data N (NRZ code) are modulated into PM code, giving a positive code N(1) and a negative code N(0) taken from the complement of the NRZ code, and each is written to its own track of the two-channel tape. For reading, errors can be corrected and the clock pulse generated from the read pulse itself. Moreover, by adding a few simple circuits, the deck can be used for both data and audio without modifying its internal circuitry. Over the range of 25 Hz to 4 kHz, data could be written and reproduced at a transmission rate of 787 bps.

Scoring Methods for Improvement of Speech Recognizer Detecting Mispronunciation of Foreign Language (외국어 발화오류 검출 음성인식기의 성능 개선을 위한 스코어링 기법)

Kang Hyo-Won;Kwon Chul-Hong
- MALSORI / no.49 / pp.95-105 / 2004
An automatic pronunciation correction system provides learners with correction guidelines for each mispronunciation. For this purpose we develop a speech recognizer that automatically classifies pronunciation errors when Koreans speak a foreign language. To develop methods for automatic assessment of pronunciation quality, we propose a language-model-based score as the machine score in the speech recognizer. Experimental results show that the language-model-based score correlated more highly with human scores than the conventional log-likelihood-based score did.

Pronunciation Network Construction of Speech Recognizer for Mispronunciation Detection of Foreign Language (한국인의 외국어 발화오류 검출을 위한 음성인식기의 발음 네트워크 구성)

Lee Sang-Pil;Kwon Chul-Hong
- MALSORI / no.49 / pp.123-134 / 2004
An automatic pronunciation correction system provides learners with correction guidelines for each mispronunciation. In this paper we propose an HMM-based speech recognizer that automatically classifies pronunciation errors when Koreans speak Japanese. We also propose two pronunciation networks for automatic detection of mispronunciation. We evaluated the performance of the networks by computing the correlation between human ratings and the machine scores obtained from the speech recognizer.
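The evaluation described above compares human ratings with machine scores by correlation. The abstract does not say which coefficient was used; a minimal sketch, assuming the Pearson correlation coefficient and using made-up ratings:

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# Hypothetical data: expert ratings (1-5) and recognizer scores for five utterances.
human_ratings = [1, 2, 3, 4, 5]
machine_scores = [0.2, 0.35, 0.5, 0.7, 0.9]
print(pearson(human_ratings, machine_scores))
```

A value near 1 would indicate that the machine score ranks utterances much as the human raters do.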

Automatic Detection of Mispronunciation Using Phoneme Recognition For Foreign Language Instruction (음성인식기를 이용한 한국인의 외국어 발화오류 자동 검출)

Kwon Chul-Hong;Kang Hyo-Won;Lee Sang-Pil
- MALSORI / no.48 / pp.127-139 / 2003
An automatic pronunciation correction system provides learners with correction guidelines for each mispronunciation. In this paper we propose an HMM-based speech recognizer that automatically classifies pronunciation errors when Koreans speak Japanese. For this purpose we also develop phoneme recognizers for Korean and Japanese. Experimental results show that the machine scores of the proposed recognizer correlate well with expert ratings.

Voiced/Unvoiced/Silence Classification of Speech Signal Using Wavelet Transform (웨이브렛 변환을 이용한 음성신호의 유성음/무성음/묵음 분류)

손영호
- Proceedings of the Acoustical Society of Korea Conference / 1998.08a / pp.449-453 / 1998
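In general, speech signals are classified by waveform characteristics into three types: voiced sounds, whose waveform is quasi-periodic; unvoiced sounds, which are noise-like and have no periodicity; and silence, which corresponds to background noise. Conventional voiced/unvoiced/silence classification methods have widely used pitch information, energy, and zero-crossing rate as classification parameters. This paper proposes a voiced/unvoiced/silence classification algorithm whose parameter is the spectral change in the wavelet-transformed speech signal, and examines the detection results of the proposed algorithm and the problems that arise.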
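For context, the conventional parameters named in the abstract (energy and zero-crossing rate) can be sketched as below. This illustrates the baseline approach only, not the wavelet-based algorithm the paper proposes; the thresholds `energy_floor` and `zcr_split` are arbitrary assumptions:

```python
def frame_energy(frame):
    """Mean squared amplitude of one frame."""
    return sum(x * x for x in frame) / len(frame)


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)


def classify_frame(frame, energy_floor=1e-4, zcr_split=0.3):
    """Label a frame 'silence', 'unvoiced', or 'voiced' (thresholds are assumed)."""
    if frame_energy(frame) < energy_floor:
        return "silence"
    # Noise-like unvoiced speech crosses zero far more often than voiced speech.
    return "unvoiced" if zero_crossing_rate(frame) > zcr_split else "voiced"


# Hypothetical frames: a slow square wave, a rapidly alternating signal, and zeros.
slow = [0.5] * 20 + [-0.5] * 20 + [0.5] * 20 + [-0.5] * 20
fast = [0.5, -0.5] * 40
quiet = [0.0] * 80
print(classify_frame(slow), classify_frame(fast), classify_frame(quiet))
```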

A Comparative Study of Voice Activity Detection Algorithms in Adverse Environments (잡음 환경에서의 음성 검출 알고리즘 비교 연구)

Yang Kyong-Chul;Yook Dong-Suk
- Proceedings of the KSPS conference / 2006.05a / pp.45-48 / 2006
As speech recognition systems are used in many emerging applications, robust performance of these systems under extremely noisy conditions becomes more important. Voice activity detection (VAD) is regarded as one of the important factors for robust speech recognition. In this paper, we investigate conventional VAD algorithms and analyze the strengths and weaknesses of each.

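A Noise Subtraction Technique Considering Frequency Characteristics Based on the Short-Time Spectrum (단시간 스펙트럼에 기초한 주파수특성을 고려한 잡음차감 기법)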

Choe, Jae-Seung
- Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2015.10a / pp.824-826 / 2015
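Although the performance of speech recognition systems has improved greatly in recent years, problems still arise from issues such as noise. To improve recognition performance by addressing the noise problem in speech recognition systems, this paper proposes a noise subtraction algorithm that uses a Wiener filter and takes into account frequency characteristics based on the short-time spectrum. The proposed algorithm first detects a threshold in each frame and then distinguishes non-silent sections from silent sections. For each frame, noise subtraction by the Wiener filter method is applied in non-silent sections, and conventional noise subtraction is applied in silent sections.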
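The two-branch scheme in the abstract can be sketched roughly as follows. This is an illustration under stated assumptions, not the paper's algorithm: the silence test is taken to be a frame-energy threshold, the Wiener gain is estimated per frequency bin from power spectra, and the "conventional" branch is plain magnitude subtraction. All names and values are hypothetical.

```python
def wiener_gain(noisy_power, noise_power):
    """Per-bin Wiener-style gain: estimated clean power over noisy power."""
    if noisy_power <= 0:
        return 0.0
    return max(noisy_power - noise_power, 0.0) / noisy_power


def subtract_magnitude(noisy_mag, noise_mag):
    """Plain spectral subtraction on magnitudes, floored at zero."""
    return max(noisy_mag - noise_mag, 0.0)


def denoise_frame(noisy_mags, noise_mags, energy_threshold):
    """Pick the branch per frame from its mean energy (assumed silence test)."""
    frame_energy = sum(m * m for m in noisy_mags) / len(noisy_mags)
    if frame_energy >= energy_threshold:
        # Non-silent frame: scale each bin by its Wiener gain.
        return [m * wiener_gain(m * m, n * n) for m, n in zip(noisy_mags, noise_mags)]
    # Silent frame: subtract the noise magnitude estimate directly.
    return [subtract_magnitude(m, n) for m, n in zip(noisy_mags, noise_mags)]


# Hypothetical magnitude spectra for one frame and a noise estimate.
print(denoise_frame([2.0, 1.0], [1.0, 1.0], energy_threshold=1.0))
```

In practice the magnitudes would come from a short-time Fourier transform of each frame, and the noise estimate from frames already classified as silent.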

Rejection using Entropy in Speech Recognition System (음성인식 시스템에서 엔트로피를 이용한 거절)

정미옥;김현숙;송점동;이정현
- Proceedings of the Korean Information Science Society Conference / 1999.10b / pp.195-197 / 1999
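This paper uses the entropy information of candidate words in the post-processing stage to improve the accuracy of a speech recognition system. With the conventional likelihood-ratio detection method, system performance varies with the speech data, and the likelihood values of the N candidate words are similar, so the probability of misrecognition is high. In this paper, by contrast, a post-processing method is used that rejects candidates for which the entropy of words outside the recognition vocabulary is relatively lower than the entropy of each candidate word, yielding an accurate speech recognition system that is independent of the speech data and has greater discriminative power. Experimental results show that the proposed entropy-based post-processing improved recognition system performance by up to 3.6% over the likelihood-ratio method at a false alarm rate of 20%.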
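The abstract does not spell out its exact rejection rule, so the following is only a generic sketch of entropy-based rejection: N-best log-likelihoods are normalized into a distribution, and the utterance is rejected when that distribution is too flat. The normalization, the threshold `max_entropy`, and the scores are all assumptions for illustration.

```python
import math


def candidate_entropy(log_likelihoods):
    """Shannon entropy (nats) of the N-best scores after softmax normalization."""
    peak = max(log_likelihoods)  # subtract the peak for numerical stability
    weights = [math.exp(v - peak) for v in log_likelihoods]
    total = sum(weights)
    probs = [w / total for w in weights]
    return -sum(p * math.log(p) for p in probs if p > 0)


def should_reject(log_likelihoods, max_entropy):
    """Reject when the candidates are too evenly matched (assumed rule)."""
    return candidate_entropy(log_likelihoods) > max_entropy


# Hypothetical N-best lists: one ambiguous, one with a clear winner.
ambiguous = [-10.0, -10.0, -10.0, -10.0]
confident = [-1.0, -50.0, -50.0, -50.0]
print(should_reject(ambiguous, max_entropy=1.0), should_reject(confident, max_entropy=1.0))
```

Unlike a raw likelihood ratio, the entropy depends only on the relative spread of the candidate scores, which is one way to read the abstract's claim of independence from the speech data.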

A Study on Performance of Voice Activity Detector in Vocoder (음성부호화기에서의 음성 활동 검출 장치 성능에 관한 연구)

Min, So-Yeon;Lee, Kwang-Hyoung;Kim, Jung-Jae
- Proceedings of the KAIS Fall Conference / 2009.05a / pp.491-494 / 2009
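The G.723.1 speech coder, developed by ITU-T for Internet telephony and video conferencing, uses a VAD (Voice Activity Detector) and a CNG (Comfort Noise Generator) to lower the transmission rate during noise sections. The VAD makes its final decision on the presence of voice activity by comparing the energy level of the current frame. For more stable decisions, however, the G.723.1 VAD classifies almost all silent sections inserted between voice-active sections as voice-active. This paper proposes a method that, by deciding silent sections more accurately, can reduce the transmission rate further than the existing method. In experiments with sentences whose silent sections were lengthened, the transmission rate was reduced by about 50%, and a MOS test showed no degradation in sound quality.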
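The trade-off described above, stable decisions versus silence being coded as speech, can be illustrated with a simple energy-threshold detector that holds its "active" decision for a few frames after speech ends. This is not the G.723.1 VAD; the hold mechanism (`hangover_frames`), the threshold, and the energies are assumptions made for the sketch.

```python
def vad_decisions(frame_energies, threshold, hangover_frames):
    """Return one True/False activity flag per frame.

    A frame is active when its energy reaches the threshold, or when it falls
    within `hangover_frames` frames after the last such frame.
    """
    decisions = []
    remaining = 0
    for energy in frame_energies:
        if energy >= threshold:
            remaining = hangover_frames
            decisions.append(True)
        elif remaining > 0:
            remaining -= 1
            decisions.append(True)
        else:
            decisions.append(False)
    return decisions


# Hypothetical energies: speech, a three-frame pause, then speech again.
energies = [5.0, 0.0, 0.0, 0.0, 5.0]
print(vad_decisions(energies, threshold=1.0, hangover_frames=2))
print(vad_decisions(energies, threshold=1.0, hangover_frames=0))
```

With a long hold, most of the pause is flagged active and coded at the full rate; with a shorter hold, more frames can be handed to comfort-noise generation, which is the kind of saving the abstract reports.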

Search Result 726, Processing Time 0.042 seconds
