Search | Korea Science

Probabilistic Target Speech Detection and Its Application to Multi-Input-Based Speech Enhancement (확률적 목표 음성 검출을 통한 다채널 입력 기반 음성개선)

Lee, Young-Jae;Kim, Su-Hwan;Han, Seung-Ho;Han, Min-Soo;Kim, Young-Il;Jeong, Sang-Bae
- Phonetics and Speech Sciences
- /
- v.1 no.3
- /
- pp.95-102
- /
- 2009
In this paper, an efficient target speech detection algorithm is proposed for the performance improvement of multi-input speech enhancement. Using the normalized cross correlation value between two selected channels, the proposed algorithm estimates the probabilistic distribution function of the value from the pure noise interval. Then, log-likelihoods are calculated with the function and the normalized cross correlation value to detect the target speech interval precisely. The detection results are applied to the generalized sidelobe canceller-based algorithm. Experimental results show that the proposed algorithm significantly improves the speech recognition performance and the signal-to-noise ratios.
PDF

Recognition resolution enhancement of ultrasonic sensors via multiple steps of transmitter voltages

Na, Seung-You;Park, Min-Sang
- 제어로봇시스템학회:학술대회논문집
- /
- 1996.10a
- /
- pp.409-412
- /
- 1996
Ultrasonic sensors are widely used in various applications due to advantages of low cost, simplicity in construction, mechanical robustness, and little environmental restriction in usage. But the main purposes of the noncontact sensing are rather narrowly confined within object detection and distance measurement. For the application of object recognition, ultrasonic sensors exhibit several shortcomings of poor directionality which results in low spatial resolution of objects, and specularity which gives frequent erroneous range readings. To resolve these problems in object recognition, an array of the sensor has been used. To improve the spatial resolution, more number of sensors are used in essence throughout the various devices of the sensor arrays. Under the disguise of a fixed number of the sensors, the array can be shifted mechanically in several steps. In this paper we propose a practical sensor resolution enhancement method using an electronic circuit accompanying the sensor array. The circuit changes the transmitter output voltage in several steps. Using the known sensor characteristics, a set of different return echo signals provide enhanced spatial resolution. The improvement is obtained with neither the cost of the increased number of the sensors nor extra mechanical devices.
PDF

A Study on Hybrid Track Circuit Tag Recognition Enhancement (하이브리드 궤도회로 태그 인식율 향상에 관한 연구)

Yang, Dong-In;Li, Chang-Long;Jin, Zhe-Huan;Lee, Key-Seo;Ko, Yun-Seok
- The Journal of the Korea institute of electronic communication sciences
- /
- v.9 no.4
- /
- pp.537-542
- /
- 2014
Track circuit is a simple electrical device which lies in the connection of the two rails by the wheels and axle of locomotives and rolling stock to short out an electrical circuit, used to detect the absence of a train on rail tracks. In railway signaling system, there are similar systems such as RFID and wheel sensor, GPS etc, are research and developing. Hybrid track circuit is using RFID antenna and reader on the cab and RFID tag on the sleeper. because of the safety in railway operation, tag detection of train position detection function in the hybrid track circuit needs high reliability. This paper studied tag recognition enhancement used tag angles.
https://doi.org/10.13067/JKIECS.2014.9.4.537 인용 PDF KSCI

Pre-Processing for Performance Enhancement of Speech Recognition in Digital Communication Systems (디지털 통신 시스템에서의 음성 인식 성능 향상을 위한 전처리 기술)

Seo, Jin-Ho;Park, Ho-Chong
- The Journal of the Acoustical Society of Korea
- /
- v.24 no.7
- /
- pp.416-422
- /
- 2005
Speech recognition in digital communication systems has very low performance due to the spectral distortion caused by speech codecs. In this paper, the spectral distortion by speech codecs is analyzed and a pre-processing method which compensates for the spectral distortion is proposed for performance enhancement of speech recognition. Three standard speech codecs. IS-127 EVRC. ITU G.729 CS-ACELP and IS-96 QCELP. are considered for algorithm development and evaluation, and a single method which can be applied commonly to all codecs is developed. The performance of the proposed method is evaluated for three codecs, and by using the speech features extracted from the compensated spectrum. the recognition rate is improved by the maximum of $15.6\%$ compared with that using the degraded speech features.
PDF KSCI

A Model for Post-processing of Speech Recognition Using Syntactic Unit of Morphemes (구문형태소 단위를 이용한 음성 인식의 후처리 모델)

양승원;황이규
- Journal of Korea Society of Industrial Information Systems
- /
- v.7 no.3
- /
- pp.74-80
- /
- 2002
There are many researches on post-processing methods for the Korean continuous speech recognition enhancement using natural language processing techniques. It is very difficult to use a formal morphological analyzer for improving the speech recognition because the analysis technique of natural language processing is mainly for formal written languages. In this paper, we propose a speech recognition enhancement model using syntactic unit of morphemes. This approach uses the functional word level longest match which dose not consider spacing words. We describe the post-processing mechanism for the improving speech recognition by using proposed model which uses the relationship of phonological structure information between predicates md auxiliary predicates or bound nouns that are frequently occurred in Korean sentences.
PDF

Front-End Processing for Speech Recognition in the Telephone Network (전화망에서의 음성인식을 위한 전처리 연구)

Jun, Won-Suk;Shin, Won-Ho;Yang, Tae-Young;Kim, Weon-Goo;Youn, Dae-Hee
- The Journal of the Acoustical Society of Korea
- /
- v.16 no.4
- /
- pp.57-63
- /
- 1997
In this paper, we study the efficient feature vector extraction method and front-end processing to improve the performance of the speech recognition system using KT(Korea Telecommunication) database collected through various telephone channels. First of all, we compare the recognition performances of the feature vectors known to be robust to noise and environmental variation and verify the performance enhancement of the recognition system using weighted cepstral distance measure methods. The experiment result shows that the recognition rate is increasedby using both PLP(Perceptual Linear Prediction) and MFCC(Mel Frequency Cepstral Coefficient) in comparison with LPC cepstrum used in KT recognition system. In cepstral distance measure, the weighted cepstral distance measure functions such as RPS(Root Power Sums) and BPL(Band-Pass Lifter) help the recognition enhancement. The application of the spectral subtraction method decrease the recognition rate because of the effect of distortion. However, RASTA(RelAtive SpecTrAl) processing, CMS(Cepstral Mean Subtraction) and SBR(Signal Bias Removal) enhance the recognition performance. Especially, the CMS method is simple but shows high recognition enhancement. Finally, the performances of the modified methods for the real-time implementation of CMS are compared and the improved method is suggested to prevent the performance degradation.
PDF

Fingerprint Image Enhancement Based on a Modified Gator Filter (변형된 게이버 필터를 사용한 지문영상의 향상)

장원철;이동재;김재희
- Journal of the Institute of Electronics Engineers of Korea SP
- /
- v.40 no.1
- /
- pp.103-113
- /
- 2003
We must enhance a fingerprint image to improve the performance of a fingerprint recognition. Because of this reason, many researches were achieved about the fingerprint image enhancement. Representative method is to use Gabor-Filter among them. However GF has the weakness which a processing hour takes long. In this paper, we proposed Half Gabor Filter (HGF) to enhance the fingerprint image fast in the on-line. The HGF, however, can make calculation much simpler, as well as both minutiae-extraction rate and recognition rate. On the other hand, the fingerprint image to enhance using HGF has almost same with the case effectiveness to apply GF. In this paper, we confirme it mathematically and experimentally.
PDF KSCI

A Study of Image Enhancement Processing for Letter Extraction of Image Using Terahertz Signal (테라헤르츠 신호를 이용한 영상의 글자 추출을 위한 화질 개선처리에 대한 연구)

Kim, Seongyoon;Choi, Hyunkeun;Park, Inho;Kim, Youngseop;Lee, Yonghwan
- Journal of the Semiconductor & Display Technology
- /
- v.16 no.3
- /
- pp.111-115
- /
- 2017
Terahertz waves are superior to conventional X-ray or Magnetic Resonance Tomography(MRI), and the amount of information that can be transmitted is as large as thousands of times that conventional X-ray or MRI. In addition, Terahertz waves have great performance in analyzing an object which have some layered structure. By using this advantage, we can extract the letters of a page by analyzing information such as absorption amount and reflection amount by irradiating a closed book with pulses of various frequencies within gap of a terahertz wave. However, in the image of each page using the Terahertz wave might be obtained various kinds of noise and the different character occlusion region. So, to extract letters from the terahertz image, we must take the noise and occlusion region away. We have been working to enhancement the image quality in various ways, and keep on studying de-noising processing for enhancement about the image quality and high resolution. Finally, we also keep on studying about OCR(Optical Character Recognition) technology, which based on pattern matching technique, to read letters.
PDF

Fingerprint Image Enhancement using a Modified Anisotropic Gaussian Filter (개선된 Anisotropic Gaussian 필터를 이용한 지문 영상 향상)

조희덕;김상희;박원우
- Proceedings of the IEEK Conference
- /
- 2003.11a
- /
- pp.293-296
- /
- 2003
The enhancement of fingerprint image is necessary to improve the performance of fingerprint recognition. The enhancement of fingerprint image with Gabor Filter(GF) is widely used. However GF has the weakness such as long processing time and the sensitivity to ridge frequency. To overcome these weaknesses, we propose a Modified Anisotropic Gaussian Filter(MAGF) which is modified from Anisotropic Filter proposed by S. Greenburg's(SAF). This proposed MAGF can reduce the calculation time of ridge frequency and improve the weakness of sensitivity to ridge frequency. We also explained that MAGF is better than others mathematically and experimentally.
PDF

A Noise Robust Speech Recognition Method Using Model Compensation Based on Speech Enhancement (음성 개선 기반의 모델 보상 기법을 이용한 강인한 잡음 음성 인식)

Shen, Guang-Hu;Jung, Ho-Youl;Chung, Hyun-Yeol
- The Journal of the Acoustical Society of Korea
- /
- v.27 no.4
- /
- pp.191-199
- /
- 2008
In this paper, we propose a MWF-PMC noise processing method which enhances the input speech by using Mel-warped Wiener Filtering (MWF) at pre-processing stage and compensates the recognition model by using PMC (Parallel Model Combination) at post-processing stage for speech recognition in noisy environments. The PMC uses the residual noise extracted from the silence region of enhanced speech at pre-processing stage to compensate the clean speech model and thus this method is considered to improve the performance of speech recognition in noisy environments. For recognition experiments we dew.-sampled KLE PBW (Phoneme Balanced Words) 452 word speech data to 8kHz and made 5 different SNR levels of noisy speech, i.e., 0dB. 5dB, 10dB, 15dB and 20dB, by adding Subway, Car and Exhibition noise to clean speech. From the recognition results, we could confirm the effectiveness of the proposed MWF-PMC method by obtaining the improved recognition performances over all compared with the existing combined methods.
https://doi.org/10.7776/ASK.2008.27.4.191 인용 PDF KSCI

Search Result 362, Processing Time 0.027 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)