Search | Korea Science

Noise Reduction Using the Standard Deviation of the Time-Frequency Bin and Modified Gain Function for Speech Enhancement in Stationary and Nonstationary Noisy Environments

Lee, Soo-Jeong;Kim, Soon-Hyob
- The Journal of the Acoustical Society of Korea / v.26 no.3E / pp.87-96 / 2007
In this paper we propose a new noise reduction algorithm for stationary and nonstationary noisy environments. Our algorithm classifies the speech and noise signal contributions in time-frequency bins, and is not based on a spectral algorithm or a minimum statistics approach. It relies on calculating the ratio of the standard deviation of the noisy power spectrum in time-frequency bins to its normalized time-frequency average. We show that good quality can be achieved for the enhanced speech signal by choosing appropriate values for $\delta_t$ and $\delta_f$. The proposed method greatly reduces the noise while providing enhanced speech with lower residual noise and somewhat higher mean opinion score (MOS), background intrusiveness (BAK) and signal distortion (SIG) scores than conventional methods.

A Welding Inspection of Small-sized Metalized Film Capacitor with Large Capacity (소형.대용량 Metalized Film Capacitor의 용접 오차 검출 개발)

Jeong, Won-Young;Oh, Choon-Suk;Ryu, Young-Kee;Lim, Jong-Seul;Lee, Seo-Young
- Proceedings of the KIEE Conference / 2004.11c / pp.135-137 / 2004
In this study we deal with small-sized, large-capacity metalized film capacitors whose heads measure $5\,\mathrm{mm}\times5\,\mathrm{mm}\times2.5\,\mathrm{mm}$. Lead wires are welded to both sides of each capacitor. A position gap between the welding machine and the lead wire supplier can cause a welding error. In addition, during the tapping process of the metalized film capacitors, an interval error among the capacitors, a length error of the lead frame attached to the capacitors, and a straightness distortion of the lead frame can occur. These four kinds of error parameters are measured and analyzed using an automatic visual inspection system implemented with a CCD camera, optical parts, background lighting, and image processing algorithms. In the field test, we achieve a success rate above 99% in detecting the welding faults of capacitors.

Harmonics-based Spectral Subtraction and Feature Vector Normalization for Robust Speech Recognition

Beh, Joung-Hoon;Lee, Heung-Kyu;Kwon, Oh-Il;Ko, Han-Seok
- Speech Sciences / v.11 no.1 / pp.7-20 / 2004
In this paper, we propose a two-step noise compensation algorithm in feature extraction for achieving robust speech recognition. The proposed method requires no a priori information on noisy environments and is simple to implement. First, in the frequency domain, Harmonics-based Spectral Subtraction (HSS) is applied to reduce the additive background noise and make the shape of the harmonics in the speech spectrum more pronounced. We then apply a judiciously weighted variance Feature Vector Normalization (FVN) to compensate for both the channel distortion and the additive noise. The weighted variance FVN compensates for the variance mismatch in both the speech and the non-speech regions. Representative performance evaluation using the Aurora 2 database shows that the proposed method yields a 27.18% relative improvement in accuracy under a multi-noise training task and a 57.94% relative improvement under a clean training task.
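As a rough illustration of what feature vector normalization does, the sketch below applies plain per-dimension mean and variance normalization to a short sequence of feature frames. It is not the weighted-variance FVN of the paper, whose weighting is not given in the abstract; the function name `mean_variance_normalize` and the sample data are assumptions made for this example.

```python
import math

def mean_variance_normalize(frames):
    """Scale each feature dimension to zero mean and unit variance.

    Plain (unweighted) normalization; the paper's weighted-variance
    variant is not reproduced here.
    """
    n = len(frames)
    dims = len(frames[0])
    means = [sum(f[d] for f in frames) / n for d in range(dims)]
    stds = []
    for d in range(dims):
        var = sum((f[d] - means[d]) ** 2 for f in frames) / n
        # Guard against constant dimensions to avoid division by zero.
        stds.append(math.sqrt(var) if var > 0 else 1.0)
    return [[(f[d] - means[d]) / stds[d] for d in range(dims)] for f in frames]

# Hypothetical two-dimensional features for three frames.
features = [[1.0, 10.0], [2.0, 20.0], [3.0, 30.0]]
normalized = mean_variance_normalize(features)
```

After normalization both dimensions share the same scale, which is the property that makes features from mismatched channels more comparable.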

Face Detection based on Video Sequence (비디오 영상 기반의 얼굴 검색)

Ahn, Hyo-Chang;Rhee, Sang-Burm
- Journal of the Semiconductor & Display Technology / v.7 no.3 / pp.45-49 / 2008
Face detection and tracking in video sequences has advanced with the commercialization of teleconferencing, telecommunication, face-recognition front ends for surveillance systems, and video-phone applications. Complex backgrounds, color distortion caused by lighting, and varying luminance conditions have hindered face recognition systems. In this paper, we study face recognition in video sequences. We extract the facial area using the luminance and chrominance components of the $YC_bC_r$ color space. After extracting the facial area, we apply a face recognition system based on our improved algorithm, which combines PCA and LDA. The proposed algorithm achieves a 92% recognition rate, which is more accurate than previous methods that use PCA alone or a combination of PCA and LDA.
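The facial-area extraction step can be pictured with a minimal sketch that converts RGB pixels to $YC_bC_r$ and thresholds the chrominance components. The conversion uses the full-range BT.601 coefficients; the threshold ranges `CB_RANGE` and `CR_RANGE` are commonly quoted skin-tone bounds chosen here as an assumption, not values from the paper, and the sketch ignores the luminance component that the abstract also mentions.

```python
def rgb_to_ycbcr(r, g, b):
    """Convert 8-bit RGB to full-range YCbCr (BT.601 coefficients)."""
    y = 0.299 * r + 0.587 * g + 0.114 * b
    cb = 128 - 0.168736 * r - 0.331264 * g + 0.5 * b
    cr = 128 + 0.5 * r - 0.418688 * g - 0.081312 * b
    return y, cb, cr

# Hypothetical chrominance bounds for skin; not taken from the paper.
CB_RANGE = (77, 127)
CR_RANGE = (133, 173)

def is_skin(r, g, b):
    """Return True when the pixel's Cb and Cr fall inside the assumed bounds."""
    _, cb, cr = rgb_to_ycbcr(r, g, b)
    return CB_RANGE[0] <= cb <= CB_RANGE[1] and CR_RANGE[0] <= cr <= CR_RANGE[1]

def skin_mask(image):
    """Build a boolean mask for an image given as rows of (R, G, B) tuples."""
    return [[is_skin(*px) for px in row] for row in image]

# Tiny 2x2 test image: two skin-like tones, pure blue, and white.
image = [[(200, 150, 120), (0, 0, 255)],
         [(255, 255, 255), (210, 160, 130)]]
mask = skin_mask(image)
```

Thresholding chrominance rather than RGB makes the mask less sensitive to brightness changes, which is one reason this color space is popular for skin segmentation.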

A Research of Circular Polarized Wave Antenna for the Improvement of Transmitting/Receiving Ability of Telemetry System (원격측정 시스템의 송수신 능력 향상을 위한 원편파 안테나 연구)

유제택;이장명구상화
- Proceedings of the IEEK Conference / 1998.06a / pp.141-144 / 1998
An L-band omnidirectional circularly polarized wave antenna is designed and evaluated for transmitting and receiving vehicle data. A conventional linearly polarized wave antenna cannot clearly receive all of the vehicle data arriving from across the wide driving test range because of distortion. To overcome this problem, an omnidirectional circularly polarized wave antenna is required. For the design, the characteristics, design principle, and theoretical background of circularly polarized waves with little signal loss are first reviewed. The characteristics of the designed antenna are analyzed and compared with the desired ones. Our results demonstrate that the strength of the vehicle data is flat enough over the full test range using this new antenna.

Speech Recognition in the Car Noise Environment (자동차 소음 환경에서 음성 인식)

김완구;차일환;윤대희
- Journal of the Korean Institute of Telematics and Electronics B / v.30B no.2 / pp.51-58 / 1993
This paper describes the development of a speaker-dependent isolated word recognizer applied to voice dialing in a car noise environment. For this purpose, several methods for improving performance under such conditions are evaluated using a database collected in a small car moving at 100 km/h. The main features of the recognizer are as follows. The endpoint detection error can be reduced by using the magnitude of the signal that is inverse filtered by the AR model of the background noise, and it can be compensated for by using variants of the DTW algorithm. To remove the noise, an autocorrelation subtraction method is used with the constraint that the residual energy obtainable by linear predictive analysis should be positive. By using a noise-robust distance measure, distortion of the feature vector is minimized. The speech recognizer is implemented using the Motorola DSP56001 (a 24-bit general-purpose digital signal processor). The recognition database is composed of 50 Korean names spoken by 3 male speakers. The recognition error rate of the system is reduced to 4.3% using a single reference pattern for each word and 1.5% using 2 reference patterns for each word.
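For readers unfamiliar with DTW, the following sketch shows the textbook dynamic time warping distance between two one-dimensional sequences. It is the basic algorithm only, not the endpoint-compensating variants the abstract refers to, and the function name `dtw_distance` and the absolute-difference local cost are assumptions made for illustration.

```python
def dtw_distance(a, b):
    """Textbook DTW distance between two numeric sequences.

    Uses |x - y| as the local cost and allows match, insertion,
    and deletion steps with no slope constraints.
    """
    inf = float("inf")
    n, m = len(a), len(b)
    # cost[i][j] is the best accumulated cost aligning a[:i] with b[:j].
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])
            cost[i][j] = local + min(cost[i - 1][j],      # deletion
                                     cost[i][j - 1],      # insertion
                                     cost[i - 1][j - 1])  # match
    return cost[n][m]

# A time-stretched copy aligns perfectly; a shifted copy does not.
stretched = dtw_distance([1, 2, 3], [1, 2, 2, 3])
shifted = dtw_distance([1, 2, 3], [2, 3, 4])
```

In an isolated word recognizer, a distance of this kind would be computed between the input utterance's feature sequence and each stored reference pattern, and the word with the smallest distance would be selected.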

Object Boundary Block Coding Using Block Merging Method (블록 병합 기법을 이용한 객체 경계 부분 부호화)

이희습;김정식;김정우;이근영
- Proceedings of the IEEK Conference / 1999.11a / pp.577-580 / 1999
Padding is a technique that enables the conventional discrete cosine transform (DCT) to encode boundary blocks of arbitrarily shaped objects by assigning imaginary values to the pixels that are not included in the object. Padding prevents the increase of high-frequency DCT coefficients. However, in some boundary blocks, too many padded pixels are coded because only a small portion of the pixels belong to the object. To reduce the number of padded pixels and to improve coding efficiency, we propose a block merging method for texture coding. The proposed method searches the shape information of boundary blocks, excludes a $4\times4$ sub-block of an $8\times8$ block if all of its pixels are in the background region, and merges the remaining $4\times4$ sub-blocks into new $8\times8$ blocks. Experimental results show that our proposed method yields a rate-distortion gain of about 0.5~1.6 dB compared to the conventional padding method, LPE
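The sub-block selection described in this abstract can be sketched as follows: for each $8\times8$ boundary block, keep only the $4\times4$ sub-blocks that contain at least one object pixel, then count how many $8\times8$ blocks are needed once the kept sub-blocks are packed four to a block. The function names and the simple packing count are assumptions for illustration; the abstract does not specify the actual merging order.

```python
def occupied_subblocks(mask):
    """Return offsets (row, col) of the 4x4 sub-blocks of an 8x8 shape
    mask that contain at least one object pixel (truthy value)."""
    occupied = []
    for r0 in (0, 4):
        for c0 in (0, 4):
            if any(mask[r][c]
                   for r in range(r0, r0 + 4)
                   for c in range(c0, c0 + 4)):
                occupied.append((r0, c0))
    return occupied

def merged_block_count(masks):
    """Number of 8x8 blocks needed after packing the occupied 4x4
    sub-blocks of all boundary blocks, four sub-blocks per block."""
    total = sum(len(occupied_subblocks(m)) for m in masks)
    return (total + 3) // 4  # ceiling division

# Hypothetical boundary block: object pixels only in the top-left corner.
corner_block = [[1 if r < 4 and c < 4 else 0 for c in range(8)]
                for r in range(8)]
kept = occupied_subblocks(corner_block)
blocks_needed = merged_block_count([corner_block] * 4)
```

In this toy case four boundary blocks, each with a single occupied sub-block, collapse into one merged block, so three blocks' worth of padded pixels would no longer be coded.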

The Interchange in Drawing Styles between Cartoon and Fashion illustration (만화와 패션 일러스트레이션의 그림체적 교류)

Sung, Kwang-Sook
- Journal of the Korean Society of Costume / v.59 no.4 / pp.82-97 / 2009
This study shows that the drawing styles of cartoon and fashion illustration are mutually linked and interchanged. The common background of drawing style between cartoon and fashion illustration is as follows: 1. A means of image communication through mass communication. 2. Similarities as visual signs. 3. The borderlessness of painting, illustration, and cartoon. 4. Use of common drawing expressions such as deformation, distortion, exaggeration, metaphor, and metonymy. The interchange of drawing styles between cartoon and fashion illustration is as follows: 1. Similarities in figure and face: contemporary style, similar figure, Anime style, and humorous style. 2. Similarities in the manner of expression: focus on line, simplification, computer graphics mixed with hand drawing, artistic expression, and multimedia techniques.

Correction of Specular Region on Document Images (문서 영상의 전반사 영역 보정 기법)

Simon, Christian;Williem;Park, In Kyu
- Proceedings of the Korean Society of Broadcast Engineers Conference / 2013.11a / pp.239-240 / 2013
The quality of document images captured by a digital camera may be degraded by non-uniform illumination. High illumination (glare distortion) reduces the contrast of the text in the document image, so an optical character recognition (OCR) system may fail to recognize text in the highly illuminated area. This paper proposes a method to increase the contrast between text (foreground) and background in the highly illuminated area.

Text Extraction in HIS Color Space by Weighting Scheme

Le, Thi Khue Van;Lee, Gueesang
- Smart Media Journal / v.2 no.1 / pp.31-36 / 2013
Robust and efficient text extraction is very important for the accuracy of Optical Character Recognition (OCR) systems. Natural scene images with degradations such as uneven illumination, perspective distortion, complex backgrounds, and multi-color text pose many challenges to computer vision tasks, especially text extraction. In this paper, we propose a method for extracting text from signboard images based on a combination of the mean shift algorithm and a weighting scheme of hue and saturation in the HSI color space for the clustering algorithm. The number of clusters is determined automatically by mean shift-based density estimation, in which local clusters are estimated by repeatedly searching for higher-density points in the feature vector space. The weighting scheme of hue and saturation is used to formulate a new distance measure in cylindrical coordinates for text extraction. Experimental results on various natural scene images are presented to demonstrate the effectiveness of our approach.
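To make the idea of a distance in cylindrical coordinates concrete, the sketch below treats hue as an angle, saturation as a radius, and intensity as height, and computes a weighted Euclidean distance between two HSI colors. The formula and the weights `w_chroma` and `w_intensity` are assumptions for illustration only; the abstract does not give the paper's actual weighting scheme.

```python
import math

def hsi_distance(p, q, w_chroma=1.0, w_intensity=1.0):
    """Weighted distance between two HSI colors in cylindrical coordinates.

    p and q are (hue_degrees, saturation, intensity) tuples. The chromatic
    term is the squared chord length between the two points in the
    hue/saturation plane (law of cosines); the weights are hypothetical.
    """
    h1, s1, i1 = p
    h2, s2, i2 = q
    # cos() is periodic, so hue wrap-around (e.g. 350 vs 10) needs no
    # special handling.
    dh = math.radians(h1 - h2)
    chroma_sq = s1 * s1 + s2 * s2 - 2.0 * s1 * s2 * math.cos(dh)
    total = w_chroma * chroma_sq + w_intensity * (i1 - i2) ** 2
    # Clamp tiny negative values caused by floating-point rounding.
    return math.sqrt(max(total, 0.0))

# Hues 350 and 10 are 20 degrees apart, the same as hues 0 and 20.
wrapped = hsi_distance((350.0, 0.5, 0.2), (10.0, 0.5, 0.2))
direct = hsi_distance((0.0, 0.5, 0.2), (20.0, 0.5, 0.2))
```

Lowering `w_intensity` relative to `w_chroma` would make clustering less sensitive to uneven illumination, which is one plausible motivation for weighting the components differently.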
