Search | Korea Science

A New Endpoint Detection Method Based on Chaotic System Features for Digital Isolated Word Recognition System

Zang, Xian;Chong, Kil-To
- Proceedings of the IEEK Conference
- /
- 2009.05a
- /
- pp.37-39
- /
- 2009
In the research of speech recognition, locating the beginning and end of a speech utterance in a background of noise is of great importance. Since the background noise presenting to record will introduce disturbance while we just want to get the stationary parameters to represent the corresponding speech section, in particular, a major source of error in automatic recognition system of isolated words is the inaccurate detection of beginning and ending boundaries of test and reference templates, thus we must find potent method to remove the unnecessary regions of a speech signal. The conventional methods for speech endpoint detection are based on two simple time-domain measurements - short-time energy, and short-time zero-crossing rate, which couldn't guarantee the precise results if in the low signal-to-noise ratio environments. This paper proposes a novel approach that finds the Lyapunov exponent of time-domain waveform. This proposed method has no use for obtaining the frequency-domain parameters for endpoint detection process, e.g. Mel-Scale Features, which have been introduced in other paper. Comparing with the conventional methods based on short-time energy and short-time zero-crossing rate, the novel approach based on time-domain Lyapunov Exponents(LEs) is low complexity and suitable for Digital Isolated Word Recognition System.
PDF

Efficient Vocabulary Optimization Management using VCOR (VCOR를 이용한 효율적인 어휘 최적화 관리)

Oh, Sang-Yeob
- Journal of Korea Multimedia Society
- /
- v.13 no.10
- /
- pp.1436-1443
- /
- 2010
In vocabulary recognition system has it's bad points of processing vocabulary unseen triphone and then no got distribution of confidence measure by cannot normalization. According to this problem to improve suggested VCOR(Version Control for Out-of Rejection) system by out-of vocabulary rejection algorithm use vocabulary management optimization and then phone data search support. In VCOR system to provide vocabulary information efficiently offering for user's vocabulary information using extend facet classification that improved for vocabulary measure management function offering accuracy of recognition for vocabulary. In this paper proposed system performance as a result of represent vocabulary dependence recognition rate of 97.56%, vocabulary independence recognition rate of 96.23%.
PDF KSCI

Speech Recognition System in Car Noise Environment (자동차 잡음환경에서의 음성인식시스템)

Kim, Soo-Hoon;Ahn, Jong-Young
- Journal of Digital Contents Society
- /
- v.10 no.1
- /
- pp.121-127
- /
- 2009
The automotive ECU(Electronic Control Unit) becomes more complicated and is demanding many functions. For example, many automobile companies are developing driver convenience systems such as power window switch, LCM(Light Control Module), mirror control system, seat memory. In addition, many researches and developments for DIS(Driver Information System) are in progress. It is dangerous to operate such systems in driving. In this paper, we implement the speech recognition system which controls the car convenience system using speech, and apply the preprocessing filter to improve the speech recognition rate in car noise environment. As a result, we get the good speech recognition rate in car noise environment.
PDF

On the Use of a Parallel-Branch Subunit Mod디 in Continuous HMM for improved Word Recognition (연속분포 HMM에서 평행분기 음성단위를 사용한 단어인식율 향상연구)

Park, Yong-Kyuo;Un, Chong-Kwan
- The Journal of the Acoustical Society of Korea
- /
- v.14 no.2E
- /
- pp.25-32
- /
- 1995
In this paper, we propose to use a parallel-branch subunit model for improved word recognition. The model is obtained by splitting off each subunit branch based on mixture component in continuous hidden Markov model(continuous HMM). According to simulation results, the proposed model yields higher recognition rate than the single-branch subunit model or the parallel-branch subunit model proposed by Rabiner et al[1]. We show that a proper combination of the number of mixture components and the number of branches for each subunit results in increased recognition rate. To study the recognition performance of the proposed algorithms, the speech material used in this work was a vocabulary with 1036 Korean words.
PDF

Face Recognition Using Wavelet Coefficients and Hidden Markov Model (웨이블렛 계수와 Hidden Markov Model을 이용한 얼굴인식 기법)

Lee, Kyung-Ah;Lee, Dae-Jong;Park, Jang-Hwan;Chun, Myung-Geun
- Journal of the Korean Institute of Intelligent Systems
- /
- v.13 no.6
- /
- pp.673-678
- /
- 2003
In this paper, we proposes a method for face recognition using HMM(hidden Markov model) and wavelet coefficients First, input images are compressed by using the multi-resolution analysis based on the discrete wavelet transform. And then, the wavelet coefficients obtained from each subband are used as feature vectors to construct the HMMs. In the recognition stage, we obtained higher recognition rate by summing of each recognition rate of wavelet subband. The usefulness of the proposed method was shown by comparing with conventional VQ and DCT-HMM ones. The experimental results show that the proposed method is more satisfactory than previous ones.
https://doi.org/10.5391/JKIIS.2003.13.6.673 인용 PDF KSCI

The Effect of Membership Concentration in FVQ/HMM for Speaker-Independent Speech Recognition

Lee, Chang-Young;Nam, Ho-Soo;Jung, Hyun-Seok;Lee, Chai-Bong
- Speech Sciences
- /
- v.12 no.4
- /
- pp.7-16
- /
- 2005
We investigate the effect of membership concentration on the performance of the speaker-independent recognition system by FVQ/HMM. For the membership function, we adopt the result obtained from the objective function approach by Bezdek. Membership concentration is done by varying the exponent in the membership function. The number of selected clusters is constrained to two for the sake of cheap computational cost. Experimental results showed that the recognition rate has its maximum value when the membership function was taken to be inversely proportional to the distance of the input vector from the cluster centroid. When the membership concentration was two weak or too strong, the performance was found to be relatively poor as expected. Except these extreme cases, the membership concentration was not shown to affect the recognition rate significantly. This is in accordance with the general observation that the fuzzy system is not much sensitive. to the detailed shape of the membership function as long as it is overlapped over multiple classes.
PDF

Decision Tree State Tying Modeling Using Parameter Estimation of Bayesian Method (Bayesian 기법의 모수 추정을 이용한 결정트리 상태 공유 모델링)

Oh, SangYeob
- Journal of Digital Convergence
- /
- v.13 no.1
- /
- pp.243-248
- /
- 2015
Recognition model is not defined when you configure a model, Been added to the model after model building awareness, Model a model of the clustering due to lack of recognition models are generated by modeling is causes the degradation of the recognition rate. In order to improve decision tree state tying modeling using parameter estimation of Bayesian method. The parameter estimation method is proposed Bayesian method to navigate through the model from the results of the decision tree based on the tying state according to the maximum probability method to determine the recognition model. According to our experiments on the simulation data generated by adding noise to clean speech, the proposed clustering method error rate reduction of 1.29% compared with baseline model, which is slightly better performance than the existing approach.
https://doi.org/10.14400/JDC.2015.13.1.243 인용 PDF KSCI

Isolated-Word Speech Recognition in Telephone Environment Using Perceptual Auditory Characteristic (인지적 청각 특성을 이용한 고립 단어 전화 음성 인식)

Choi, Hyung-Ki;Park, Ki-Young;Kim, Chong-Kyo
- Journal of the Institute of Electronics Engineers of Korea TE
- /
- v.39 no.2
- /
- pp.60-65
- /
- 2002
In this paper, we propose GFCC(gammatone filter frequency cepstrum coefficient) parameter which was based on the auditory characteristic for accomplishing better speech recognition rate. And it is performed the experiment of speech recognition for isolated word acquired from telephone network. For the purpose of comparing GFCC parameter with other parameter, the experiment of speech recognition are carried out using MFCC and LPCC parameter. Also, for each parameter, we are implemented CMS(cepstral mean subtraction)which was applied or not in order to compensate channel distortion in telephone network. Accordingly, we found that the recognition rate using GFCC parameter is better than other parameter in the experimental result.
PDF KSCI

Improvement Method of Recognition Rate Using Brightness Control of Vehicle License Plate (차량 번호판 밝기 제어를 이용한 인식률 개선 방안)

Lee, Kwang Ok;Bae, Sang Hyun
- Smart Media Journal
- /
- v.6 no.3
- /
- pp.57-63
- /
- 2017
The most important, essential prerequisite for the improvement of vehicle license plate recognition is the acquisition of high-quality vehicle images. Because typical images acquired from roads are affected by different environmental factors including the time of day, sunlight, and the weather, the brightness and the shape of the license plates in the images are inconsistent. To this end, many image corrections are performed, resulting in slower recognition and lower recognition rate. Therefore, in this study, we used the images acquired from roads to test the proposed method for fast capturing of vivid, high-quality vehicle images by measuring the brightness around license plates during real-time image capturing to control in real time the factors, such as shutter speed, brightness, and gain of the camera, that affect the brightness and the quality of the images.
PDF KSCI

Machine-printed Numeral Recognition using Weighted Template Matching (가중 원형 정합을 이용한 인쇄체 숫자 인식)

Jung, Min-Chul
- Journal of the Korea Academia-Industrial cooperation Society
- /
- v.10 no.3
- /
- pp.554-559
- /
- 2009
This paper proposes a new method of weighted template matching fur machine-printed numeral recognition. The proposed weighted template matching, which emphasizes the feature of a pattern using adaptive Hamming distance on local feature areas, improves the recognition rate while template matching processes an input image as one global feature. The experiment compares confusion matrices of the template matching, error back propagation neural network classifier, and the proposed weighted template matching respectively. The result shows that the proposed method improves fairly the recognition rate of the machine-printed numerals.
https://doi.org/10.5762/KAIS.2009.10.3.554 인용 PDF

Search Result 2,809, Processing Time 0.028 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)