• Title/Summary/Keyword: recognition-rate

Gaussian Model Optimization using Configuration Thread Control In CHMM Vocabulary Recognition (CHMM 어휘 인식에서 형상 형성 제어를 이용한 가우시안 모델 최적화)

  • Ahn, Chan-Shik;Oh, Sang-Yeob
    • Journal of Digital Convergence / v.10 no.7 / pp.167-172 / 2012
  • In vocabulary recognition, an HMM (Hidden Markov Model) that models observations with a discrete probability distribution has the advantage of low computational complexity, but it has the disadvantages of a relatively low recognition rate and the need for a sophisticated smoothing process. To improve on this, a CHMM (Continuous Hidden Markov Model) that uses Gaussian mixtures as a continuous probability density is proposed for optimizing the library system. This paper provides a model that optimizes the Gaussian mixture model in CHMM vocabulary recognition by means of system configuration thread control. Applying the proposed system yielded a recognition rate of 98.1% in vocabulary recognition.

A Study on the Isolated word Recognition Using One-Stage DMS/DP for the Implementation of Voice Dialing System

  • Seong-Kwon Lee
    • Proceedings of the Acoustical Society of Korea Conference / 1994.06a / pp.1039-1045 / 1994
  • Speech recognition systems using VQ usually suffer from a decreased recognition rate, and MSVQ from assigning dissimilar vectors to a segment. In this paper, we apply the One-stage DMS/DP algorithm in recognition experiments to examine to what degree it can solve these problems. The recognition experiment is performed on Korean DDD area names with a DMS model of 20 sections and word-unit templates. We carried out speaker-dependent and speaker-independent experiments and obtained recognition rates of 97.7% and 81.7%, respectively.

Noise Robust Automatic Speech Recognition Scheme with Histogram of Oriented Gradient Features

  • Park, Taejin;Beack, SeungKwan;Lee, Taejin
    • IEIE Transactions on Smart Processing and Computing / v.3 no.5 / pp.259-266 / 2014
  • In this paper, we propose a novel technique for noise-robust automatic speech recognition (ASR). The development of ASR techniques has made it possible to recognize isolated words with a near-perfect word recognition rate. However, in a highly noisy environment, a distinct mismatch between the trained speech and the test data results in a significantly degraded word recognition rate (WRA). Unlike conventional ASR systems employing Mel-frequency cepstral coefficients (MFCCs) and a hidden Markov model (HMM), this study applies histogram of oriented gradient (HOG) features and a support vector machine (SVM) to ASR tasks to overcome this problem. Our proposed ASR system is less vulnerable to external interference noise and achieves a higher WRA than a conventional ASR system equipped with MFCCs and an HMM. The performance of our proposed ASR system was evaluated using a phonetically balanced word (PBW) set mixed with artificially added noise.

Retrieve System for Performance support of Vocabulary Clustering Model In Continuous Vocabulary Recognition System (연속 어휘 인식 시스템에서 어휘 클러스터링 모델의 성능 지원을 위한 검색 시스템)

  • Oh, Sang Yeob
    • Journal of Digital Convergence / v.10 no.9 / pp.339-344 / 2012
  • An established continuous vocabulary recognition system improved its recognition rate by using a decision-tree-based tying modeling method. However, because the system model cannot support retrieval of phoneme data, it is hard to secure accuracy. To address this problem, we remodeled the system so that it can retrieve the probabilistic model from the continuous vocabulary clustering model at the phoneme unit. As a result, the system showed a recognition rate of 95.88%.

Recognition of Unconstrained Handwritten Numerals using Chaotic Neural Network (카오틱 신경망을 이용한 서체 숫자 인식)

  • 조재홍;성정원
    • Proceedings of the IEEK Conference / 1998.10a / pp.1301-1304 / 1998
  • Several neural networks have been successfully used to classify complex patterns such as handwritten numerals or words. This paper describes the discrimination of totally unconstrained handwritten numerals using the proposed chaotic neural network (CNN) to improve the recognition rate. The recognition system in the paper consists of a preprocessing stage that extracts features using Kirsch masks and a classification stage that recognizes numerals using the CNN. To evaluate the performance of the proposed network, we performed recognition on the unconstrained handwritten numeral database of Concordia University, Canada. Experimental results show that the CNN-based recognizer achieves a higher recognition rate than other neural-network-based methods reported on the same database.
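The abstract does not say how the Kirsch masks are applied, so the following is only a minimal sketch of the standard Kirsch compass operator, assuming a grayscale image given as a list of rows. The function names (`kirsch_kernels`, `kirsch_responses`, `kirsch_magnitude`) are hypothetical and are not taken from the paper.

```python
# Border positions of a 3x3 kernel in clockwise order, starting at the top-left.
RING = [(0, 0), (0, 1), (0, 2), (1, 2), (2, 2), (2, 1), (2, 0), (1, 0)]


def kirsch_kernels():
    """Return the 8 Kirsch compass kernels as 3x3 nested lists."""
    base = [5, 5, 5, -3, -3, -3, -3, -3]  # ring values of the "north" kernel
    kernels = []
    for shift in range(8):  # each shift rotates the ring by 45 degrees
        k = [[0] * 3 for _ in range(3)]
        for i, (r, c) in enumerate(RING):
            k[r][c] = base[(i - shift) % 8]
        kernels.append(k)
    return kernels


def kirsch_responses(image):
    """Return 8 directional response maps, each of size (H-2) x (W-2)."""
    h, w = len(image), len(image[0])
    maps = []
    for k in kirsch_kernels():
        out = [
            [
                sum(k[dr][dc] * image[r + dr][c + dc]
                    for dr in range(3) for dc in range(3))
                for c in range(w - 2)
            ]
            for r in range(h - 2)
        ]
        maps.append(out)
    return maps


def kirsch_magnitude(image):
    """Edge magnitude per interior pixel: the maximum over the 8 directions."""
    maps = kirsch_responses(image)
    rows, cols = len(maps[0]), len(maps[0][0])
    return [[max(m[r][c] for m in maps) for c in range(cols)]
            for r in range(rows)]
```

Each kernel sums to zero, so flat regions give a zero response, and the directional maps could serve as the kind of directional features a classifier consumes. How the paper turns the responses into feature vectors is not stated in the abstract.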

A Study on the Analysis and Recognition of Korean Speech Signal using the Phoneme (음소를 이용한 한국어 음성 신호의 분석과 인식에 관한 연구)

  • Kim Y. I.;Hwang Y. S.;Youn D. H.;Cha I. W.
    • The Journal of the Acoustical Society of Korea / v.8 no.5 / pp.70-77 / 1989
  • In this paper, Korean language recognition using phonemes is studied. The experiment is carried out by dividing 545 isolated words into phonemes. Using linear prediction coefficients, the recognition rates of consonants, vowels, and end-consonants are 87.3%, 91.0%, and 91.7%, respectively. The recognition rate of isolated words assembled from the phonemes is 71.4%. The Itakura-Saito distortion measure is used for phoneme segmentation and phoneme recognition.
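As a rough illustration of the distortion measure named above, here is a minimal sketch of a discrete Itakura-Saito distortion between two power spectra sampled at the same frequencies. The function name and the choice to average over frequency bins are assumptions. The paper works with linear prediction coefficients, and its exact formulation is not given in the abstract.

```python
import math


def itakura_saito(p, q):
    """Discrete Itakura-Saito distortion between power spectra p and q.

    p and q are equal-length sequences of strictly positive values.
    The result is the mean over bins of p/q - log(p/q) - 1.
    """
    if len(p) != len(q) or len(p) == 0:
        raise ValueError("spectra must be non-empty and of equal length")
    total = 0.0
    for pi, qi in zip(p, q):
        if pi <= 0 or qi <= 0:
            raise ValueError("power spectra must be strictly positive")
        ratio = pi / qi
        total += ratio - math.log(ratio) - 1.0
    return total / len(p)
```

The measure is zero when the spectra are identical and is not symmetric, so `itakura_saito(p, q)` generally differs from `itakura_saito(q, p)`.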

Face Recognition Based on Improved Fuzzy RBF Neural Network for Smart Device

  • Lee, Eung-Joo
    • Journal of Korea Multimedia Society / v.16 no.11 / pp.1338-1347 / 2013
  • Face recognition is the science of automatically identifying individuals based on their unique facial features. To avoid overfitting and reduce the computational burden, a new face recognition algorithm using PCA-Fisher linear discriminant (PCA-FLD) and a fuzzy radial basis function neural network (RBFNN) is proposed in this paper. First, face features are extracted by the principal component analysis (PCA) method. Then, the extracted features are further processed by Fisher's linear discriminant technique to acquire lower-dimensional discriminant patterns, and the processed features are used as the input of the fuzzy RBFNN. The BP learning algorithm, which is widely applied in fuzzy RBF neural networks, has a low rate of convergence. Therefore, an improved learning algorithm for the fuzzy RBF neural network based on Levenberg-Marquardt (L-M), which combines the gradient descent algorithm with the Gauss-Newton algorithm, is introduced in this paper. Experimental results on the ORL face database demonstrate that the proposed algorithm has satisfactory performance and a high recognition rate.
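For reference, the Levenberg-Marquardt weight update is commonly written as shown here. The notation is assumed for illustration and is not taken from the paper: $\mathbf{w}$ is the weight vector, $\mathbf{e}$ the vector of output errors, $\mathbf{J}$ the Jacobian of $\mathbf{e}$ with respect to $\mathbf{w}$, and $\mu > 0$ a damping factor.

$$\Delta\mathbf{w} = -\left(\mathbf{J}^{\top}\mathbf{J} + \mu\mathbf{I}\right)^{-1}\mathbf{J}^{\top}\mathbf{e}$$

As $\mu \to 0$ the step approaches the Gauss-Newton step $-(\mathbf{J}^{\top}\mathbf{J})^{-1}\mathbf{J}^{\top}\mathbf{e}$, and for large $\mu$ it approaches $-\frac{1}{\mu}\mathbf{J}^{\top}\mathbf{e}$, a gradient-descent step on $\frac{1}{2}\lVert\mathbf{e}\rVert^{2}$ with step size $1/\mu$. This is the sense in which the method combines the two algorithms.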

Object Recognition Using Neuro-Fuzzy Inference System (뉴로-퍼지 추론 시스템을 이용한 물체인식)

  • 김형근;최갑석
    • The Journal of Korean Institute of Communications and Information Sciences / v.17 no.5 / pp.482-494 / 1992
  • In this paper, a neuro-fuzzy inference system for effective object recognition is studied. The proposed neuro-fuzzy inference system combines the learning capability of neural networks with the inference process of fuzzy theory, and the system executes fuzzy inference automatically by neural network. The proposed system consists of the antecedent neural network, the consequent neural network, and the fuzzy operational part. To resolve the ambiguity of recognition due to input variance in the neuro-fuzzy inference system, the fuzzy propositions in the antecedents of the inference rules are automatically produced by the error back-propagation learning rule. Therefore, when fuzzy inference is performed, the shape of the membership functions is adaptively modified according to the variation. The antecedent neural network is constructed as a separate MNN (Model Classification Neural Network) and LNN (Line segment Classification Neural Network) to counter the degradation of the recognition rate. The antecedent neural network can overcome the limitation of the boundary decision characteristics of neural networks caused by the similarity of extracted features. The increased recognition rate is gained by the consequent neural network, which is designed to learn inference rules for effective system output.

Korean Speech Recognition using Dynamic Multisection Model (DMS 모델을 이용한 한국어 음성 인식)

  • 안태옥;변용규;김순협
    • Journal of the Korean Institute of Telematics and Electronics / v.27 no.12 / pp.1933-1939 / 1990
  • In this paper, we propose an algorithm for speaker-independent Korean speech recognition using DMS (Dynamic Multisection). The algorithm uses a backtracking method to obtain time information, and the DMS model is built from the feature vectors and time information that represent similar features in word patterns spoken over a continuous time domain. Each state of the model represents a time sequence and has time information and a feature vector. The typical feature vector of each state is determined so as to minimize the distance between word patterns. DDD area names are selected as the recognition vocabulary, and 12th-order LPC cepstrum coefficients are used as the feature parameters. The model states are formed as 8 sections, and a weight of 0.2 is used for the time information. In the experiments, the recognition rate of the DMS model is 94.8%, which is better than the recognition rate (89.3%) of the MSVQ (Multisection Vector Quantization) method.

Automatic Generation of Handwritten Hangul Character Images and Its Application to the Evaluation of Hangul Character Recognition Systems (변형에 의한 필기체 한글의 생성과 이를 이용한 한글 문자인식 시스템의 정량적 평가)

  • 박상태;방승양
    • Journal of the Korean Institute of Telematics and Electronics B / v.30B no.3 / pp.50-59 / 1993
  • There is a basic problem with the current evaluation method for character recognition systems. The current method evaluates the average recognition rate by applying test data to the target system. The average recognition rate tells us no more and no less than the overall performance, and it depends on the data. In this paper, we propose a testing method that analyzes the target system and points out its strong and weak points. This is made possible by using data generated by distorting standard character images in a carefully controlled manner. This paper describes how to automatically generate such distorted images. We also show that the method is effective and useful by applying it to the evaluation of existing recognition algorithms.
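The point about the average recognition rate can be illustrated with a small sketch that reports the overall rate next to a per-condition breakdown. The function name and the idea of tagging each test sample with a distortion label are assumptions made for illustration. The paper's actual distortion types and evaluation procedure are not given in the abstract.

```python
from collections import defaultdict


def recognition_rates(results):
    """Compute overall and per-condition recognition rates.

    results: iterable of (condition, is_correct) pairs, where condition is
    a label for how the test sample was distorted (hypothetical labels).
    Returns (overall_rate, {condition: rate}).
    """
    totals = defaultdict(int)
    hits = defaultdict(int)
    for condition, is_correct in results:
        totals[condition] += 1
        if is_correct:
            hits[condition] += 1
    n = sum(totals.values())
    if n == 0:
        raise ValueError("no results given")
    overall = sum(hits.values()) / n
    per_condition = {c: hits[c] / totals[c] for c in totals}
    return overall, per_condition
```

Two systems with the same overall rate can differ sharply on individual conditions, which is the kind of strength and weakness a single average hides.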
