Search | Korea Science

Speech Recognition Using MSVQ/TDRNN (MSVQ/TDRNN을 이용한 음성인식)

Kim, Sung-Suk
- The Journal of the Acoustical Society of Korea
- /
- v.33 no.4
- /
- pp.268-272
- /
- 2014
This paper presents a method for speech recognition using multi-section vector-quantization (MSVQ) and time-delay recurrent neural network (TDTNN). The MSVQ generates the codebook with normalized uniform sections of voice signal, and the TDRNN performs the speech recognition using the MSVQ codebook. The TDRNN is a time-delay recurrent neural network classifier with two different representations of dynamic context: the time-delayed input nodes represent local dynamic context, while the recursive nodes are able to represent long-term dynamic context of voice signal. The cepstral PLP coefficients were used as speech features. In the speech recognition experiments, the MSVQ/TDRNN speech recognizer shows 97.9 % word recognition rate for speaker independent recognition.
https://doi.org/10.7776/ASK.2014.33.4.268 인용 PDF KSCI

A Study on Speech Recognition by One Stage MSVQ/DP (One stage MSVQ/DP를 이용한 음성 인식에 관한연구)

Jeoung, Eui-Bung
- The Journal of the Acoustical Society of Korea
- /
- v.13 no.2
- /
- pp.5-12
- /
- 1994
This paper proposes One Stage MSVQ/DP method for word recognition system university administration branch names are selected for the recognition experiment and 10 LPC cepstrum coefficients is used as the feature parameter. Besides the speech recognition experiments by proposed method, for comparision with it, we perform the experiments on the same data by Level Building DTW and One Stage DP method. The Recognition rates with the LBDTW and the One Stage method are $83.3\%$ and $87.5\%$, but the recognition rate with the proposed method is $91.6\%$.
PDF

Discrimination Between Natural and Artificial Seismic Sounds by Using 20 MSVQ Algorithm (20 MSVQ 알고리즘을 이용한 자연 및 인공 지진음 식별)

Yoon, Sang-Hoon;Song, Young-Hwan;Bae, Myung-Jin
- The Journal of the Acoustical Society of Korea
- /
- v.28 no.3
- /
- pp.251-259
- /
- 2009
This paper proposes an identification technique to discriminate natural and artificial seismic sounds by using the 20 MSVQ algorithm with the data measured by using a hydrophone. Spectrum band energy and MFCC were used as representative parameters for sake of discriminating natural and artificial seismic sounds, and the orders of characterized parameters were determined through experiments. As a result of using 20 MSVQ algorithm with the 2 characterized parameters, MFCC had 99.9% and the spectrum energy parameter had 83.9% percent of success. It was verified that it is extremely accurate when seismic sounds were discriminated by using the method suggested by this paper.
https://doi.org/10.7776/ASK.2009.28.3.251 인용 PDF KSCI

A Study on Isolated Word Recognition using Improved Multisection Vector Quantization Recognition System (개선된 MSVQ 인식 시스템을 이용한 단독어 인식에 관한 연구)

An, Tae-Ok;Kim, Nam-Joong;Song, Chul;Kim, Soon-Hyeob
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.16 no.2
- /
- pp.196-205
- /
- 1991
This paper is a study on the isolated word recognition of speaker independent which proposes to newly improved MSVQ(multisection vector quantization) recognition system which improve the classical MSVQ recognition system. It is a difference that test pattern has on more section than reference pattern in recognition system 146 DDD area names are selected as recognition vocabulary. 12th LPC cepstral coefficients is used as feature parameter. and when codebook is generated, MINSUM and MINMAX are used in finding the centroid. According to the experiment result. it is proved that this method is better than VQ(vector quantization) recognition methods, DTW(dynamic time warping) pattern matching methods and classical MSVQ methods for recognition rate and recognition time.
PDF

Speech Recognition using MSHMM based on Fuzzy Concept

Ann, Tae-Ock
- The Journal of the Acoustical Society of Korea
- /
- v.16 no.2E
- /
- pp.55-61
- /
- 1997
This paper proposes a MSHMM(Multi-Section Hidden Markov Model) recognition method based on Fuzzy Concept, as a method on the speech recognition of speaker-independent. In this recognition method, training data are divided into several section and multi-observation sequences given proper probabilities by fuzzy rule according to order of short distance from MSVQ codebook per each section are obtained. Thereafter, the HMM per each section using this multi-observation sequences is generated, and in case of recognition, a word that has the most highest probability is selected as a recognized word. In this paper, other experiments to compare with the results of these experiments are implemented by the various conventional recognition methods(DP, MSVQ, DMS, general HMM) under the same data. Through results of all-round experiment, it is proved that the proposed MSHMM based on fuzzy concept is superior to DP method, MSVQ method, DMS model and general HMM model in recognition rate and computational time, and does not decreases recognition rate as 92.91% in spite of increment of speaker number.
PDF

A study on the speech recognition by HMM based on multi-observation sequence (다중 관측열을 토대로한 HMM에 의한 음성 인식에 관한 연구)

정의봉
- Journal of the Korean Institute of Telematics and Electronics S
- /
- v.34S no.4
- /
- pp.57-65
- /
- 1997
The purpose of this paper is to propose the HMM (hidden markov model) based on multi-observation sequence for the isolated word recognition. The proosed model generates the codebook of MSVQ by dividing each word into several sections followed by dividing training data into several sections. Then, we are to obtain the sequential value of multi-observation per each section by weighting the vectors of distance form lower values to higher ones. Thereafter, this the sequential with high probability value while in recognition. 146 DDD area names are selected as the vocabularies for the target recognition, and 10LPC cepstrum coefficients are used as the feature parameters. Besides the speech recognition experiments by way of the proposed model, for the comparison with it, the experiments by DP, MSVQ, and genral HMM are made with the same data under the same condition. The experiment results have shown that HMM based on multi-observation sequence proposed in this paper is proved superior to any other methods such as the ones using DP, MSVQ and general HMM models in recognition rate and time.
PDF

Comparison of Vector Quantization for Image Coding (영상 코딩을 위한 벡터 양자화 방법의 성능 비교)

박광훈;박용철;차일환;윤대희
- Proceedings of the Korean Institute of Communication Sciences Conference
- /
- 1987.04a
- /
- pp.35-38
- /
- 1987
The purpose of this paper is to compare a class of vector quantization techniques which include GVQ(Genera VQ) MSVQ(Mean separated VQ) and DCT_VQ The VQ techniques are applied to six images and both subjective and objective performance comparison are made The results indicate that the transform domain approach(DCT_VQ) yields more syable results than the spatial domain method (GVQ, MSVQ)
PDF

Isolated Word Recognition By HMM using Multisection MSVQ (MSVQ를 이용한 HMM에 의한 단독어 인식)

안태옥;변용규;김순협
- Journal of the Korean Institute of Telematics and Electronics
- /
- v.27 no.9
- /
- pp.1468-1475
- /
- 1990
In this paper, isolated words are recognized using multisection VQ and HMM. As recognition vocabuaries, 20 area-name which is uttered 5 times by 3 speakers is selected. In generating codebook, we devide recognition vocabulary into equal length, section, and make standard VQ codebook to each section and calculate observation by section and than recognize isolated words by HMM training. Multisection VQ codebook has time information and as observation is calculated by eacy section, computation is lesser and recongnition rate is higher than by whole codword. As a result, it is proved that recognition rate is higher in case of HMM using multisection VQ codebook.
PDF

Speech Recognition Using HMM Based on Fuzzy (피지에 기초를 둔 HMM을 이용한 음성 인식)

안태옥;김순협
- Journal of the Korean Institute of Telematics and Electronics B
- /
- v.28B no.12
- /
- pp.68-74
- /
- 1991
This paper proposes a HMM model based on fuzzy, as a method on the speech recognition of speaker-independent. In this recognition method, multi-observation sequences which give proper probabilities by fuzzy rule according to order of short distance from VQ codebook are obtained. Thereafter, the HMM model using this multi-observation sequences is generated, and in case of recognition, a word that has the most highest probability is selected as a recognized word. The vocabularies for recognition experiment are 146 DDD are names, and the feature parameter is 10S0thT LPC cepstrum coefficients. Besides the speech recognition experiments of proposed model, for comparison with it, we perform the experiments by DP, MSVQ and general HMM under same condition and data. Through the experiment results, it is proved that HMM model using fuzzy proposed in this paper is superior to DP method, MSVQ and general HMM model in recognition rate and computational time.
PDF

A Study on the Isolated word Recognition Using One-Stage DMS/DP for the Implementation of Voice Dialing System

Seong-Kwon Lee
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1994.06a
- /
- pp.1039-1045
- /
- 1994
The speech recognition systems using VQ have usually the problem decreasing recognition rate, MSVQ assigning the dissimilar vectors to a segment. In this paper, applying One-stage DMS/DP algorithm to the recognition experiments, we can solve these problems to what degree. Recognition experiment is peformed for Korean DDD area names with DMS model of 20 sections and word unit template. We carried out the experiment in speaker dependent and speaker independent, and get a recognition rates of 97.7% and 81.7% respectively.
PDF

Search Result 13, Processing Time 0.021 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)