Search | Korea Science

Control of Duration Model Parameters in HMM-based Korean Speech Synthesis (HMM 기반의 한국어 음성합성에서 지속시간 모델 파라미터 제어)

Kim, Il-Hwan;Bae, Keun-Sung
- Speech Sciences
- /
- v.15 no.4
- /
- pp.97-105
- /
- 2008
Nowadays an HMM-based text-to-speech system (HTS) has been very widely studied because it needs less memory and low computation complexity and is suitable for embedded systems in comparison with a corpus-based unit concatenation text-to-speech one. It also has the advantage that voice characteristics and the speaking rate of the synthetic speech can be converted easily by modifying HMM parameters appropriately. We implemented an HMM-based Korean text-to-speech system using a small size Korean speech DB and proposes a method to increase the naturalness of the synthetic speech by controlling duration model parameters in the HMM-based Korean text-to speech system. We performed a paired comparison test to verify that theses techniques are effective. The test result with the preference scores of 73.8% has shown the improvement of the naturalness of the synthetic speech through controlling the duration model parameters.
PDF

An EMG Signals Discrimination Using Hybrid HMM and MLP Classifier for Prosthetic Arm Control Purpose (의수 제어를 위한 HMM-MLP 근전도 신호 인식 기법)

권장우;홍승홍
- Journal of Biomedical Engineering Research
- /
- v.17 no.3
- /
- pp.379-386
- /
- 1996
This paper describes an approach for classifying myoelectric patterns using a multilayer perceptrons (MLP's) and hidden Markov models (HMM's) hybrid classifier. The dynamic aspects of EMG are important for tasks such as continuous prosthetic control or vari- ous time length EMG signal recognition, which have not been successfully mastered by the most neural approaches. It is known that the hidden Markov model (HMM) is suitable for modeling temporal patterns. In contrasts the multilayer feedforward networks are suitable for static patterns. Ank a lot of investigators have shown that the HMM's to be an excellent tool for handling the dynamical problems. Considering these facts, we suggest the combination of MLP and HMM algorithms that might lead to further improved EMG recognition systems.
PDF

Speech Recognition Using HMM Based on Fuzzy (피지에 기초를 둔 HMM을 이용한 음성 인식)

안태옥;김순협
- Journal of the Korean Institute of Telematics and Electronics B
- /
- v.28B no.12
- /
- pp.68-74
- /
- 1991
This paper proposes a HMM model based on fuzzy, as a method on the speech recognition of speaker-independent. In this recognition method, multi-observation sequences which give proper probabilities by fuzzy rule according to order of short distance from VQ codebook are obtained. Thereafter, the HMM model using this multi-observation sequences is generated, and in case of recognition, a word that has the most highest probability is selected as a recognized word. The vocabularies for recognition experiment are 146 DDD are names, and the feature parameter is 10S0thT LPC cepstrum coefficients. Besides the speech recognition experiments of proposed model, for comparison with it, we perform the experiments by DP, MSVQ and general HMM under same condition and data. Through the experiment results, it is proved that HMM model using fuzzy proposed in this paper is superior to DP method, MSVQ and general HMM model in recognition rate and computational time.
PDF

Verification of Graphemes Using Neural Networks in HMM Based On-line Koran Handwriting Recognition (인공신경망을 이용한 HMM 기반 온라인 한글인식 시스템의 자모 검증)

Cho, Sung-Jung;Kim, Ja-Hwan;Kim, Jin-Hyung
- Proceedings of the Korea Information Processing Society Conference
- /
- 2000.04a
- /
- pp.890-895
- /
- 2000
본 논문에서는 인공신경망을 이용한 자모 검증을 HMM 기반 온라인 한글인식 시스템에 적용하는 방법론을 제시한다. 본 시스템에서 각각의 자모는 한 개의 HMM 모델과 한 개의 인공신경망 검증기를 갖는다. 자모 검증기는 HMM 네트웍이 생성한 자모 후보 가정을 입력으로 받은 후, 이 가정의 타당성에 대한 사후 확률을 출력한다. 이 사후 확률은 Viterbi 탐색시 탐색 경로에 반영된다. 기존 HMM 시스템의 국소적 특징의 한계를 보완하기 위하여, 한글 자모의 기본획 분석에서 얻어진 구조적, 전역적 특징이 자모 검증기에 사용되었다. 한글 낱자인식에 대한 실험 결과 HMM 기반 인식기에 자모 검증기를 도입함으로서 38.5%의 인식 오류를 줄일 수 있었다.
PDF

Segmental Corrective Training for HMM Parameter Estimation in Speech Recognition (음성인식 시스템의 HMM 파라메터 추정을 위한 분절단위 교정 학습)

김회린;이황수
- The Journal of the Acoustical Society of Korea
- /
- v.12 no.2E
- /
- pp.5-11
- /
- 1993
본 논문에서 HMM 파라메터 추정을 위해 분절단위 정보를 이용하는 수정된 교정학습방법을 제안한다. 수정된 교정학습방법은 기존의 교정학습 방법에서 사용하는 전향·후향 알고리즘 대신에 분절단위 K-means 알고리즘을 사용하여 HMM 파라메터를 교정한다. 이 방식은 분절단위 K-means 알고리즘이 음성신호내의 공통의 통계적 특성을 가지는 상태단위 정보를 강조한다는 사실을 이용하였다. 화자종속 음소 및 단어인식 실험에서 제안된 알고리즘이 기존의 교정학습 방법보다 적은 계산량으로도 향상된 인식률을 보여주었다. 이것은 HMM 교정학습에서 상태다누이 정보가 중요함을 보여준다.
PDF

A Study of Phoneme Modeling for Improvement of Automatic Segmentation Performance (자동 음소 분할 성능 개선을 위한 음소 모델링에 관한 연구)

Park Hae Young;Kim Hyung Soon
- Proceedings of the Acoustical Society of Korea Conference
- /
- spring
- /
- pp.175-178
- /
- 2002
본 논문에서는 Hidden Markov Model(HMM)을 이용하여 corpus 기반 TTS에 사용할 DB를 자동 음소 분할 해주는 시스템을 구현하였다. HMM을 이용해서 음소 분할 할 경우 HMM을 모델링 하는 방법에 따라 많은 성능의 차이가 난다. 따라서 본 논문에서는 HMM 모델링 방법에 따른 몇 가지 실험 및 성능 평가를 하였다. 실험 결과 음성 인식과는 달리 HMM모델링 시 triphone 모델보다 monophone 모델의 성능이 더 우수하였으며, 에너지 기반의 후처리를 통해 성능 향상을 얻을 수 있었다.
PDF

The Performance Improvement of Speech recognition system using Hierarchical Classification Method (대분류기법을 이용한 음성인식 시스템의 속도향상)

전화성;김길연;윤영선;오영환
- Proceedings of the Korean Information Science Society Conference
- /
- 2000.10b
- /
- pp.476-478
- /
- 2000
본 논문에서는 HMM 학습모델을 이용하여 1445단어 음성인식기를 구현하고, 대분류기법을 이용하여 그 성능을 향상시키는 방법에 대하여 연구를 수행하였으며, 속도개선에 중점을 두었다. 속도개선을 위해서 HMM모델에 계층적 대분류 기법을 적용시켰다. HMM의 상태수가 많을수록 속도가 저하된다는 점을 고려하여, 적은 상태수의 HMM모델로 후보를 정하고, 가변적으로 해당하는 상태수의 HMM모델로 목적단어를 인식하는 방법을 제안하였다. 후보를 정하는 방법을 후보수와 특징파라미터의 종류와 수를 고려하여 다양하게 설정, 실험하여 가장 이상적인 경우를 찾아내었다.
PDF

A study on recognition improvement of velopharyngeal insufficiency patient's speech using various types of deep neural network (심층신경망 구조에 따른 구개인두부전증 환자 음성 인식 향상 연구)

Kim, Min-seok;Jung, Jae-hee;Jung, Bo-kyung;Yoon, Ki-mu;Bae, Ara;Kim, Wooil
- The Journal of the Acoustical Society of Korea
- /
- v.38 no.6
- /
- pp.703-709
- /
- 2019
This paper proposes speech recognition systems employing Convolutional Neural Network (CNN) and Long Short Term Memory (LSTM) structures combined with Hidden Markov Moldel (HMM) to effectively recognize the speech of VeloPharyngeal Insufficiency (VPI) patients, and compares the recognition performance of the systems to the Gaussian Mixture Model (GMM-HMM) and fully-connected Deep Neural Network (DNNHMM) based speech recognition systems. In this paper, the initial model is trained using normal speakers' speech and simulated VPI speech is used for generating a prior model for speaker adaptation. For VPI speaker adaptation, selected layers are trained in the CNN-HMM based model, and dropout regulatory technique is applied in the LSTM-HMM based model, showing 3.68 % improvement in recognition accuracy. The experimental results demonstrate that the proposed LSTM-HMM-based speech recognition system is effective for VPI speech with small-sized speech data, compared to conventional GMM-HMM and fully-connected DNN-HMM system.
https://doi.org/10.7776/ASK.2019.38.6.703 인용 PDF KSCI

An Efficient Approach for Noise Robust Speech Recognition by Using the Deterministic Noise Model (결정적 잡음 모델을 이용한 효율적인 잡음음성 인식 접근 방법)

정용주
- The Journal of the Acoustical Society of Korea
- /
- v.21 no.6
- /
- pp.559-565
- /
- 2002
In this paper, we proposed an efficient method that estimates the HMM (Hidden Marke Model) parameters of the noisy speech. In previous methods, noisy speech HMM parameters are usually obtained by analytical methods using the assumed noise statistics. However, as they assume some simplication in the methods, it is difficult to come closely to the real statistics for the noisy speech. Instead of using the simplication, we used some useful statistics from the clean speech HMMs and employed the deterministic noise model. We could find that the new scheme showed improved results with reduced computation cost.
PDF KSCI

The Decision Method of A Threshold in Sequence-based Anomaly Detection Sensor (순서기반 비정상행위 탐지 센서의 임계치 결정 방법)

Kim, Yong-Min;Kim, Min-Su;Kim, Hong-Geun;No, Bong-Nam
- The KIPS Transactions:PartC
- /
- v.8C no.5
- /
- pp.507-516
- /
- 2001
In this paper, we implement sequence-based anomaly detection sensor using SOM and HMM, and analyze what is important information in system call and how a threshold is decided. The new filtering and reduction rules of SOM reduces the input size of HMM. This gives real-time processing to HMM-based anomaly detection sensor. Also, we introduced an anomaly count into the sensor. Due to lessened sensibility, a user easily understand easily the detection information and false-positive was decreased. And the active coordination of the threshold value makes the detection sensor adapt according to the system condition.
PDF

Search Result 962, Processing Time 0.026 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)