Search | Korea Science

Speaker-Adaptive Speech Synthesis based on Fuzzy Vector Quantizer Mapping and Neural Networks (퍼지 벡터 양자화기 사상화와 신경망에 의한 화자적응 음성합성)

Lee, Jin-Yi;Lee, Gwang-Hyeong
- The Transactions of the Korea Information Processing Society
- /
- v.4 no.1
- /
- pp.149-160
- /
- 1997
This paper is concerned with the problem of speaker-adaptive speech synthes is method using a mapped codebook designed by fuzzy mapping on FLVQ (Fuzzy Learning Vector Quantization). The FLVQ is used to design both input and reference speaker's codebook. This algorithm is incorporated fuzzy membership function into the LVQ(learning vector quantization) networks. Unlike the LVQ algorithm, this algorithm minimizes the network output errors which are the differences of clas s membership target and actual membership values, and results to minimize the distances between training patterns and competing neurons. Speaker Adaptation in speech synthesis is performed as follow;input speaker's codebook is mapped a reference speaker's codebook in fuzzy concepts. The Fuzzy VQ mapping replaces a codevector preserving its fuzzy membership function. The codevector correspondence histogram is obtained by accumulating the vector correspondence along the DTW optimal path. We use the Fuzzy VQ mapping to design a mapped codebook. The mapped codebook is defined as a linear combination of reference speaker's vectors using each fuzzy histogram as a weighting function with membership values. In adaptive-speech synthesis stage, input speech is fuzzy vector-quantized by the mapped codcbook, and then FCM arithmetic is used to synthesize speech adapted to input speaker. The speaker adaption experiments are carried out using speech of males in their thirties as input speaker's speech, and a female in her twenties as reference speaker's speech. Speeches used in experiments are sentences /anyoung hasim nika/ and /good morning/. As a results of experiments, we obtained a synthesized speech adapted to input speaker.
PDF

Fuzzy Neural Network Model Using Asymmetric Fuzzy Learning Rates (비대칭 퍼지 학습률을 이용한 퍼지 신경회로망 모델)

Kim Yong-Soo
- Journal of the Korean Institute of Intelligent Systems
- /
- v.15 no.7
- /
- pp.800-804
- /
- 2005
This paper presents a fuzzy learning rule which is the fuzzified version of LVQ(Learning Vector Quantization). This fuzzy learning rule 3 uses fuzzy learning rates. instead of the traditional learning rates. LVQ uses the same learning rate regardless of correctness of classification. But, the new fuzzy learning rule uses the different learning rates depending on whether classification is correct or not. The new fuzzy learning rule is integrated into the improved IAFC(Integrated Adaptive Fuzzy Clustering) neural network. The improved IAFC neural network is both stable and plastic. The iris data set is used to compare the performance of the supervised IAFC neural network 3 with the performance of backprogation neural network. The results show that the supervised IAFC neural network 3 is better than backpropagation neural network.
https://doi.org/10.5391/JKIIS.2005.15.7.800 인용 PDF KSCI

Speaker-Adaptive Speech Synthesis by Fuzzy Vector Quantization Mapping (FVQ(Fuzzy Vector Quantization) 사상화에 의한 화자적응 음성합성)

이진이;이광형
- Journal of the Korean Institute of Intelligent Systems
- /
- v.3 no.4
- /
- pp.3-20
- /
- 1993
본 연구에서는 퍼지사상화(fuzzy mapping)에 의한 사상된(mapped) 코드북을 사용하는 화자적은 음성합성 알고리즘을 제안한다. 입력화자와 기준화자의 코드북은 신경망 클러스터링 알고리즘인 자율경쟁 학습을 사용하여 작성된다. 사상된 코드북은 입력 음성벡터에 대한 두 화자의 대응 코드벡터의 소속갑(membership value)으로 퍼지 히스토그랩을 작성하여 이들을 1차 결합함으로써 얻어지는 퍼지사상화에 의하여 작성된다. 음성합성시에는 사상된 코드북을 사용하여 입력화자의 음것을 퍼지 벡터양자화한 다음, CFM 연산으로 합성함으로써 입력화자에 적응된 합성음을 얻는다. 실험에서 여러 입력화자로 30대의 남성, 20대의 여성음을 사용하였고 기준음석으로 입력음성과는 다른 20대의 여성음성을 사용하였다.실험에 사용된 음성데이타는 문장/안녕하십니까/와/굿모닝/이다. 실험결과는 각각의 입력화자에 기준화자 음성이 적응된 합성음을 얻었다.
PDF

Vector Quantization for Medical Image Compression Based on DCT and Fuzzy C-Means

Supot, Sookpotharom;Nopparat, Rantsaena;Surapan, Airphaiboon;Manas, Sangworasil
- Proceedings of the IEEK Conference
- /
- 2002.07a
- /
- pp.285-288
- /
- 2002
Compression of magnetic resonance images (MRI) has proved to be more difficult than other medical imaging modalities. In an average sized hospital, many tora bytes of digital imaging data (MRI) are generated every year, almost all of which has to be kept. The medical image compression is currently being performed by using different algorithms. In this paper, Fuzzy C-Means (FCM) algorithm is used for the Vector Quantization (VQ). First, a digital image is divided into subblocks of fixed size, which consists of 4${\times}$4 blocks of pixels. By performing 2-D Discrete Cosine Transform (DCT), we select six DCT coefficients to form the feature vector. And using FCM algorithm in constructing the VQ codebook. By doing so, the algorithm can make good time quality, and reduce the processing time while constructing the VQ codebook.
PDF

HMM-based Speech Recognition using FSVQ and Fuzzy Concept (FSVQ와 퍼지 개념을 이용한 HMM에 기초를 둔 음성 인식)

안태옥
- Journal of the Institute of Electronics Engineers of Korea SP
- /
- v.40 no.6
- /
- pp.90-97
- /
- 2003
This paper proposes a speech recognition based on HMM(Hidden Markov Model) using FSVQ(First Section Vector Quantization) and fuzzy concept. In the proposed paper, we generate codebook of First Section, and then obtain multi-observation sequences by order of large propabilistic values based on fuzzy rule from the codebook of the first section. Thereafter, this observation sequences of first section from codebooks is trained and in case of recognition, a word that has the most highest probability of first section is selected as a recognized word by same concept. Train station names are selected as the target recognition vocabulary and LPC cepstrum coefficients are used as the feature parameters. Besides the speech recognition experiments of proposed method, we experiment the other methods under same conditions and data. Through the experiment results, it is proved that the proposed method based on HMM using FSVQ and fuzzy concept is superior to tile others in recognition rate.
PDF KSCI

Fuzzy Quantization and Rate Control for Very Low Bitrate Video Coder (초저전송율 동영상 부호기를 위한 퍼지 양자화 및 율 제어에 관한 연구)

양근호
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.7 no.8
- /
- pp.1684-1690
- /
- 2003
In this paper, we proposed a fuzzy controller for the evaluation of the quantization Parameters in the H.263 coder to optimize the subjective quality of each coded frame, keeping the transmission rate constant. We adopted the Mamdani method for fuzzification and the centroid method for defuzzification. The energy and entropy are correlated to features of the HVS in spatial domain, while motion vectors are used to estimate the temporal characteristics of the signal. And then, the fuzzy inputs adapted the variance and the entropy in spatial domain, and the motion vector in temporal domain. We induced the fuzzy membership function and decided the fuzzy relevance to be compatible in visual characteristics. And then, we designed FAM banks. The fuzzy technology has been applied to a practical video compression. This results is obtained an effective rate control technique, an optimum bit allocation and a high subjective quality using fuzzy quantization.
PDF KSCI

Pattern Classification Model using LVQ Optimized by Fuzzy Membership Function (퍼지 멤버쉽 함수로 최적화된 LVQ를 이용한 패턴 분류 모델)

Kim, Do-Tlyeon;Kang, Min-Kyeong;Cha, Eui-Young
- Journal of KIISE:Software and Applications
- /
- v.29 no.8
- /
- pp.573-583
- /
- 2002
Pattern recognition process is made up of the feature extraction in the pre-processing, the pattern clustering by training and the recognition process. This paper presents the F-LVQ (Fuzzy Learning Vector Quantization) pattern classification model which is optimized by the fuzzy membership function for the OCR(Optical Character Recognition) system. We trained 220 numeric patterns of 22 Hangul and English fonts and tested 4840 patterns whose forms are changed variously. As a result of this experiment, it is proved that the proposed model is more effective and robust than other typical LVQ models.
PDF KSCI

Speech Recognition Based on VQ/NN using Fuzzy (Fuzzy를 이용한 VQ/NN에 기초를 둔 음성 인식)

Ann, Tae-Ock
- The Journal of the Acoustical Society of Korea
- /
- v.15 no.6
- /
- pp.5-11
- /
- 1996
This paper is the study for recognizing single vowels of speaker-independent, and we suppose a method of speech recognition using VQ(Vector Quantization)/NN(Neural Network). This method makes a VQ codebook, which is used for obtaining the observation sequence, and then claculates the probability value by comparing each codeword with the data, finally uses these probability values for the input value of the neural network. Korean signle vowels are selected for our recognition experiment, and ten male speakers pronounced eight single vowels ten times. We compare the performance of our method with those of fuzzy VQ/HMM and conventional VQ/NN According to the experiment result, the recognition rate by VQ/NN is 92.3%, by VQ/HMM using fuzzy is 93.8% and by VQ/NN using fuzzy is 95.7%. Therefore, it is shown that recognition rate of speech recognition by fuzzy VQ/NN is better than those of fuzzy VQ/HMM and conventional VQ/HMM because of its excellent learning ability.
PDF

Robust Control of Variable Hydraulic System using Multiple Fuzzy Rules (다수의 퍼지규칙을 이용한 가변유압시스템의 강건제어)

양경춘;안경관;이수한
- 제어로봇시스템학회:학술대회논문집
- /
- 2000.10a
- /
- pp.134-134
- /
- 2000
A switching control using multiple gains in the fuzzy rule is newly proposed for an abruptly changing hydraulic servo system. The proposed scheme employs fuzzy PID control, where modified input parameters are used, and LVQNN(Learning Vector Quantization Neural Network) as a switching controller (supervisor). Simulation and experimental studies have been carried out to validate and illustrate the proposed controller.
PDF

Fuzzy Learning Rule Using the Distance between Datum and the Centroids of Clusters (데이터와 클러스터들의 대표값들 사이의 거리를 이용한 퍼지학습법칙)

Kim, Yong-Soo
- Journal of the Korean Institute of Intelligent Systems
- /
- v.17 no.4
- /
- pp.472-476
- /
- 2007
Learning rule affects importantly the performance of neural network. This paper proposes a new fuzzy learning rule that uses the learning rate considering the distance between the input vector and the prototypes of classes. When the learning rule updates the prototypes of classes, this consideration reduces the effect of outlier on the prototypes of classes. This comes from making the effect of the input vector, which locates near the decision boundary, larger than an outlier. Therefore, it can prevents an outlier from deteriorating the decision boundary. This new fuzzy learning rule is integrated into IAFC(Integrated Adaptive Fuzzy Clustering) fuzzy neural network. Iris data set is used to compare the performance of the proposed fuzzy neural network with those of other supervised neural networks. The results show that the proposed fuzzy neural network is better than other supervised neural networks.
https://doi.org/10.5391/JKIIS.2007.17.4.472 인용 PDF KSCI

Search Result 47, Processing Time 0.023 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)