Search | Korea Science

Speaker Recognition using LPC cepstrum Coefficients and Neural Network (LPC 켑스트럼 계수와 신경회로망을 사용한 화자인식)

Choi, Jae-Seung
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.15 no.12
- /
- pp.2521-2526
- /
- 2011
This paper proposes a speaker recognition algorithm using a perceptron neural network and LPC (Linear Predictive Coding) cepstrum coefficients. The proposed algorithm first detects the voiced sections at each frame. Then, the LPC cepstrum coefficients which have speaker characteristics are obtained by the linear predictive analysis for the detected voiced sections. To classify the obtained LPC cepstrum coefficients, a neural network is trained using the LPC cepstrum coefficients. In this experiment, the performance of the proposed algorithm was evaluated using the speech recognition rates based on the LPC cepstrum coefficients and the neural network.
https://doi.org/10.6109/jkiice.2011.15.12.2521 인용 PDF KSCI

Comparison of Characteristic Vector of Speech for Gender Recognition of Male and Female (남녀 성별인식을 위한 음성 특징벡터의 비교)

Jeong, Byeong-Goo;Choi, Jae-Seung
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.16 no.7
- /
- pp.1370-1376
- /
- 2012
This paper proposes a gender recognition algorithm which classifies a male or female speaker. In this paper, characteristic vectors for the male and female speaker are analyzed, and recognition experiments for the proposed gender recognition by a neural network are performed using these characteristic vectors for the male and female. Input characteristic vectors of the proposed neural network are 10 LPC (Linear Predictive Coding) cepstrum coefficients, 12 LPC cepstrum coefficients, 12 FFT (Fast Fourier Transform) cepstrum coefficients and 1 RMS (Root Mean Square), and 12 LPC cepstrum coefficients and 8 FFT spectrum. The proposed neural network trained by 20-20-2 network are especially used in this experiment, using 12 LPC cepstrum coefficients and 8 FFT spectrum. From the experiment results, the average recognition rates obtained by the gender recognition algorithm is 99.8% for the male speaker and 96.5% for the female speaker.
https://doi.org/10.6109/jkiice.2012.16.7.1370 인용 PDF KSCI

Implementation of a CELP coder based on optimum quantization of the LPC coefficients (LPC 계수의 최적 양자화에 기초한 음성 코더 구현)

Lee, W.J.;Park, J.T.;Chang, T.G.
- Proceedings of the KIEE Conference
- /
- 2001.07d
- /
- pp.2516-2518
- /
- 2001
The quantization of the LPC parameters is a very important aspect of the speech compression algorithm. This paper analyzes the quantization effect of the LPC coefficients and presents the implementation of a fixed-point CELP coder based on the LPC analysis.
PDF

Snorer-Dependent Snore Recognition Using LPC Cepstral Coefficients (LPC 켑스트럼 계수를 이용한 특정인의 코골이 인식)

최호선;장원규;이경중
- The Transactions of the Korean Institute of Electrical Engineers D
- /
- v.52 no.9
- /
- pp.554-559
- /
- 2003
In this paper the possibility of snorer-dependent snore recognition using cepstral coefficients was suggested. We assumed that snore and speech sounds have some similarities and we used cepstral coefficients which are widely used for speech recognition. Snoring data were acquired from 18 persons including 5 patients diagnosed as snore patient. To evaluate the performance of proposed method, the distance ratio based on LPC cepstral coefficients was selected as an index for snorer-dependent snore recognition. As a result, distance ratio of 3 was selected as optimal value showing the most efficient snorer-dependent snore recognition, which is high accuracy of 95.05％ on average. In conclusion, the proposed method showed the possibilities to be applied in clinical applications for snorer-dependent snore recognition.
PDF KSCI

A Study on Function Recognition of EMG Signal Using LPC Cepstrum Coefficients (LPC 켑스트럼 계수를 이용한 EMG 신호의 기능 인식에 관한 연구)

Wang, Sung-Moon;Chung, Tae-Yun;Choi, Yun-Ho;Byun, Youn-Shik;Park, Sang-Hui
- Journal of the Korean Institute of Telematics and Electronics
- /
- v.27 no.2
- /
- pp.126-134
- /
- 1990
In this study, eight function discrimination and recognition of the EMG signal from the biceps and triceps of 4 subjects were executed, using the Euclidean and weighted cepstral distance measure with LPC cepstrum coefficients. In case of Euclidean cepstral distance measure, as the number of LPC cepstrum coefficients was increased in 8, 10, 12, 14 the recognition rates of functions are 94.69, 95.63, 96.56, and 96.88[%], respectively, but increasing rates of recognition were inclined to decrease. In case of weighted cepstral distance measure, when the number of LPC cepstrum coefficients was 8, 10, 12 and 14, the recognition rates of functions were 91.88, 95, 99.69, and 96.63[%], respectively.
PDF

Quantization of LPC Coefficients Using a Multi-frame AR-model (Multi-frame AR model을 이용한 LPC 계수 양자화)

Jung, Won-Jin;Kim, Moo-Young
- The Journal of the Acoustical Society of Korea
- /
- v.31 no.2
- /
- pp.93-99
- /
- 2012
For speech coding, a vocal tract is modeled using Linear Predictive Coding (LPC) coefficients. The LPC coefficients are typically transformed to Line Spectral Frequency (LSF) parameters which are advantageous for linear interpolation and quantization. If multidimensional LSF data are quantized directly using Vector-Quantization (VQ), high rate-distortion performance can be obtained by fully utilizing intra-frame correlation. In practice, since this direct VQ system cannot be used due to high computational complexity and memory requirement, Split VQ (SVQ) is used where a multidimensional vector is split into multilple sub-vectors for quantization. The LSF parameters also have high inter-frame correlation, and thus Predictive SVQ (PSVQ) is utilized. PSVQ provides better rate-distortion performance than SVQ. In this paper, to implement the optimal predictors in PSVQ for voice storage devices, we propose Multi-Frame AR-model based SVQ (MF-AR-SVQ) that considers the inter-frame correlations with multiple previous frames. Compared with conventional PSVQ, the proposed MF-AR-SVQ provides 1 bit gain in terms of spectral distortion without significant increase in complexity and memory requirement.
https://doi.org/10.7776/ASK.2012.31.2.093 인용 PDF KSCI

A Study on the Algorithm Development for Speech Recognition of Korean and Japanese (한국어와 일본어의 음성 인식을 위한 알고리즘 개발에 관한 연구)

Lee, Sung-Hwa;Kim, Hyung-Lae
- Journal of IKEEE
- /
- v.2 no.1 s.2
- /
- pp.61-67
- /
- 1998
In this thesis, experiment have performed with the speaker recognition using multilayer feedforward neural network(MFNN) model using Korean and Japanese digits . The 5 adult males and 5 adult females pronounciate form 0 to 9 digits of Korean, Japanese 7 times. And then, they are extracted characteristics coefficient through Pitch deletion algorithm, LPC analysis, and LPC Cepstral analysis to generate input pattern of MFNN. 5 times among them are used to train a neural network, and 2 times is used to measure the performance of neural network. Both Korean and Japanese, Pitch coefficients is about 4%t more enhanced than LPC or LPC Cepstral coefficients.
PDF

A Study on Speech Recognition using Vocal Tract Area Function (성도 면적 함수를 이용한 음성 인식에 관한 연구)

송제혁;김동준
- Journal of Biomedical Engineering Research
- /
- v.16 no.3
- /
- pp.345-352
- /
- 1995
The LPC cepstrum coefficients, which are an acoustic features of speech signal, have been widely used as the feature parameter for various speech recognition systems and showed good performance. The vocal tract area function is a kind of articulatory feature, which is related with the physiological mechanism of speech production. This paper proposes the vocal tract area function as an alternative feature parameter for speech recognition. The linear predictive analysis using Burg algorithm and the vector quantization are performed. Then, recognition experiments for 5 Korean vowels and 10 digits are executed using the conventional LPC cepstrum coefficients and the vocal tract area function. The recognitions using the area function showed the slightly better results than those using the conventional LPC cepstrum coefficients.
PDF

EMG signal identification using LPC cepstrum coefficients (LPC cepstrum 계수를 이용한 근전도 신호의 동작판별)

Chung, T.Y.;Park, S.H.;Kim, H.R.;Wang, M.S.;Choi, Y.H.;Byun, Y.S.
- Proceedings of the KIEE Conference
- /
- 1988.07a
- /
- pp.738-741
- /
- 1988
In this paper, we deal with the movements identification of EMG signals by LPC cepstrum coefficients. Movements were identified by extration of characteristics of similar patterns in Euclid distance measurement method for EMG signals generated by voluntary contractions of subject's musculature. As number of coefficients is larger, we obtain the better rate of movements identification. By exact extraction of signals and decision of optimal coefficient, it is expected that these results will apply to prosthesis control in real-time.
PDF

Speech Recognition with Image Information (영상정보 보완에 의한 음성인식)

이천우;이상원;양근모;박인정
- Proceedings of the IEEK Conference
- /
- 1999.06a
- /
- pp.511-515
- /
- 1999
The main factor decreasing speech recognition rate is the surrounding noise. To lower the noise effect, we generally used the filter bank at preprocessing stage. But, in this paper, we tried to recognize the 10 numeral numbers using 2-D LPC to extract image feature. At first, we obtained the result of speech-only recognition using 13th-order LPC coefficients and then, for distorted speech recognition results of ‘0’, ‘4’, ‘5’, ‘6’ and 9’, we added image parameters such as 12th-order 2-D LPC coefficients. At each frame, we extracted the 2-D LPC coefficients, and simulated recognizer with two parameters such as speech and image. Finally, for the numbers, such as ‘4’and ‘9’, the better results were obtained.
PDF

Search Result 79, Processing Time 0.025 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)