Search | Korea Science

Speech Emotion Recognition Based on GMM Using FFT and MFB Spectral Entropy (FFT와 MFB Spectral Entropy를 이용한 GMM 기반의 감정인식)

Lee, Woo-Seok;Roh, Yong-Wan;Hong, Hwang-Seok
- Proceedings of the KIEE Conference
- /
- 2008.04a
- /
- pp.99-100
- /
- 2008
This paper proposes a Gaussian Mixture Model (GMM) - based speech emotion recognition methods using four feature parameters; 1) Fast Fourier Transform(FFT) spectral entropy, 2) delta FFT spectral entropy, 3) Mel-frequency Filter Bank (MFB) spectral entropy, and 4) delta MFB spectral entropy. In addition, we use four emotions in a speech database including anger, sadness, happiness, and neutrality. We perform speech emotion recognition experiments using each pre-defined emotion and gender. The experimental results show that the proposed emotion recognition using FFT spectral-based entropy and MFB spectral-based entropy performs better than existing emotion recognition based on GMM using energy, Zero Crossing Rate (ZCR), Linear Prediction Coefficient (LPC), and pitch parameters. In experimental Results, we attained a maximum recognition rate of 75.1% when we used MFB spectral entropy and delta MFB spectral entropy.
PDF

CHAIN TRANSITIVE SETS AND DOMINATED SPLITTING FOR GENERIC DIFFEOMORPHISMS

Lee, Manseob
- Journal of the Chungcheong Mathematical Society
- /
- v.30 no.2
- /
- pp.177-181
- /
- 2017
Let $f:M{\rightarrow}M$ be a diffeomorphism of a compact smooth manifold M. In this paper, we show that $C^1$ generically, if a chain transitive set ${\Lambda}$ is locally maximal then it admits a dominated splitting. Moreover, $C^1$ generically if a chain transitive set ${\Lambda}$ of f is locally maximal then it has zero entropy.
https://doi.org/10.14403/jcms.2017.30.2.177 인용 PDF KSCI

COMPACT MANIFOLDS WITH THE MINIMAL ENTROPY

Yim, Jin-Whan
- Communications of the Korean Mathematical Society
- /
- v.10 no.2
- /
- pp.365-374
- /
- 1995
On a compact manifold without conjugate points, the volume entropy can be obtained as the average mean curvature of the horospheres in the universal covering space. In the case when the volume entropy is zero, we prove that the universal covering space is diffeomorphic to a product space with a line factor. This fact can be considered as a surporting evidence for the Mane's conjecture, which claims the flatness of the mainfold.
PDF

ALGEBRAIC ENTROPIES OF NATURAL NUMBERS WITH ONE OR TWO PRIME FACTORS

JEONG, SEUNGPIL;KIM, KYONG HOON;KIM, GWANGIL
- The Pure and Applied Mathematics
- /
- v.23 no.3
- /
- pp.205-221
- /
- 2016
We formulate the additive entropy of a natural number in terms of the additive partition function, and show that its multiplicative entropy is directly related to the multiplicative partition function. We give a practical formula for the multiplicative entropy of natural numbers with two prime factors. We use this formula to analyze the comparative density of additive and multiplicative entropy, prove that this density converges to zero as the number tends to infinity, and empirically observe this asymptotic behavior.
https://doi.org/10.7468/jksmeb.2016.23.3.205 인용 PDF KSCI KPUBS HTML

Voice Activity Detection Based on Entropy in Noisy Car Environment (차량 잡음 환경에서 엔트로피 기반의 음성 구간 검출)

Roh, Yong-Wan;Lee, Kue-Bum;Lee, Woo-Seok;Hong, Kwang-Seok
- Journal of the Institute of Convergence Signal Processing
- /
- v.9 no.2
- /
- pp.121-128
- /
- 2008
Accurate voice activity detection have a great impact on performance of speech applications including speech recognition, speech coding, and speech communication. In this paper, we propose methods for voice activity detection that can adapt to various car noise situations during driving. Existing voice activity detection used various method such as time energy, frequency energy, zero crossing rate, and spectral entropy that have a weak point of rapid. decline performance in noisy environments. In this paper, the approach is based on existing spectral entropy for VAD that we propose voice activity detection method using MFB(Met-frequency filter banks) spectral entropy, gradient FFT(Fast Fourier Transform) spectral entropy. and gradient MFB spectral entropy. FFT multiplied by Mel-scale is MFB and Mel-scale is non linear scale when human sound perception reflects characteristic of speech. Proposed MFB spectral entropy method clearly improve the ability to discriminate between speech and non-speech for various in noisy car environments that achieves 93.21% accuracy as a result of experiments. Compared to the spectral entropy method, the proposed voice activity detection gives an average improvement in the correct detection rate of more than 3.2%.
PDF

Efficient Adaptive Algorithms Based on Zero-Error Probability Maximization (영확률 최대화에 근거한 효율적인 적응 알고리듬)

Kim, Namyong
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.39A no.5
- /
- pp.237-243
- /
- 2014
In this paper, a calculation-efficient method for weight update in the algorithm based on maximization of the zero-error probability (MZEP) is proposed. This method is to utilize the current slope value in calculation of the next slope value, replacing the block processing that requires a summation operation in a sample time period. The simulation results shows that the proposed method yields the same performance as the original MZEP algorithm while significantly reducing the computational time and complexity with no need for a buffer for error samples. Also the proposed algorithm produces faster convergence speed than the algorithm that is based on the error-entropy minimization.
https://doi.org/10.7840/kics.2014.39A.5.237 인용 PDF KSCI

High Efficient Entropy Coding For Edge Image Compression

Han, Jong-Woo;Kim, Do-Hyun;Kim, Yoon
- Journal of the Korea Society of Computer and Information
- /
- v.21 no.5
- /
- pp.31-40
- /
- 2016
In this paper, we analyse the characteristics of the edge image and propose a new entropy coding optimized to the compression of the edge image. The pixel values of the edge image have the Gaussian distribution around '0', and most of the pixel values are '0'. By using this analysis, the Zero Block technique is utilized in spatial domain. And the Intra Prediction Mode of the edge image is similar to the mode of the surrounding blocks or likely to be the Planar Mode or the Horizontal Mode. In this paper, we make use of the MPM technique that produces the Intra Prediction Mode with high probability modes. By utilizing the above properties, we design a new entropy coding method that is suitable for edge image and perform the compression. In case the existing compression techniques are applied to edge image, compression ratio is low and the algorithm is complicated as more than necessity and the running time is very long, because those techniques are based on the natural images. However, the compression ratio and the running time of the proposed technique is high and very short, respectively, because the proposed algorithm is optimized to the compression of the edge image. Experimental results indicate that the proposed algorithm provides better visual and PSNR performance up to 11 times than the JPEG.
https://doi.org/10.9708/jksci.2016.21.5.031 인용 PDF KSCI

Maximization of Zero-Error Probability for Adaptive Channel Equalization

Kim, Nam-Yong;Jeong, Kyu-Hwa;Yang, Liuqing
- Journal of Communications and Networks
- /
- v.12 no.5
- /
- pp.459-465
- /
- 2010
A new blind equalization algorithm that is based on maximizing the probability that the constant modulus errors concentrate near zero is proposed. The cost function of the proposed algorithm is to maximize the probability that the equalizer output power is equal to the constant modulus of the transmitted symbols. Two blind information-theoretic learning (ITL) algorithms based on constant modulus error signals are also introduced: One for minimizing the Euclidean probability density function distance and the other for minimizing the constant modulus error entropy. The relations between the algorithms and their characteristics are investigated, and their performance is compared and analyzed through simulations in multi-path channel environments. The proposed algorithm has a lower computational complexity and a faster convergence speed than the other ITL algorithms that are based on a constant modulus error. The error samples of the proposed blind algorithm exhibit more concentrated density functions and superior error rate performance in severe multi-path channel environments when compared with the other algorithms.
PDF KSCI

A study on monitoring of fatigue using the $2^{nd}$ order maximum entropy method ($2^{nd}$ order maximum entropy method를 이용한 근피로도의 측정에 관한 연구)

Cho, S.J.;Kim, M.S.;Lee, K.W.;Kim, K.G.;Kim, S.L.;Park, H.S.;Lee, K.M.
- Proceedings of the KOSOMBE Conference
- /
- v.1990 no.05
- /
- pp.47-50
- /
- 1990
In this study, the degree of spectral transfer to lower frequency caused by accumulation of Latic acid inside the muscle is estimated the convintional dip analysis, zero-crossing method and FFT method have intrinsic errors and estimation problems in case of severe noise. The new spectral analysis method using "$2^{nd}$ order Maximum Entropy Method" was applied to estimate mean frequency and we confirmed that this new method yields fast and reliable estimation over the FFT method.
PDF

Performance Comparison of Feature Parameters and Classifiers for Speech/Music Discrimination (음성/음악 판별을 위한 특징 파라미터와 분류기의 성능비교)

Kim Hyung Soon;Kim Su Mi
- MALSORI
- /
- no.46
- /
- pp.37-50
- /
- 2003
In this paper, we evaluate and compare the performance of speech/music discrimination based on various feature parameters and classifiers. As for feature parameters, we consider High Zero Crossing Rate Ratio (HZCRR), Low Short Time Energy Ratio (LSTER), Spectral Flux (SF), Line Spectral Pair (LSP) distance, entropy and dynamism. We also examine three classifiers: k Nearest Neighbor (k-NN), Gaussian Mixure Model (GMM), and Hidden Markov Model (HMM). According to our experiments, LSP distance and phoneme-recognizer-based feature set (entropy and dunamism) show good performance, while performance differences due to different classifiers are not significant. When all the six feature parameters are employed, average speech/music discrimination accuracy up to 96.6% is achieved.
PDF

Search Result 31, Processing Time 0.02 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)