Search | Korea Science

Feature Selection for Multi-Class Genre Classification using Gaussian Mixture Model (Gaussian Mixture Model을 이용한 다중 범주 분류를 위한 특징벡터 선택 알고리즘)

Moon, Sun-Kuk;Choi, Tack-Sung;Park, Young-Cheol;Youn, Dae-Hee
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.32 no.10C
- /
- pp.965-974
- /
- 2007
In this paper, we proposed the feature selection algorithm for multi-class genre classification. In our proposed algorithm, we developed GMM separation score based on Gaussian mixture model for measuring separability between two genres. Additionally, we improved feature subset selection algorithm based on sequential forward selection for multi-class genre classification. Instead of setting criterion as entire genre separability measures, we set criterion as worst genre separability measure for each sequential selection step. In order to assess the performance proposed algorithm, we extracted various features which represent characteristics such as timbre, rhythm, pitch and so on. Then, we investigate classification performance by GMM classifier and k-NN classifier for selected features using conventional algorithm and proposed algorithm. Proposed algorithm showed improved performance in classification accuracy up to 10 percent for classification experiments of low dimension feature vector especially.
PDF KSCI

IMAGE DENOISING BASED ON MIXTURE DISTRIBUTIONS IN WAVELET DOMAIN

Bae, Byoung-Suk;Lee, Jong-In;Kang, Moon-Gi
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2009.01a
- /
- pp.246-249
- /
- 2009
Due to the additive white Gaussian noise (AWGN), images are often corrupted. In recent days, Bayesian estimation techniques to recover noisy images in the wavelet domain have been studied. The probability density function (PDF) of an image in wavelet domain can be described using highly-sharp head and long-tailed shapes. If a priori probability density function having the above properties would be applied well adaptively, better results could be obtained. There were some frequently proposed PDFs such as Gaussian, Laplace distributions, and so on. These functions model the wavelet coefficients satisfactorily and have its own of characteristics. In this paper, mixture distributions of Gaussian and Laplace distribution are proposed, which attempt to corporate these distributions' merits. Such mixture model will be used to remove the noise in images by adopting Maximum a Posteriori (MAP) estimation method. With respect to visual quality, numerical performance and computational complexity, the proposed technique gained better results.
PDF

A Hardware Implementation of Moving Object Detection Algorithm using Gaussian Mixture Model (가우시안 혼합 모델을 이용한 이동 객체 검출 알고리듬의 하드웨어 구현)

Kim, Gyeong-hun;An, Hyo-Sik;Shin, Kyung-wook
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2015.05a
- /
- pp.407-409
- /
- 2015
In this paper, a hardware implementation of MOD(Moving Object Detection) algorithm is described, which is based GMM(Gaussian Mixture Model) and background subtraction. The EGML(Effective Gaussian Mixture Learning) is used to model and update background. Some approximations of EGML calculations are applied to reduce hardware complexity, and pipelining technique is used to improve operating speed. Gaussian parameters are adjustable according to various environment conditions to achieve better MOD performance. MOD processor is verified by using FPGA-in-the-loop verification, and it can operate with 109 MHz clock frequency on XC5VSX95T FPGA device.
PDF

Gaussian Mixture based K2 Rifle Chamber Pressure Modeling of M193 and K100 Bullets (가우시안 혼합모델 기반 탄종별 K2 소화기의 약실압력 모델링)

Kim, Jong-Hwan;Lee, Byounghwak;Kim, Kyoungmin;Shin, Kyuyong;Lee, Wonwoo
- Journal of the Korea Institute of Military Science and Technology
- /
- v.22 no.1
- /
- pp.27-34
- /
- 2019
This paper presents a chamber pressure model development of K2 rifle by applying Gaussian mixture model. In order to materialize a real recoil force of a virtual reality shooting rifle in military combat training, the chamber pressure which is one of major components of the recoil force needs to be investigated and modeled. Over 200,000 data of the chamber pressure were collected by implementing live fire experiments with both K100 and M193 of 5.56 mm bullets. Gaussian mixture method was also applied to create a mathematical model that satisfies nonlinear, asymmetry, and deviations of the chamber pressure which is caused by irregular characteristics of propellant combustion. In addition, Polynomial and Fourier Regression were used for comparison of results, and the sum of squared errors, the coefficient of determination and root-mean-square errors were analyzed for performance measurement.
https://doi.org/10.9766/KIMST.2019.22.1.027 인용 PDF KSCI HTML

Optimization of Gaussian Mixture in CDHMM Training for Improved Speech Recognition

Lee, Seo-Gu;Kim, Sung-Gil;Kang, Sun-Mee;Ko, Han-Seok
- Speech Sciences
- /
- v.5 no.1
- /
- pp.7-21
- /
- 1999
This paper proposes an improved training procedure in speech recognition based on the continuous density of the Hidden Markov Model (CDHMM). Of the three parameters (initial state distribution probability, state transition probability, output probability density function (p.d.f.) of state) governing the CDHMM model, we focus on the third parameter and propose an efficient algorithm that determines the p.d.f. of each state. It is known that the resulting CDHMM model converges to a local maximum point of parameter estimation via the iterative Expectation Maximization procedure. Specifically, we propose two independent algorithms that can be embedded in the segmental K -means training procedure by replacing relevant key steps; the adaptation of the number of mixture Gaussian p.d.f. and the initialization using the CDHMM parameters previously estimated. The proposed adaptation algorithm searches for the optimal number of mixture Gaussian humps to ensure that the p.d.f. is consistently re-estimated, enabling the model to converge toward the global maximum point. By applying an appropriate threshold value, which measures the amount of collective changes of weighted variances, the optimized number of mixture Gaussian branch is determined. The initialization algorithm essentially exploits the CDHMM parameters previously estimated and uses them as the basis for the current initial segmentation subroutine. It captures the trend of previous training history whereas the uniform segmentation decimates it. The recognition performance of the proposed adaptation procedures along with the suggested initialization is verified to be always better than that of existing training procedure using fixed number of mixture Gaussian p.d.f.
PDF

GMM based Speaker Identification using Pitch Information (피치 정보를 이용한 GMM 기반의 화자 식별)

Park Taesun;Hahn Minsoo
- MALSORI
- /
- no.47
- /
- pp.121-129
- /
- 2003
This paper describes the use of pitch information for speaker identification. The recognition system is a GMM based one with 4 connected Korean digits speech database. The mean of the pitch period in voiced sections of speech are shown to be ,useful at discriminating between speakers. Utilizing this feature with Gaussian mixture model in the speaker identification system gave a marked improvement, maximum 6% improvement comparing to the baseline Gaussian mixture model.
PDF

Background Subtraction based on GMM for Night-time Video Surveillance (야간 영상 감시를 위한 GMM기반의 배경 차분)

Yeo, Jung Yeon;Lee, Guee Sang
- Smart Media Journal
- /
- v.4 no.3
- /
- pp.50-55
- /
- 2015
In this paper, we present background modeling method based on Gaussian mixture model to subtract background for night-time video surveillance. In night-time video, it is hard work to distinguish the object from the background because a background pixel is similar to a object pixel. To solve this problem, we change the pixel of input frame to more advantageous value to make the Gaussian mixture model using scaled histogram stretching in preprocessing step. Using scaled pixel value of input frame, we then exploit GMM to find the ideal background pixelwisely. In case that the pixel of next frame is not included in any Gaussian, the matching test in old GMM method ignores the information of stored background by eliminating the Gaussian distribution with low weight. Therefore we consider the stacked data by applying the difference between the old mean and new pixel intensity to new mean instead of removing the Gaussian with low weight. Some experiments demonstrate that the proposed background modeling method shows the superiority of our algorithm effectively.
PDF KSCI

Text Segmentation from Images with Various Light Conditions Based on Gaussian Mixture Model

Tran, Khoa Anh;Lee, Gueesang
- International Journal of Contents
- /
- v.9 no.1
- /
- pp.1-5
- /
- 2013
Standard Gaussian Mixture Model (GMM) is a well-known method for image segmentation. However, one of its problems is that we consider the pixel as independent to each other, which can cause the segmentation results sensitive to noise. It explains why some of existing algorithms still cannot segment texts from the background clearly. Therefore, we present a new method in which we incorporate the spatial relationship between a pixel and its neighbors inside $3{\times}3$ windows to segment the text. Our approach works well with images containing texts, which has different sizes, shapes or colors in case of light changes or complex background. Experimental results demonstrate the robustness, accuracy and effectiveness of the proposed model in image segmentation compared to other methods.
https://doi.org/10.5392/IJoC.2013.9.1.001 인용 PDF KSCI

Emotion Recognition Algorithm Based on Minimum Classification Error incorporating Multi-modal System (최소 분류 오차 기법과 멀티 모달 시스템을 이용한 감정 인식 알고리즘)

Lee, Kye-Hwan;Chang, Joon-Hyuk
- Journal of the Institute of Electronics Engineers of Korea SP
- /
- v.46 no.4
- /
- pp.76-81
- /
- 2009
We propose an effective emotion recognition algorithm based on the minimum classification error (MCE) incorporating multi-modal system The emotion recognition is performed based on a Gaussian mixture model (GMM) based on MCE method employing on log-likelihood. In particular, the reposed technique is based on the fusion of feature vectors based on voice signal and galvanic skin response (GSR) from the body sensor. The experimental results indicate that performance of the proposal approach based on MCE incorporating the multi-modal system outperforms the conventional approach.
PDF KSCI

Performance of GMM and ANN as a Classifier for Pathological Voice

Wang, Jianglin;Jo, Cheol-Woo
- Speech Sciences
- /
- v.14 no.1
- /
- pp.151-162
- /
- 2007
This study focuses on the classification of pathological voice using GMM (Gaussian Mixture Model) and compares the results to the previous work which was done by ANN (Artificial Neural Network). Speech data from normal people and patients were collected, then diagnosed and classified into two different categories. Six characteristic parameters (Jitter, Shimmer, NHR, SPI, APQ and RAP) were chosen. Then the classification method based on the artificial neural network and Gaussian mixture method was employed to discriminate the data into normal and pathological speech. The GMM method attained 98.4% average correct classification rate with training data and 95.2% average correct classification rate with test data. The different mixture number (3 to 15) of GMM was used in order to obtain an optimal condition for classification. We also compared the average classification rate based on GMM, ANN and HMM. The proper number of mixtures on Gaussian model needs to be investigated in our future work.
PDF

Search Result 271, Processing Time 0.031 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)