• Title/Summary/Keyword: GMM모델

Search Result 131, Processing Time 0.021 seconds

다양한 특징 파라미터와 선형변별분석을 이용한 후두암의 선별검사

  • 이원범;왕수건;권순복;전경명;전계록;김수미;김형순;양병곤;조철우
    • Proceedings of the KSLP Conference
    • /
    • 2003.11a
    • /
    • pp.149-149
    • /
    • 2003
  • 후두질환 감별용 음성 분석방법인 multi-dimensional voice program (MDVP)으로 분석이 불가능할 정도로 주기성이 크게 훼손된 후두암 말기의 음성 에 대하여 효과적인 감별을 하기 위하여, 몇 가지 켑스트럼(cepstrum) 파라미터를 비롯하여, 주기성 및 그 동요 정도, 영교차율(zero-crossing rate, ZCR), 스텍트럼 중심 (spectral centroid, SC) 등 다양한 특징 파라미터를 이용한 감별 실험을 수행하였다. 후두암 감별 실험을 위해 부산대학교 병원 이비인후과에서 수집한 정상 남자 음성 데이터 50개, 양성 후두질환 남자 음성 데이터 50개 및 남성 후두암 환자 음성 데이터 105개를 사용하였다. 음성 데이터는 단모음 /아/ 발성만을 사용하였고, 정상인과 양성후두질환 환자, 그리고 MDVP 분석이 가능한 후두암 환자 음성 데이터 중 2/3는 학습에, 나머지 113은 감별실험에 사용하였다. 후두암 감별을 위한 분류기로는 Gaussian Mixture Model(GMM) 분류기를 사용하였으며, 이때 모델의 복잡도를 표현하는 mixture 수는 1에서 10까지 가변시키면서 가장 좋은 성능을 나타내는 값으로 결정하였다. 또한 모든 실험에서 켑스트럼 분석의 차수는 동일하게 12차로 고정시켰다. (중략)

  • PDF

Machine Learning Model for Low Frequency Noise and Bias Temperature Instability (저주파 노이즈와 BTI의 머신 러닝 모델)

  • Kim, Yongwoo;Lee, Jonghwan
    • Journal of the Semiconductor & Display Technology
    • /
    • v.19 no.4
    • /
    • pp.88-93
    • /
    • 2020
  • Based on the capture-emission energy (CEE) maps of CMOS devices, a physics-informed machine learning model for the bias temperature instability (BTI)-induced threshold voltage shifts and low frequency noise is presented. In order to incorporate physics theories into the machine learning model, the integration of artificial neural network (IANN) is employed for the computation of the threshold voltage shifts and low frequency noise. The model combines the computational efficiency of IANN with the optimal estimation of Gaussian mixture model (GMM) with soft clustering. It enables full lifetime prediction of BTI under various stress and recovery conditions and provides accurate prediction of the dynamic behavior of the original measured data.

Accurate Parked Vehicle Detection using GMM-based 3D Vehicle Model in Complex Urban Environments (가우시안 혼합모델 기반 3차원 차량 모델을 이용한 복잡한 도시환경에서의 정확한 주차 차량 검출 방법)

  • Cho, Younggun;Roh, Hyun Chul;Chung, Myung Jin
    • The Journal of Korea Robotics Society
    • /
    • v.10 no.1
    • /
    • pp.33-41
    • /
    • 2015
  • Recent developments in robotics and intelligent vehicle area, bring interests of people in an autonomous driving ability and advanced driving assistance system. Especially fully automatic parking ability is one of the key issues of intelligent vehicles, and accurate parked vehicles detection is essential for this issue. In previous researches, many types of sensors are used for detecting vehicles, 2D LiDAR is popular since it offers accurate range information without preprocessing. The L shape feature is most popular 2D feature for vehicle detection, however it has an ambiguity on different objects such as building, bushes and this occurs misdetection problem. Therefore we propose the accurate vehicle detection method by using a 3D complete vehicle model in 3D point clouds acquired from front inclined 2D LiDAR. The proposed method is decomposed into two steps: vehicle candidate extraction, vehicle detection. By combination of L shape feature and point clouds segmentation, we extract the objects which are highly related to vehicles and apply 3D model to detect vehicles accurately. The method guarantees high detection performance and gives plentiful information for autonomous parking. To evaluate the method, we use various parking situation in complex urban scene data. Experimental results shows the qualitative and quantitative performance efficiently.

A Training Method for Emotion Recognition using Emotional Adaptation (감정 적응을 이용한 감정 인식 학습 방법)

  • Kim, Weon-Goo
    • Journal of IKEEE
    • /
    • v.24 no.4
    • /
    • pp.998-1003
    • /
    • 2020
  • In this paper, an emotion training method using emotional adaptation is proposed to improve the performance of the existing emotion recognition system. For emotion adaptation, an emotion speech model was created from a speech model without emotion using a small number of training emotion voices and emotion adaptation methods. This method showed superior performance even when using a smaller number of emotional voices than the existing method. Since it is not easy to obtain enough emotional voices for training, it is very practical to use a small number of emotional voices in real situations. In the experimental results using a Korean database containing four emotions, the proposed method using emotional adaptation showed better performance than the existing method.

A PCA-based MFDWC Feature Parameter for Speaker Verification System (화자 검증 시스템을 위한 PCA 기반 MFDWC 특징 파라미터)

  • Hahm Seong-Jun;Jung Ho-Youl;Chung Hyun-Yeol
    • The Journal of the Acoustical Society of Korea
    • /
    • v.25 no.1
    • /
    • pp.36-42
    • /
    • 2006
  • A Principal component analysis (PCA)-based Mel-Frequency Discrete Wavelet Coefficients (MFDWC) feature Parameters for speaker verification system is Presented in this Paper In this method, we used the 1st-eigenvector obtained from PCA to calculate the energy of each node of level that was approximated by. met-scale. This eigenvector satisfies the constraint of general weighting function that the squared sum of each component of weighting function is unity and is considered to represent speaker's characteristic closely because the 1st-eigenvector of each speaker is fairly different from the others. For verification. we used Universal Background Model (UBM) approach that compares claimed speaker s model with UBM on frame-level. We performed experiments to test the effectiveness of PCA-based parameter and found that our Proposed Parameters could obtain improved average Performance of $0.80\%$compared to MFCC. $5.14\%$ to LPCC and 6.69 to existing MFDWC.

A Neuro-Fuzzy System Modeling using Gaussian Mixture Model and Clustering Method (GMM과 클러스터링 기법에 의한 뉴로-퍼지 시스템 모델링)

  • Kim, Sung-Suk;Kwak, Keun-Chang;Ryu, Jeong-Woong;Chun, Myung-Geun
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.12 no.6
    • /
    • pp.571-576
    • /
    • 2002
  • There have been a lot of considerations dealing with improving the performance of neuro-fuzzy system. The studies on the neuro-fuzzy modeling have largely been devoted to two approaches. First is to improve performance index of system. The other is to reduce the structure size. In spite of its satisfactory result, it should be noted that these are difficult to extend to high dimensional input or to increase the membership functions. We propose a novel neuro-fuzzy system based on the efficient clustering method for initializing the parameters of the premise part. It is a very useful method that maintains a few number of rules and improves the performance. It combine the various algorithms to improve the performance. The Expectation-Maximization algorithm of Gaussian mixture model is an efficient estimation method for unknown parameter estimation of mirture model. The obtained parameters are used for fuzzy clustering method. The proposed method satisfies these two requirements using the Gaussian mixture model and neuro-fuzzy modeling. Experimental results indicate that the proposed method is capable of giving reliable performance.

Speech Enhancement using RNN Phoneme based VAD (음소기반의 순환 신경망 음성 검출기를 이용한 음성 향상)

  • Lee, Kang;Kang, Sang-Ick;Kwon, Jang-woo;Lee, Samgmin
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.54 no.5
    • /
    • pp.85-89
    • /
    • 2017
  • In this papers, we apply high performance hardware and machine learning algorithm to build an advanced VAD algorithm for speech enhancement. Since speech is made of series of phoneme, using recurrent neural network (RNN) which consider previous data is proper method to build a speech model. It is impossible to study every noise in real world. So our algorithm is builded by phoneme based study. we detect voice present frames in noisy speech signal and make enhancement of the speech signal. Phoneme based RNN model shows advanced performance in speech signal which has high correlation among each frames. To verify the performance of proposed algorithm, we compare VAD result with label data and speech enhancement result in various noise environments with previous speech enhancement algorithm.

Measuring of Effectiveness of Tracking Based Accident Detection Algorithm Using Gaussian Mixture Model (가우시안 배경혼합모델을 이용한 Tracking기반 사고검지 알고리즘의 적용 및 평가)

  • Oh, Ju-Taek;Min, Jun-Young
    • International Journal of Highway Engineering
    • /
    • v.14 no.3
    • /
    • pp.77-85
    • /
    • 2012
  • Most of Automatic Accident Detection Algorithm has a problem of detecting an accident as traffic congestion. Actually, center's managers deal with accidents depend on watching CCTV or accident report by drivers even though they run the Automatic Accident Detection system. It is because of the system's detecting errors such as detecting non-accidents as accidents, and it makes decreasing in the system's overall reliability. It means that Automatic Accident Detection Algorithm should not only have high detection probability but also have low false alarm probability, and it has to detect accurate accident spot. The study tries to verify and evaluate the effectiveness of using Gaussian Mixture Model and individual vehicle tracking to adapt Accident Detection Algorithm to Center Management System by measuring accident detection probability and false alarm probability's frequency in the real accident.

Road Sign Detection with Weather/Illumination Classifications and Adaptive Color Models in Various Road Images (날씨·조명 판단 및 적응적 색상모델을 이용한 도로주행 영상에서의 이정표 검출)

  • Kim, Tae Hung;Lim, Kwang Yong;Byun, Hye Ran;Choi, Yeong Woo
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.4 no.11
    • /
    • pp.521-528
    • /
    • 2015
  • Road-view object classification methods are mostly influenced by weather and illumination conditions, thus the most of the research activities are based on dataset in clean weathers. In this paper, we present a road-view object classification method based on color segmentation that works for all kinds of weathers. The proposed method first classifies the weather and illumination conditions and then applies the weather-specified color models to find the road traffic signs. Using 5 different features of the road-view images, we classify the weather and light conditions as sunny, cloudy, rainy, night, and backlight. Based on the classified weather and illuminations, our model selects the weather-specific color ranges to generate Gaussian Mixture Model for each colors, Green, Yellow, and Blue. The proposed method successfully detects the traffic signs regardless of the weather and illumination conditions.

Effective Classification Method of Hierarchical CNN for Multi-Class Outlier Detection (다중 클래스 이상치 탐지를 위한 계층 CNN의 효과적인 클래스 분할 방법)

  • Kim, Jee-Hyun;Lee, Seyoung;Kim, Yerim;Ahn, Seo-Yeong;Park, Saerom
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2022.07a
    • /
    • pp.81-84
    • /
    • 2022
  • 제조 산업에서의 이상치 검출은 생산품의 품질과 운영비용을 절감하기 위한 중요한 요소로 최근 딥러닝을 사용하여 자동화되고 있다. 이상치 검출을 위한 딥러닝 기법에는 CNN이 있으며, CNN을 계층적으로 구성할 경우 단일 CNN 모델에 비해 상대적으로 성능의 향상을 보일 수 있다는 것이 많은 선행 연구에서 나타났다. 이에 MVTec-AD 데이터셋을 이용하여 계층 CNN이 다중 클래스 이상치 판별 문제에 대해 효과적인지를 탐구하고자 하였다. 실험 결과 단일 CNN의 정확도는 0.7715, 계층 CNN의 정확도는 0.7838로 다중 클래스 이상치 판별 문제에 있어 계층 CNN 방식 접근이 다중 클래스 이상치 탐지 문제에서 알고리즘의 성능을 향상할 수 있음을 확인할 수 있었다. 계층 CNN은 모델과 파라미터의 개수와 리소스의 사용이 단일 CNN에 비하여 기하급수적으로 증가한다는 단점이 존재한다. 이에 계층 CNN의 장점을 유지하며 사용 리소스를 절약하고자 하였고 K-means, GMM, 계층적 클러스터링 알고리즘을 통해 제작한 새로운 클래스를 이용해 계층 CNN을 구성하여 각각 정확도 0.7930, 0.7891, 0.7936의 결과를 얻을 수 있었다. 이를 통해 Clustering 알고리즘을 사용하여 적절히 물체를 분류할 경우 물체에 따른 개별 상태 판단 모델을 제작하는 것과 비슷하거나 더 좋은 성능을 내며 리소스 사용을 줄일 수 있음을 확인할 수 있었다.

  • PDF