• Title/Abstract/Keywords: probabilistic SVM

18 search results; processing time 0.025 s

Identification and Detection of Emotion Using Probabilistic Output SVM

  • 조훈영;정규준
    • The Journal of the Acoustical Society of Korea / Vol. 25, No. 8 / pp. 375-382 / 2006
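  • This paper addresses methods for automatically identifying the emotional information contained in speech signals and for detecting a specific emotion. For automatic emotion identification and detection, long-term acoustic features were used, and an F-score-based feature selection technique was applied to select the optimal feature parameters. A conventional SVM was converted into a probabilistic-output SVM to build the emotion identification and detection systems, and three approximations of the log-likelihood ratio were proposed for hypothesis-test-based emotion detection and their performance was compared. In experiments on the SUSAS database, F-score-based feature selection improved emotion identification performance, and the effectiveness of the probabilistic-output SVM was verified. For emotion detection, the proposed method detected the angry emotion with 91.3% accuracy.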
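
The abstract does not give the exact F-score formula, so the following is only a minimal sketch that assumes the commonly used two-class definition: for each feature, between-class scatter divided by within-class scatter. The function names `f_score` and `rank_features` and the toy data are hypothetical.

```python
from statistics import mean, variance


def f_score(pos, neg):
    """Two-class F-score of one feature (assumed definition):
    between-class scatter divided by within-class scatter."""
    all_mean = mean(pos + neg)
    between = (mean(pos) - all_mean) ** 2 + (mean(neg) - all_mean) ** 2
    within = variance(pos) + variance(neg)  # sample variances (n - 1 denominator)
    return between / within if within > 0 else float("inf")


def rank_features(X_pos, X_neg):
    """Return (indices sorted from highest to lowest F-score, per-feature scores)."""
    n_features = len(X_pos[0])
    scores = [
        f_score([row[j] for row in X_pos], [row[j] for row in X_neg])
        for j in range(n_features)
    ]
    order = sorted(range(n_features), key=lambda j: scores[j], reverse=True)
    return order, scores


# Toy data: feature 0 separates the two classes, feature 1 does not.
X_pos = [[1.0, 5.0], [1.2, 3.0], [0.8, 4.0]]
X_neg = [[3.0, 4.5], [3.2, 3.5], [2.8, 4.0]]
order, scores = rank_features(X_pos, X_neg)
```

Keeping only the top-ranked indices in `order` would then give the reduced feature set passed to the classifier.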

Probabilistic Support Vector Machine Localization in Wireless Sensor Networks

  • Samadian, Reza;Noorhosseini, Seyed Majid
    • ETRI Journal / Vol. 33, No. 6 / pp. 924-934 / 2011
  • Sensor networks play an important role in making the dream of ubiquitous computing a reality. With a variety of applications, sensor networks have the potential to influence everyone's life in the near future. However, there are a number of issues in deployment and exploitation of these networks that must be dealt with for sensor network applications to realize such potential. Localization of the sensor nodes, which is the subject of this paper, is one of the basic problems that must be solved for sensor networks to be effectively used. This paper proposes a probabilistic support vector machine (SVM)-based method to gain a fairly accurate localization of sensor nodes. As opposed to many existing methods, our method assumes almost no extra equipment on the sensor nodes. Our experiments demonstrate that the probabilistic SVM method (PSVM) provides a significant improvement over existing localization methods, particularly in sparse networks and rough environments. In addition, a post processing step for PSVM, called attractive/repulsive potential field localization, is proposed, which provides even more improvement on the accuracy of the sensor node locations.

Global Soft Decision Using Probabilistic Outputs of Support Vector Machine for Speech Enhancement

  • 조규행;장준혁
    • The Journal of the Acoustical Society of Korea / Vol. 27, No. 2 / pp. 75-79 / 2008
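  • This paper presents a new speech enhancement technique that uses global soft decision (GSD) based on a support vector machine (SVM). In general, speech enhancement techniques based on soft decision (SD) gain modification and noise power estimation are known to outperform those that use hard decision. In particular, several studies have applied the global speech absence probability (GSAP), an effective measure of speech absence in each frame, to SD-based speech enhancement. In this paper, a new GSAP, estimated from the probabilistic output of an SVM obtained with a sigmoid function, is applied to speech enhancement. Evaluated with PESQ and MOS in a variety of noise environments, the proposed algorithm showed improved results compared with the conventional GSD-based spectral enhancement technique.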
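
As a rough illustration of how a sigmoid can turn a raw SVM decision value into a probability, the sketch below fits the two parameters of p = 1 / (1 + exp(A f + B)) by plain gradient descent on the cross-entropy loss. This is a Platt-style mapping under simplifying assumptions (no target regularization, fixed learning rate), not the procedure from the paper; the names `sigmoid_probability` and `fit_sigmoid` and the toy decision values are hypothetical.

```python
import math


def sigmoid_probability(f, A, B):
    """Map a decision value f to P(positive | f) = 1 / (1 + exp(A*f + B))."""
    z = A * f + B
    if z >= 0:  # numerically stable branch for large positive z
        e = math.exp(-z)
        return e / (1.0 + e)
    return 1.0 / (1.0 + math.exp(z))


def fit_sigmoid(scores, labels, lr=0.1, n_iter=500):
    """Fit A and B by gradient descent on the mean cross-entropy (labels in {0, 1})."""
    A, B = 0.0, 0.0
    n = len(scores)
    for _ in range(n_iter):
        grad_A = grad_B = 0.0
        for f, t in zip(scores, labels):
            p = sigmoid_probability(f, A, B)
            grad_A += (t - p) * f  # d(loss)/dA for this sample
            grad_B += (t - p)      # d(loss)/dB for this sample
        A -= lr * grad_A / n
        B -= lr * grad_B / n
    return A, B


# Toy decision values: positive values belong to the positive class.
decision_values = [-2.0, -1.0, -0.5, 0.5, 1.0, 2.0]
targets = [0, 0, 0, 1, 1, 1]
A, B = fit_sigmoid(decision_values, targets)
p_high = sigmoid_probability(2.0, A, B)
p_low = sigmoid_probability(-2.0, A, B)
```

If the positive class were taken to mean "speech present" (an assumption made here only for illustration), a speech absence probability could be read off as `1 - sigmoid_probability(f, A, B)`.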

Deterministic and probabilistic analysis of tunnel face stability using support vector machine

  • Li, Bin;Fu, Yong;Hong, Yi;Cao, Zijun
    • Geomechanics and Engineering / Vol. 25, No. 1 / pp. 17-30 / 2021
  • This paper develops a convenient approach for deterministic and probabilistic evaluations of tunnel face stability using support vector machine classifiers. The proposed method comprises two major steps: construction of the training dataset and determination of instance-based classifiers. In step one, an orthogonal design is used to produce representative samples after the ranges and levels of the factors that influence tunnel face stability are specified. The training dataset is then labeled by two-dimensional strength reduction analyses performed in OptumG2. For any unknown instance, the second step applies the training dataset for classification, which is carried out by an ad hoc Python program. The classification of unknown samples starts with the selection of instance-based training samples using the k-nearest neighbors algorithm, followed by the construction of an instance-based SVM-KNN classifier. The classifier then provides the labels of the unknown instances, avoiding the need to calculate their corresponding performance functions. Probabilistic evaluations are performed by Monte Carlo simulation based on the SVM-KNN classifier. The ratio of the number of unstable samples to the total number of simulated samples is taken as the failure probability, which is validated against and compared with the response surface method.
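
The failure-probability estimate described in the abstract (unstable samples divided by total simulated samples) can be sketched as follows. The classifier here is a hypothetical stand-in rule, not the SVM-KNN classifier of the paper, and the uniform input distributions are assumptions chosen so that the exact answer (0.125) is known.

```python
import random


def is_unstable(sample):
    """Hypothetical stand-in for a trained classifier: flags a sample as unstable.
    With two independent Uniform(0, 1) inputs, P(x1 + x2 < 0.5) = 0.125."""
    x1, x2 = sample
    return x1 + x2 < 0.5


def monte_carlo_failure_probability(classify, n_samples, seed=0):
    """Estimate the failure probability as (# unstable samples) / (# samples)."""
    rng = random.Random(seed)  # seeded for reproducibility
    n_unstable = 0
    for _ in range(n_samples):
        sample = (rng.random(), rng.random())  # assumed input distributions
        if classify(sample):
            n_unstable += 1
    return n_unstable / n_samples


pf = monte_carlo_failure_probability(is_unstable, n_samples=20000)
```

Because each sample only needs a label from the classifier, no performance function has to be evaluated inside the loop, which is the saving the abstract points to.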

Survey on Nucleotide Encoding Techniques and SVM Kernel Design for Human Splice Site Prediction

  • Bari, A.T.M. Golam;Reaz, Mst. Rokeya;Choi, Ho-Jin;Jeong, Byeong-Soo
    • Interdisciplinary Bio Central / Vol. 4, No. 4 / pp. 14.1-14.6 / 2012
  • Splice site prediction in DNA sequences is a basic search problem for finding exon/intron and intron/exon boundaries. Removing the introns and then joining the exons together forms the mRNA sequence, which is the input to the translation process. Splicing is a necessary step in the central dogma of molecular biology. The main task of splice site prediction is first to find the candidate sequences ending in GT and AG, and then to identify the true and false GT- and AG-ended sequences among those candidates. In this paper, we survey research on splice site prediction based on the support vector machine (SVM). The basic differences between these works are the nucleotide encoding technique and the SVM kernel selection. Some methods encode the DNA sequence in a sparse way, whereas others encode it in a probabilistic manner. The encoded sequences serve as input to the SVM, whose task is to classify them using its learned model. The accuracy of classification depends largely on the proper selection of a kernel for sequence data as well as on the selection of kernel parameters. We examine each encoding technique and group the techniques according to their similarity. We then discuss kernels and their parameter selection. Our survey provides a basic understanding of encoding approaches and proper SVM kernel selection for splice site prediction.
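
One common form of sparse encoding is one-hot (orthogonal) coding, in which each nucleotide becomes a four-element indicator vector. The sketch below assumes that form and an arbitrary A, C, G, T ordering; the abstract does not specify which sparse scheme the surveyed methods use, and `NUCLEOTIDES` and `encode_sparse` are hypothetical names.

```python
NUCLEOTIDES = "ACGT"  # assumed ordering of the four indicator positions


def encode_sparse(sequence):
    """One-hot encode a DNA string: each base becomes 4 values with a single 1."""
    encoded = []
    for base in sequence.upper():
        if base not in NUCLEOTIDES:
            raise ValueError(f"unexpected symbol: {base!r}")
        encoded.extend(1 if base == n else 0 for n in NUCLEOTIDES)
    return encoded


# A candidate donor-site dinucleotide and a longer toy window.
gt_vector = encode_sparse("GT")
window_vector = encode_sparse("CAGGTAAGT")
```

A sequence window of length L becomes a vector of length 4L, which could then be passed to an SVM as a fixed-length feature vector.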

Multiple SVM Classifier for Pattern Classification in Data Mining

  • 김만선;이상용
    • Journal of Korean Institute of Intelligent Systems / Vol. 15, No. 3 / pp. 289-293 / 2005
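  • Pattern classification extracts pattern information, in its various forms, that represents real-world objects and determines which category (class) each pattern belongs to. Pattern classification is a computer application software technology for data mining, industrial automation, and business automation, and is currently used in a variety of fields. Its foremost goal is to improve classification performance, and over the past 40 years many researchers have tried a variety of approaches to that end. Commonly used single-classifier methods include the Bayes classifier based on probabilistic inference over patterns, decision trees, methods using distance functions, neural networks, and clustering, but these are not efficient for analyzing large-volume, high-dimensional data. Research is therefore actively under way on multiple classifier systems, which improve performance by combining several mutually complementary classifiers. This paper points out the problems of existing research on multiple SVM (Support Vector Machine) classifiers and proposes a new model. To extend the SVM to a multi-class classifier, we propose BORSE (Bootstrap Resampling SVM by Ensemble), a model that, based on a one-vs-rest strategy, treats the output value of each SVM as a signal with a nonlinear pattern and trains a neural network on these outputs to combine them into the final classification result.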

Comparison of Machine Learning-Based Radioisotope Identifiers for Plastic Scintillation Detector

  • Jeon, Byoungil;Kim, Jongyul;Yu, Yonggyun;Moon, Myungkook
    • Journal of Radiation Protection and Research / Vol. 46, No. 4 / pp. 204-212 / 2021
  • Background: Identification of radioisotopes for plastic scintillation detectors is challenging because their spectra have poor energy resolution and lack photopeaks. To overcome this weakness, many researchers have conducted radioisotope identification studies using machine learning algorithms; however, the effect of data normalization on radioisotope identification has not yet been addressed. Furthermore, studies on machine learning-based radioisotope identifiers for plastic scintillation detectors are limited. Materials and Methods: In this study, machine learning-based radioisotope identifiers were implemented, and their performances according to data normalization methods were compared. Eight classes of radioisotopes consisting of combinations of $^{22}$Na, $^{60}$Co, and $^{137}$Cs, and the background, were defined. The training set was generated by random sampling based on probability density functions acquired from experiments and simulations, and the test set was acquired from experiments. Support vector machine (SVM), artificial neural network (ANN), and convolutional neural network (CNN) were implemented as radioisotope identifiers with six data normalization methods, and trained using the generated training set. Results and Discussion: The implemented identifiers were evaluated on test sets acquired from experiments with and without gain shifts to confirm the robustness of the identifiers against the gain shift effect. Among the three machine learning-based radioisotope identifiers, prediction accuracy followed the order SVM > ANN > CNN, and the training time ranking followed the same order, with the SVM the fastest to train. Conclusion: The prediction accuracy for the combined test sets was highest with the SVM. The CNN exhibited the smallest variation in prediction accuracy across classes, even though it had the lowest prediction accuracy for the combined test sets among the three identifiers. The SVM exhibited the highest prediction accuracy for the combined test sets, and its training time was the shortest among the three identifiers.

Moment-rotational analysis of soil during mining induced ground movements by hybrid machine learning assisted quantification models of ELM-SVM

  • Dai, Bibo;Xu, Zhijun;Zeng, Jie;Zandi, Yousef;Rahimi, Abouzar;Pourkhorshidi, Sara;Khadimallah, Mohamed Amine;Zhao, Xingdong;El-Arab, Islam Ezz
    • Steel and Composite Structures / Vol. 41, No. 6 / pp. 831-850 / 2021
  • Surface subsidence caused by mining has an impact on neighboring structures and utilities. In other words, subsurface voids created by mining or tunneling activities induce soil movement, exposing buildings to physical and/or functional destruction. The soil-structure system is evaluated using probability distribution laws to account for its uncertainty and complexity when estimating structural vulnerability. In this study, to investigate the displacement field and surface settlement profile caused by mining subsidence, analytical equations for the moment-rotation response of soil during mining-induced ground movements are developed on the basis of a Winkler soil model. To define the full static moment-rotation response, an equation for the uplift-yield state is constructed and integrated with equations for the uplift-only and yield-only conditions. The constructed model's findings reveal that the inverse of the factor of safety (x) has a considerable influence on the moment-rotation curve. The maximal moment-rotation response of the footing is obtained at x = 0.6. In addition to the Winkler model, the computed moment-rotation response results derived from the literature were analyzed with ELM-SVM, a hybrid of Extreme Learning Machine (ELM) and Support Vector Machine (SVM). Monte Carlo simulations are also used to apply continuous random parameters to assess the transmission of ground motions to structures. Based on the RMSE and $R^2$ values, the results show that the choice of probabilistic laws for the input parameters has a substantial impact on the outcome of the analysis.

Performance Improvement of Automatic Basal Cell Carcinoma Detection Using Half Hanning Window

  • 박아론;백성준;민소희;유홍연;김진영;홍성훈
    • The Journal of the Korea Contents Association / Vol. 6, No. 12 / pp. 105-112 / 2006
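  • This study proposes a simple preprocessing method for detecting the well-known basal cell carcinoma. The preprocessing clips the data with a half Hanning window and reduces the dimensionality using PCA (principal components analysis). The half Hanning window improved classification performance by lowering the magnitude near the $1650\,cm^{-1}$ peak, which reduced the false negative rate. The classification results of the MAP (maximum a posteriori), KNN (k-nearest neighbor), PNN (probabilistic neural network), MLP (multilayer perceptron), SVM (support vector machine), and MSE (minimum squared error) classifiers used in this experiment demonstrate that the proposed method is effective. In classification experiments on 216 Raman spectra, the KNN classifier reached a sensitivity of about 97.3% and showed the largest improvement in basal cell carcinoma detection performance when the proposed window was applied.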
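
A half Hanning window is one half (rising or falling) of a standard Hann window. The sketch below builds one and uses the falling half to taper the last samples of a spectrum, which lowers the magnitude near that end. Which end of the Raman spectrum is tapered and over how many points are assumptions made here for illustration; the abstract does not state them, and `half_hann` and `taper_end` are hypothetical names.

```python
import math


def half_hann(n_points, rising=True):
    """Half of a Hann window: goes smoothly from 0 to 1 (or 1 to 0 if not rising)."""
    if n_points == 1:
        return [1.0]
    w = [0.5 * (1.0 - math.cos(math.pi * k / (n_points - 1))) for k in range(n_points)]
    return w if rising else w[::-1]


def taper_end(spectrum, n_taper):
    """Multiply the last n_taper samples by a falling half-Hann window."""
    if not 0 < n_taper <= len(spectrum):
        raise ValueError("n_taper must be between 1 and len(spectrum)")
    split = len(spectrum) - n_taper
    window = half_hann(n_taper, rising=False)
    tail = [s * w for s, w in zip(spectrum[split:], window)]
    return spectrum[:split] + tail


# Toy flat spectrum: only the final three samples are attenuated.
rising_window = half_hann(3)
tapered = taper_end([1.0, 1.0, 1.0, 1.0, 1.0], n_taper=3)
```

The tapered spectrum would then go through dimensionality reduction (PCA in the paper) before classification.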

Implementation of an MKSV System for Gene Disease Analysis Using Data Mining (For Gene Disease Analysis using Data Mining Implement MKSV System)

  • 정유정;최광미
    • The Journal of the Korea Institute of Electronic Communication Sciences / Vol. 14, No. 4 / pp. 781-786 / 2019
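  • To serve goals such as disease research, which today deals with diverse life phenomena, it must be possible to process the big data obtained from such research and give it practical value. The MKSV algorithm proposed in this paper estimates the optimal probability distribution to determine the input pattern and then classifies it with data mining techniques, achieving an efficient computational cost and recognition rate. By simulating the probabilistic flow of gene data and classifying the data through a big-data mining process, the MKSV algorithm shows a fast and effective performance improvement, and it should be useful for studying the relationship between genes and the diseases that are rapidly increasing in today's society.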