• Title/Summary/Keyword: bayesian classification

Search Result 254, Processing Time 0.034 seconds

A Study on the Classification for Satellite Images using Hybrid Method (하이브리드 분류기법을 이용한 위성영상의 분류에 관한 연구)

  • Jeon, Young-Joon;Kim, Jin-Il
    • The KIPS Transactions:PartB
    • /
    • v.11B no.2
    • /
    • pp.159-168
    • /
    • 2004
  • This paper presents hybrid classification method to improve the performance of satellite images classification by combining Bayesian maximum likelihood classifier, ISODATA clustering and fuzzy C-Means algorithm. In this paper, the training data of each class were generated by separating the spectral signature using ISODATA clustering. We can classify according to pixel's membership grade followed by cluster center of fuzzy C-Means algorithm as the mean value of training data for each class. Bayesian maximum likelihood classifier is performed with prior probability by result of fuzzy C-Means classification. The results shows that proposed method could improve performance of classification method and also perform classification with no concern about spectral signature of the training data. The proposed method Is applied to a Landsat TM satellite image for the verifying test.

Big Numeric Data Classification Using Grid-based Bayesian Inference in the MapReduce Framework

  • Kim, Young Joon;Lee, Keon Myung
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.14 no.4
    • /
    • pp.313-321
    • /
    • 2014
  • In the current era of data-intensive services, the handling of big data is a crucial issue that affects almost every discipline and industry. In this study, we propose a classification method for large volumes of numeric data, which is implemented in a distributed programming framework, i.e., MapReduce. The proposed method partitions the data space into a grid structure and it then models the probability distributions of classes for grid cells by collecting sufficient statistics using distributed MapReduce tasks. The class labeling of new data is achieved by k-nearest neighbor classification based on Bayesian inference.

Variational Bayesian multinomial probit model with Gaussian process classification on mice protein expression level data (가우시안 과정 분류에 대한 변분 베이지안 다항 프로빗 모형: 쥐 단백질 발현 데이터에의 적용)

  • Donghyun Son;Beom Seuk Hwang
    • The Korean Journal of Applied Statistics
    • /
    • v.36 no.2
    • /
    • pp.115-127
    • /
    • 2023
  • Multinomial probit model is a popular model for multiclass classification and choice model. Markov chain Monte Carlo (MCMC) method is widely used for estimating multinomial probit model, but its computational cost is high. However, it is well known that variational Bayesian approximation is more computationally efficient than MCMC, because it uses subsets of samples. In this study, we describe multinomial probit model with Gaussian process classification and how to employ variational Bayesian approximation on the model. This study also compares the results of variational Bayesian multinomial probit model to the results of naive Bayes, K-nearest neighbors and support vector machine for the UCI mice protein expression level data.

Hyperparameter Search for Facies Classification with Bayesian Optimization (베이지안 최적화를 이용한 암상 분류 모델의 하이퍼 파라미터 탐색)

  • Choi, Yonguk;Yoon, Daeung;Choi, Junhwan;Byun, Joongmoo
    • Geophysics and Geophysical Exploration
    • /
    • v.23 no.3
    • /
    • pp.157-167
    • /
    • 2020
  • With the recent advancement of computer hardware and the contribution of open source libraries to facilitate access to artificial intelligence technology, the use of machine learning (ML) and deep learning (DL) technologies in various fields of exploration geophysics has increased. In addition, ML researchers have developed complex algorithms to improve the inference accuracy of various tasks such as image, video, voice, and natural language processing, and now they are expanding their interests into the field of automatic machine learning (AutoML). AutoML can be divided into three areas: feature engineering, architecture search, and hyperparameter search. Among them, this paper focuses on hyperparamter search with Bayesian optimization, and applies it to the problem of facies classification using seismic data and well logs. The effectiveness of the Bayesian optimization technique has been demonstrated using Vincent field data by comparing with the results of the random search technique.

A Study on Data Classification of Raman OIM Hyperspectral Bone Data

  • Jung, Sung-Hwan
    • Journal of Korea Multimedia Society
    • /
    • v.14 no.8
    • /
    • pp.1010-1019
    • /
    • 2011
  • This was a preliminary research for the goal of understanding between internal structure of Osteogenesis Imperfecta Murine (OIM) bone and its fragility. 54 hyperspectral bone data sets were captured by using JASCO 2000 Raman spectrometer at UMKC-CRISP (University of Missouri-Kansas City Center for Research on Interfacial Structure and Properties). Each data set consists of 1,091 data points from 9 OIM bones. The original captured hyperspectral data sets were noisy and base-lined ones. We removed the noise and corrected the base-lined data for the final efficient classification. High dimensional Raman hyperspectral data on OIM bones was reduced by Principal Components Analysis (PCA) and Linear Discriminant Analysis (LDA) and efficiently classified for the first time. We confirmed OIM bones could be classified such as strong, middle and weak one by using the coefficients of their PCA or LDA. Through experiment, we investigated the efficiency of classification on the reduced OIM bone data by the Bayesian classifier and K -Nearest Neighbor (K-NN) classifier. As the experimental result, the case of LDA reduction showed higher classification performance than that of PCA reduction in the two classifiers. K-NN classifier represented better classification rate, compared with Bayesian classifier. The classification performance of K-NN was about 92.6% in case of LDA.

Pattern Classification by Using Bayesian GTM (베이지안 GTM을 이용한 패턴 분류)

  • 최준혁;김중배;김대수;임기욱
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 2001.12a
    • /
    • pp.287-290
    • /
    • 2001
  • Bishop이 제안한 generative Topographic Mapping(GTM)은 Kohonen이 제안한 자율 학습 신경망인 Self Organizing Maps(SOM)의 확률적 버전이다. 본 논문에서는 이러한 GTM 모형에 베이지안 추론을 결합하여 작은 오분류율을 가지는 분류 알고리즘인 베이지안 GTM(Bayesian GTM)을 제안한다. 이 방법은 기존의 GTM의 빠른 계산 처리 능력과 베이지안 추론을 이용하여 기존의 분류 알고리즘보다 우수한 결과가 나타남을 실험을 통하여 확인하였다.

  • PDF

Classification of Transient Signals in Ocean Background Noise Using Bayesian Classifier (베이즈 분류기를 이용한 수중 배경소음하의 과도신호 분류)

  • Kim, Ju-Ho;Bok, Tae-Hoon;Paeng, Dong-Guk;Bae, Jin-Ho;Lee, Chong-Hyun;Kim, Seong-Il
    • Journal of Ocean Engineering and Technology
    • /
    • v.26 no.4
    • /
    • pp.57-63
    • /
    • 2012
  • In this paper, a Bayesian classifier based on PCA (principle component analysis) is proposed to classify underwater transient signals using $16^{th}$ order LPC (linear predictive coding) coefficients as feature vector. The proposed classifier is composed of two steps. The mechanical signals were separated from biological signals in the first step, and then each type of the mechanical signal was recognized in the second step. Three biological transient signals and two mechanical signals were used to conduct experiments. The classification ratios for the feature vectors of biological signals and mechanical signals were 94.75% and 97.23%, respectively, when all 16 order LPC vector were used. In order to determine the effect of underwater noise on the classification performance, underwater ambient noise was added to the test signals and the classification ratio according to SNR (signal-to-noise ratio) was compared by changing dimension of feature vector using PCA. The classification ratios of the biological and mechanical signals under ocean ambient noise at 10dB SNR, were 0.51% and 100% respectively. However, the ratios were changed to 53.07% and 83.14% when the dimension of feature vector was converted to three by applying PCA. For correct, classification, it is required SNR over 10 dB for three dimension feature vector and over 30dB SNR for seven dimension feature vector under ocean ambient noise environment.

Automatic Sputum Color Image Segmentation for Lung Cancer Diagnosis

  • Taher, Fatma;Werghi, Naoufel;Al-Ahmad, Hussain
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.7 no.1
    • /
    • pp.68-80
    • /
    • 2013
  • Lung cancer is considered to be the leading cause of cancer death worldwide. A technique commonly used consists of analyzing sputum images for detecting lung cancer cells. However, the analysis of sputum is time consuming and requires highly trained personnel to avoid errors. The manual screening of sputum samples has to be improved by using image processing techniques. In this paper we present a Computer Aided Diagnosis (CAD) system for early detection and diagnosis of lung cancer based on the analysis of the sputum color image with the aim to attain a high accuracy rate and to reduce the time consumed to analyze such sputum samples. In order to form general diagnostic rules, we present a framework for segmentation and extraction of sputum cells in sputum images using respectively, a Bayesian classification method followed by region detection and feature extraction techniques to determine the shape of the nuclei inside the sputum cells. The final results will be used for a (CAD) system for early detection of lung cancer. We analyzed the performance of a Bayesian classification with respect to the color space representation and quantification. Our methods were validated via a series of experimentation conducted with a data set of 100 images. Our evaluation criteria were based on sensitivity, specificity and accuracy.

An Implementation of Pan-So-Ri Classification Program Using Naive Bayesian Classifier (나이브 베이지안 분류기를 이용한 판소리 분류 프로그램 구현)

  • Kim, Won-Jong;Lee, Kang-Bok;Kim, Myung-Gwan
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.11 no.3
    • /
    • pp.153-159
    • /
    • 2011
  • Pan-So-Ri singing a story as song is one of Korea traditional musics. it divide into two sect(east-sect, west-sect), and it is hard to classify two sect without knowledge about Pan-So-Ri. In this paper, we have propose a Pan-So-Ri classification program using PCD(Pitch Class Distribution) and Naive Bayesian Classifier. Attribute value of classifier is each appearance frequency of pitch. Experiment is conducted two time with different rounding off location of probability value. Better one show correct classification with east-sect 80%, west-sect 97%, and total accuracy of 88%. this result is used our program.

An Anomaly Detection Framework Based on ICA and Bayesian Classification for IaaS Platforms

  • Wang, GuiPing;Yang, JianXi;Li, Ren
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.10 no.8
    • /
    • pp.3865-3883
    • /
    • 2016
  • Infrastructure as a Service (IaaS) encapsulates computer hardware into a large amount of virtual and manageable instances mainly in the form of virtual machine (VM), and provides rental service for users. Currently, VM anomaly incidents occasionally occur, which leads to performance issues and even downtime. This paper aims at detecting anomalous VMs based on performance metrics data of VMs. Due to the dynamic nature and increasing scale of IaaS, detecting anomalous VMs from voluminous correlated and non-Gaussian monitored performance data is a challenging task. This paper designs an anomaly detection framework to solve this challenge. First, it collects 53 performance metrics to reflect the running state of each VM. The collected performance metrics are testified not to follow the Gaussian distribution. Then, it employs independent components analysis (ICA) instead of principal component analysis (PCA) to extract independent components from collected non-Gaussian performance metric data. For anomaly detection, it employs multi-class Bayesian classification to determine the current state of each VM. To evaluate the performance of the designed detection framework, four types of anomalies are separately or jointly injected into randomly selected VMs in a campus-wide testbed. The experimental results show that ICA-based detection mechanism outperforms PCA-based and LDA-based detection mechanisms in terms of sensitivity and specificity.