• 제목/요약/키워드: statistical classifier

검색결과 158건 처리시간 0.029초

Naive Bayes classifiers boosted by sufficient dimension reduction: applications to top-k classification

  • Yang, Su Hyeong;Shin, Seung Jun;Sung, Wooseok;Lee, Choon Won
    • Communications for Statistical Applications and Methods
    • /
    • 제29권5호
    • /
    • pp.603-614
    • /
    • 2022
  • The naive Bayes classifier is one of the most straightforward classification tools and directly estimates the class probability. However, because it relies on the independent assumption of the predictor, which is rarely satisfied in real-world problems, its application is limited in practice. In this article, we propose employing sufficient dimension reduction (SDR) to substantially improve the performance of the naive Bayes classifier, which is often deteriorated when the number of predictors is not restrictively small. This is not surprising as SDR reduces the predictor dimension without sacrificing classification information, and predictors in the reduced space are constructed to be uncorrelated. Therefore, SDR leads the naive Bayes to no longer be naive. We applied the proposed naive Bayes classifier after SDR to build a recommendation system for the eyewear-frames based on customers' face shape, demonstrating its utility in the top-k classification problem.

Nomogram for screening the risk of developing metabolic syndrome using naïve Bayesian classifier

  • Minseok Shin;Jeayoung Lee
    • Communications for Statistical Applications and Methods
    • /
    • 제30권1호
    • /
    • pp.21-35
    • /
    • 2023
  • Metabolic syndrome is a serious disease that can eventually lead to various complications, such as stroke and cardiovascular disease. In this study, we aimed to identify the risk factors related to metabolic syndrome for its prevention and recognition and propose a nomogram that visualizes and predicts the probability of the incidence of metabolic syndrome. We conducted an analysis using data from the Korea National Health and Nutrition Survey (KNHANES VII) and identified 10 risk factors affecting metabolic syndrome by using the Rao-Scott chi-squared test, considering the characteristics of the complex sample. A naïve Bayesian classifier was used to build a nomogram for metabolic syndrome. We then predicted the incidence of metabolic syndrome using the nomogram. Finally, we verified the nomogram using a receiver operating characteristic curve and a calibration plot.

Modifying linearly non-separable support vector machine binary classifier to account for the centroid mean vector

  • Mubarak Al-Shukeili;Ronald Wesonga
    • Communications for Statistical Applications and Methods
    • /
    • 제30권3호
    • /
    • pp.245-258
    • /
    • 2023
  • This study proposes a modification to the objective function of the support vector machine for the linearly non-separable case of a binary classifier yi ∈ {-1, 1}. The modification takes into account the position of each data item xi from its corresponding class centroid. The resulting optimization function involves the centroid mean vector, and the spread of data besides the support vectors, which should be minimized by the choice of hyper-plane β. Theoretical assumptions have been tested to derive an optimal separable hyperplane that yields the minimal misclassification rate. The proposed method has been evaluated using simulation studies and real-life COVID-19 patient outcome hospitalization data. Results show that the proposed method performs better than the classical linear SVM classifier as the sample size increases and is preferred in the presence of correlations among predictors as well as among extreme values.

LCD 패널 상의 불량 검출을 위한 스펙트럴 그래프 이론에 기반한 특성 추출 방법 (Feature extraction method using graph Laplacian for LCD panel defect classification)

  • 김규동;유석인
    • 한국정보과학회:학술대회논문집
    • /
    • 한국정보과학회 2012년도 한국컴퓨터종합학술대회논문집 Vol.39 No.1(B)
    • /
    • pp.522-524
    • /
    • 2012
  • For exact classification of the defect, good feature selection and classifier is necessary. In this paper, various features such as brightness features, shape features and statistical features are stated and Bayes classifier using Gaussian mixture model is used as classifier. Also feature extraction method based on spectral graph theory is presented. Experimental result shows that feature extraction method using graph Laplacian result in better performance than the result using PCA.

주성분 분석을 이용한 목재 건조 중 발생하는 음향방출 신호의 해석 및 분류 (Analysis and Classification of Acoustic Emission Signals During Wood Drying Using the Principal Component Analysis)

  • 강호양;김기복
    • 비파괴검사학회지
    • /
    • 제23권3호
    • /
    • pp.254-262
    • /
    • 2003
  • 본 연구는 목재(참나무 판목 판재) 건조 중 발생하는 음향방출 신호에 대하여 목재 내 수분이동에 의한 신호와 표면할열에 의한 신호를 해석하고 분류하기 위하여 수행되었다. AE 신호의 특징값들에 대한 상관분석을 실시하여 상호의존성이 높은 변수를 제거한 후 주성분 분석을 실시하였다. AE 변수들을 독립변수로 한 분류기와 주성분들을 독립변수로 한 분류기에 대하여 분류성능을 비교하였다. 목재 건조 시 발생하는 표면할열과 수분이동에 따른 AE 신호 파형을 분석한 결과 대체적으로 표면할열에 의한 신호가 최대진폭이 크며 상승시간이 팎고 상대적으로 고주파의 신호인 것으로 분석되었다. 다중 회귀분석모델을 이용하여 수분이동에 의한 신호와 표면할열에 의한 신호를 분류할 수 있는 분류기를 개발하고 평가한 결과 개별 AE 변수들을 독립변수로 하는 분류기 보다 주성분들을 독립변수로 하는 분류기의 분류성능이 양호한 것으로 나타났다.

용접결함의 패턴인식을 위한 디지털 신호처리에 관한 연구 (A Study on the Digital Signal Processing for the Pattern fiecognition of Weld Flaws)

  • 김재열;송찬일;김병현
    • 한국정밀공학회:학술대회논문집
    • /
    • 한국정밀공학회 1995년도 추계학술대회 논문집
    • /
    • pp.393-396
    • /
    • 1995
  • In this syudy, the researches classifying the artificial and natural flaws in welding parts are performed using the smart pattern recognition technology. For this purpose the smart signal pattern recognition package including the user defined function was developed and the total procedure including the digital signal processing,feature extraction , feature selection and classifier selection is treated by bulk. Specially it is composed with and discussed using the statistical classifier such as the linear disciminant function classifier, the empirical Bayesian classifier. Also, the smart pattern recognition technology is applied to classification problem of natural flaw(i.e multiple classification problem-crack,lack of penetration,lack of fusion,porosity,and slag inclusion, the planar and volumetric flaw classification problem). According to this results, if appropriately learned the neural network classifier is better than ststistical classifier in the classification problem of natural flaw. And it is possible to acquire the recognition rate of 80% above through it is different a little according to domain extracting the feature and the classifier.

  • PDF

적응형 AE신호 형상 인식 프로그램 개발자 회전체 금속 접촉부 이상 분류에 관한 적용 연구 (Development of Adaptive AE Signal Pattern Recognition Program and Application to Classification of Defects in Metal Contact Regions of Rotating Component)

  • 이강용;이종명;김준섭
    • 비파괴검사학회지
    • /
    • 제15권4호
    • /
    • pp.520-530
    • /
    • 1996
  • 본 연구에서는 음향방출법을 이용하여 로터리 압축기의 인공 결함을 분류하기 위한 연구를 수행하였다. 이를 위해 프로그램을 개발하였고 선형 분류기, 경험적 Bayesian 분류기, 신경 회로망 분류기를 함께 사용하여 비교하였다. 그 결과 신경 회로망 분류기가 인식률 면에서 유리하였으며 신경 회로망 분류기의 경우 99%이상의 인식률을 얻을 수 있었다.

  • PDF

동시 발생 행렬의 특성함수 모멘트를 이용한 접합 영상 검출 (Spliced Image Detection Using Characteristic Function Moments of Co-occurrence Matrix)

  • 박태희;문용호;엄일규
    • 대한임베디드공학회논문지
    • /
    • 제10권5호
    • /
    • pp.265-272
    • /
    • 2015
  • This paper presents an improved feature extraction method to achieve a good performance in the detection of splicing forged images. Strong edges caused by the image splicing destroy the statistical dependencies between parent and child subbands in the wavelet domain. We analyze the co-occurrence probability matrix of parent and child subbands in the wavelet domain, and calculate the statistical moments from two-dimensional characteristic function of the co-occurrence matrix. The extracted features are used as the input of SVM classifier. Experimental results show that the proposed method obtains a good performance with a small number of features compared to the existing methods.

A Comparative Study of Phishing Websites Classification Based on Classifier Ensemble

  • Tama, Bayu Adhi;Rhee, Kyung-Hyune
    • 한국멀티미디어학회논문지
    • /
    • 제21권5호
    • /
    • pp.617-625
    • /
    • 2018
  • Phishing website has become a crucial concern in cyber security applications. It is performed by fraudulently deceiving users with the aim of obtaining their sensitive information such as bank account information, credit card, username, and password. The threat has led to huge losses to online retailers, e-business platform, financial institutions, and to name but a few. One way to build anti-phishing detection mechanism is to construct classification algorithm based on machine learning techniques. The objective of this paper is to compare different classifier ensemble approaches, i.e. random forest, rotation forest, gradient boosted machine, and extreme gradient boosting against single classifiers, i.e. decision tree, classification and regression tree, and credal decision tree in the case of website phishing. Area under ROC curve (AUC) is employed as a performance metric, whilst statistical tests are used as baseline indicator of significance evaluation among classifiers. The paper contributes the existing literature on making a benchmark of classifier ensembles for web phishing detection.

미소결함의 형상인식을 위한 디지털 신호처리 적용에 관한 연구 (A Study on the Application of Digital Signal Processing for Pattern Recognition of Microdefects)

  • 홍석주
    • 한국생산제조학회지
    • /
    • 제9권1호
    • /
    • pp.119-127
    • /
    • 2000
  • In this study the classified researches the artificial and natural flaws in welding parts are performed using the pattern recognition technology. For this purpose the signal pattern recognition package including the user defined function was developed and the total procedure including the digital signal processing feature extraction feature selection and classifi-er selection is teated by bulk,. Specially it is composed with and discussed using the statistical classifier such as the linear discriminant function the empirical Bayesian classifier. Also the pattern recognition technology is applied to classifica-tion problem of natural flaw(i.e multiple classification problem-crack lack of penetration lack of fusion porosity and slag inclusion the planar and volumetric flaw classification problem), According to this result it is possible to acquire the recognition rate of 83% above even through it is different a little according to domain extracting the feature and the classifier.

  • PDF