• Title/Summary/Keyword: Bayes Classifier

Search Result 150, Processing Time 0.024 seconds

An Experimental Study on Fault Detection and Diagnosis Method for a Water Chiller Using Bayes Classifier (베이즈 분류기를 이용한 수냉식 냉동기의 고장 진단 방법에 관한 실험적 연구)

  • Lee, Heung-Ju;Chang, Young-Soo;Kang, Byung-Ha
    • Proceedings of the SAREK Conference
    • /
    • 2008.06a
    • /
    • pp.36-41
    • /
    • 2008
  • Fault detection and diagnosis(FDD) system is beneficial in equipment management by providing the operator with tools which can help find out a failure of the system. An experimental study has been performed on fault detection and diagnosis method for a water chiller. Bayes classifier, which is one of classical pattern classifiers, is adopted in deciding whether fault occurred or not. FDD algorithm can detect refrigerant leak failure, when 20% amount of charged refrigerant for normal operation leaks from the water chiller. The refrigerant leak failure caused COP reduction by 6.7% compared with normal operation performance. When two kinds of faults, such as a decrease in the mass flow rate of cooling water and temperature sensor fault of cooling water inlet, are detected, COP is a little decreased by these faults.

  • PDF

Selecting Machine Learning Model Based on Natural Language Processing for Shanghanlun Diagnostic System Classification (자연어 처리 기반 『상한론(傷寒論)』 변병진단체계(辨病診斷體系) 분류를 위한 기계학습 모델 선정)

  • Young-Nam Kim
    • 대한상한금궤의학회지
    • /
    • v.14 no.1
    • /
    • pp.41-50
    • /
    • 2022
  • Objective : The purpose of this study is to explore the most suitable machine learning model algorithm for Shanghanlun diagnostic system classification using natural language processing (NLP). Methods : A total of 201 data items were collected from 『Shanghanlun』 and 『Clinical Shanghanlun』, 'Taeyangbyeong-gyeolhyung' and 'Eumyangyeokchahunobokbyeong' were excluded to prevent oversampling or undersampling. Data were pretreated using a twitter Korean tokenizer and trained by logistic regression, ridge regression, lasso regression, naive bayes classifier, decision tree, and random forest algorithms. The accuracy of the models were compared. Results : As a result of machine learning, ridge regression and naive Bayes classifier showed an accuracy of 0.843, logistic regression and random forest showed an accuracy of 0.804, and decision tree showed an accuracy of 0.745, while lasso regression showed an accuracy of 0.608. Conclusions : Ridge regression and naive Bayes classifier are suitable NLP machine learning models for the Shanghanlun diagnostic system classification.

  • PDF

Text Document Classification Scheme using TF-IDF and Naïve Bayes Classifier (TF-IDF와 Naïve Bayes 분류기를 활용한 문서 분류 기법)

  • Yoo, Jong-Yeol;Hyun, Sang-Hyun;Yang, Dong-Min
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2015.10a
    • /
    • pp.242-245
    • /
    • 2015
  • Recently due to large-scale data spread in digital economy, the era of big data is coming. Through big data, unstructured text data consisting of technical text document, confidential document, false information documents are experiencing serious problems in the runoff. To prevent this, the need of art to sort and process the document consisting of unstructured text data has increased. In this paper, we propose a novel text classification scheme which learns some data sets and correctly classifies unstructured text data into two different categories, True and False. For the performance evaluation, we implement our proposed scheme using $Na{\ddot{i}}ve$ Bayes document classifier and TF-IDF modules in Python library, and compare it with the existing document classifier.

  • PDF

A Novel Method for a Reliable Classifier using Gradients

  • Han, Euihwan;Cha, Hyungtai
    • IEIE Transactions on Smart Processing and Computing
    • /
    • v.6 no.1
    • /
    • pp.18-20
    • /
    • 2017
  • In this paper, we propose a new classification method to complement a $na{\ddot{i}}ve$ Bayesian classifier. This classifier assumes data distribution to be Gaussian, finds the discriminant function, and derives the decision curve. However, this method does not investigate finding the decision curve in much detail, and there are some minor problems that arise in finding an accurate discriminant function. Our findings also show that this method could produce errors when finding the decision curve. The aim of this study has therefore been to investigate existing problems and suggest a more reliable classification method. To do this, we utilize the gradient to find the decision curve. We then compare/analyze our algorithm with the $na{\ddot{i}}ve$ Bayesian method. Performance evaluation indicates that the average accuracy of our classification method is about 10% higher than $na{\ddot{i}}ve$ Bayes.

Modified Na$\ddot{i}$ve Bayes Classifier for Categorizing Questions in Question-Answering Community (확장된 나이브 베이즈 분류기를 활용한 질문-답변 커뮤니티의 질문 분류)

  • Yeon, Jong-Heum;Shim, Jun-Ho;Lee, Sang-Goo
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.16 no.1
    • /
    • pp.95-99
    • /
    • 2010
  • Social media refers to the content, which are created by users, such as blogs, social networks, and wikis. Recently, question-answering (QA) communities, in which users share information by questions and answers, are regarded as a kind of social media. Thus, QA communities have become a huge source of information for the past decade. However, it is hard for users to search the exact question-answer that is exactly matched with their needs as the number of question-answers increases in QA communities. This paper proposes an approach for classifying a question into three categories (information, opinion, and suggestion) according to the purpose of the question for more accurate information retrieval. Specifically, our approach is based on modified Na$\ddot{i}$ve Bayes classifier which uses structural characteristics of QA documents to improve the classification accuracy. Through our experiments, we achieved about 71.2% in classification accuracy.

Metalevel Data Mining through Multiple Classifier Fusion (다수 분류기를 이용한 메타레벨 데이터마이닝)

  • 김형관;신성우
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 1999.10b
    • /
    • pp.551-553
    • /
    • 1999
  • This paper explores the utility of a new classifier fusion approach to discrimination. Multiple classifier fusion, a popular approach in the field of pattern recognition, uses estimates of each individual classifier's local accuracy on training data sets. In this paper we investigate the effectiveness of fusion methods compared to individual algorithms, including the artificial neural network and k-nearest neighbor techniques. Moreover, we propose an efficient meta-classifier architecture based on an approximation of the posterior Bayes probabilities for learning the oracle.

  • PDF

Study of Machine-Learning Classifier and Feature Set Selection for Intent Classification of Korean Tweets about Food Safety

  • Yeom, Ha-Neul;Hwang, Myunggwon;Hwang, Mi-Nyeong;Jung, Hanmin
    • Journal of Information Science Theory and Practice
    • /
    • v.2 no.3
    • /
    • pp.29-39
    • /
    • 2014
  • In recent years, several studies have proposed making use of the Twitter micro-blogging service to track various trends in online media and discussion. In this study, we specifically examine the use of Twitter to track discussions of food safety in the Korean language. Given the irregularity of keyword use in most tweets, we focus on optimistic machine-learning and feature set selection to classify collected tweets. We build the classifier model using Naive Bayes & Naive Bayes Multinomial, Support Vector Machine, and Decision Tree Algorithms, all of which show good performance. To select an optimum feature set, we construct a basic feature set as a standard for performance comparison, so that further test feature sets can be evaluated. Experiments show that precision and F-measure performance are best when using a Naive Bayes Multinomial classifier model with a test feature set defined by extracting Substantive, Predicate, Modifier, and Interjection parts of speech.

Performance Analysis of Mulitilayer Neural Net Claddifiers Using Simulated Pattern-Generating Processes (모의 패턴생성 프로세스를 이용한 다단신경망분류기의 성능분석)

  • Park, Dong-Seon
    • The Transactions of the Korea Information Processing Society
    • /
    • v.4 no.2
    • /
    • pp.456-464
    • /
    • 1997
  • We describe a random prcess model that prvides sets of patterms whth prcisely contrlolled within-class varia-bility and between-class distinctions.We used these pattems in a simulation study wity the back-propagation netwoek to chracterize its perfotmance as we varied the process-controlling parameters,the statistical differences between the processes,and the random noise on the patterns.Our results indicated that grneralized statistical difference between the processes genrating the patterns provided a good predictor of the difficulty of the clssi-fication problem. Also we analyzed the performance of the Bayes classifier whith the maximum-likeihood cri-terion and we compared the performance of the neural network to that of the Bayes classifier.We found that the performance of neural network was intermediate between that of the simulated and theoretical Bayes classifier.

  • PDF

An Experimental Study on Fault Detection and Diagnosis Method for a Water Chiller Using Bayes Classifier (베이즈 분류기를 이용한 수냉식 냉동기의 고장 진단 방법에 관한 실험적 연구)

  • Lee, Heung-Ju;Chang, Young-Soo;Kang, Byung-Ha
    • Korean Journal of Air-Conditioning and Refrigeration Engineering
    • /
    • v.20 no.7
    • /
    • pp.508-516
    • /
    • 2008
  • Fault detection and diagnosis(FDD) system is beneficial in equipment management by providing the operator with tools which can help find out a failure of the system. An experimental study has been performed on fault detection and diagnosis method for a water chiller. Bayes classifier, which is one of classical pattern classifiers, is adopted in deciding whether fault occurred or not. Failure modes in this study include refrigerant leakage, decrease in mass flow rate of the chilled water and cooling water, and sensor error of the cooling water inlet temperature. It is possible to detect and diagnose faults in this study by adopting FDD algorithm using only four parameters(compressor outlet temperature, chilled water inlet temperature, cooling water outlet temperature and compressor power consumption). Refrigerant leakage failure is detected at 20% of refrigerant leakage. When mass flow rate of the chilled and cooling water decrease more than 8% or 12%, FDD algorithm can detect the faults. The deviation of temperature sensor over $0.6^{\circ}C$ can be detected as fault.

Development of Visual Inspection Process Adapting Naive Bayes Classifiers (나이브 베이즈 분류기를 적용한 외관검사공정 개발)

  • Ryu, Sun-Joong
    • Journal of the Korean Institute of Gas
    • /
    • v.19 no.2
    • /
    • pp.45-53
    • /
    • 2015
  • In order to improve the performance of the visual inspection process, in addition to existing automatic visual inspection machine and human inspectors have developed a new process configuration using a Naive Bayes classifier. By applying the classifier, defect leakage and human inspector's work amount could be improved at the same time. New classification method called AMPB was applied instead of conventional methods based on MAP classification. By experimental results using the filter product for camera modules, it was confirmed that it is possible to configure the process at the level of leakage ratio 1.14% and human inspector's work amount ratio 75.5%. It is significant that the result can be applied in such a wide range as gas leak detection which is the collaboration process between inspection machine and human inspector's