• Title/Summary/Keyword: 규칙기반 분류분석 (rule-based classification analysis)


Rule-Based Classification Analysis Using Entropy Distribution (엔트로피 분포를 이용한 규칙기반 분류분석 연구)

  • Lee, Jung-Jin;Park, Hae-Ki
    • Communications for Statistical Applications and Methods / v.17 no.4 / pp.527-540 / 2010
  • Rule-based classification analysis is widely used for large-scale data mining because it is easy to understand and its algorithm is uncomplicated. In this type of analysis, rules are frequently combined by a majority vote or by a weighted combination based on their supports. In this paper, we propose a method that combines rules by using the multinomial distribution. An iterative proportional fitting algorithm is used to estimate the multinomial distribution that maximizes entropy subject to constraints on the rules' supports. Simulation experiments show that this method can compete with other well-known classification models in the case of two similar populations.
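The rule-combination idea in this abstract can be illustrated with a minimal sketch of iterative proportional fitting. It assumes each rule "antecedent implies class" contributes one constraint: the joint probability of antecedent and class must equal the rule's support. The three binary variables, the support values, and the function name `ipf_max_entropy` are hypothetical and are not taken from the paper.

```python
from itertools import product

def ipf_max_entropy(cells, constraints, iterations=200):
    """Fit a distribution over `cells` whose mass on each constraint set
    matches its target, starting from the uniform distribution."""
    p = {c: 1.0 / len(cells) for c in cells}
    for _ in range(iterations):
        for members, target in constraints:
            inside = sum(p[c] for c in cells if c in members)
            outside = sum(p[c] for c in cells if c not in members)
            for c in cells:
                # Rescale cells inside the set to the target mass and the
                # remaining cells to the complementary mass.
                if c in members:
                    p[c] *= target / inside
                else:
                    p[c] *= (1.0 - target) / outside
    return p

# Cells are (a, b, y): two binary features and a binary class label.
cells = list(product((0, 1), repeat=3))
constraints = [
    ({c for c in cells if c[0] == 1 and c[2] == 1}, 0.30),  # support of "a=1 -> y=1" (assumed)
    ({c for c in cells if c[1] == 1 and c[2] == 1}, 0.25),  # support of "b=1 -> y=1" (assumed)
    ({c for c in cells if c[2] == 1}, 0.50),                # class prior P(y=1) (assumed)
]
fitted = ipf_max_entropy(cells, constraints)

# Classify a case with a=1 and b=1 by its posterior class probability.
posterior = fitted[(1, 1, 1)] / (fitted[(1, 1, 1)] + fitted[(1, 1, 0)])
print(round(posterior, 4))
```

Starting from the uniform distribution matters here: the fitted table is then the one closest to uniform among those that satisfy the constraints, which is the maximum-entropy solution when the constraints are consistent.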

Rhythm Classification of ECG Signal by Rule and SVM Based Algorithm (규칙 및 SVM 기반 알고리즘에 의한 심전도 신호의 리듬 분류)

  • Kim, Sung-Oan;Kim, Dae-Hwan
    • Journal of the Korea Society of Computer and Information / v.18 no.9 / pp.43-51 / 2013
  • A classification result based on comprehensive analysis of both the rhythm section and the heartbeat unit makes a reliable diagnosis of heart disease possible. In this paper, based on feature points of ECG signals, rhythm analysis is conducted for the constant section using rule-based classification and for the heartbeat unit using SVM-based classification. In the rule-based classification, rhythm types are classified using a rule base deduced from clinical materials for features of the rhythm section. In the SVM-based classification, monotonic rhythm or major abnormal heartbeats are classified using multiple previously trained SVMs for features of the heartbeat unit. Experimental results on the MIT-BIH arrhythmia database show classification ratios of 68.52% for the rule-based method alone and 87.04% for the fusion of the rule-based and SVM-based methods over 11 rhythm types. The proposed fusion method improves the ratio by about 19 percentage points, because the SVM-based method reduces misclassification of monotonic and arrangement rhythms.

A Rule-Based Image Classification Method for Analysis of Urban Development in the Capital Area (수도권 도시개발 분석을 위한 규칙기반 영상분류)

  • Lee, Jin-A;Lee, Sung-Soon
    • Spatial Information Research / v.19 no.6 / pp.43-54 / 2011
  • This study proposes a rule-based image classification method for the time-series analysis of changes in the land surface of the Seongnam-Yongin area using satellite-image data from 2000 to 2009. In order to identify the change patterns during each period, 11 classes were employed in accordance with statistical/mathematical rules. A generalized algorithm was used so that the rules could be applied to an unsupervised classification method that does not establish any training sites. The results showed that the urban area of the target region increased by 145% due to housing-site development. The image data from 2009 had a classification accuracy of 98%. For method verification, the results were compared to land-cover changes obtained through post-classification comparison. The maximum utilization of the available data within multiple images and the optimized classification allowed for an improvement in the classification accuracy. The proposed rule-based image-classification method is expected to be widely employed for the time-series analysis of images to produce a thematic map for urban development and to monitor urban development and environmental change.

P2P Traffic Classification using Advanced Heuristic Rules and Analysis of Decision Tree Algorithms (개선된 휴리스틱 규칙 및 의사 결정 트리 분석을 이용한 P2P 트래픽 분류 기법)

  • Ye, Wujian;Cho, Kyungsan
    • Journal of the Korea Society of Computer and Information / v.19 no.3 / pp.45-54 / 2014
  • In this paper, an improved two-step P2P traffic classification scheme is proposed to overcome the limitations of the existing methods. The first step is a signature-based classifier at the packet level. The second step consists of pattern heuristic rules and a statistics-based classifier at the flow level. With pattern heuristic rules, the accuracy can be improved and the amount of traffic to be classified by the statistics-based classifier can be reduced. Based on the analysis of different decision tree algorithms, the statistics-based classifier is implemented with REPTree. In addition, an ensemble algorithm is used to improve the performance of the statistics-based classifier. Verification with real datasets shows that our hybrid scheme provides higher accuracy and lower overhead than other existing schemes.
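The two-step structure described in this abstract can be sketched as a cascade in which each stage handles only the flows that the previous stage could not decide. This is a rough illustration, not the authors' implementation: the flow fields, the heuristic rule, and the numeric thresholds are all hypothetical, and the final threshold test merely stands in for a trained decision tree such as REPTree. Only the BitTorrent handshake prefix is a real protocol signature.

```python
# The BitTorrent handshake starts with the byte 19 followed by this protocol string.
BITTORRENT_HANDSHAKE = b"\x13BitTorrent protocol"

def classify_flow(flow):
    """Return (label, stage) for one flow described by a dict of features."""
    # Step 1: packet-level signature match on the first payload.
    if flow["first_payload"].startswith(BITTORRENT_HANDSHAKE):
        return "p2p", "signature"
    # Step 2a: flow-level pattern heuristic (hypothetical rule).
    if flow["uses_tcp_and_udp_to_same_peer"]:
        return "p2p", "heuristic"
    # Step 2b: statistics-based fallback (hypothetical thresholds standing in
    # for a trained decision tree).
    if flow["mean_packet_size"] > 1000 and flow["distinct_peers"] > 20:
        return "p2p", "statistical"
    return "non-p2p", "statistical"

example = {
    "first_payload": b"GET /index.html HTTP/1.1\r\n",
    "uses_tcp_and_udp_to_same_peer": False,
    "mean_packet_size": 420.0,
    "distinct_peers": 2,
}
print(classify_flow(example))
```

Ordering the stages from cheapest to most expensive is what lets the earlier stages reduce the traffic that reaches the statistics-based classifier.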

Travel Time Prediction Algorithm for Trajectory data by using Rule-Based Classification on MapReduce (맵리듀스 환경에서 규칙 기반 분류화를 이용한 궤적 데이터 주행 시간 예측 알고리즘)

  • Kim, JaeWon;Lee, HyunJo;Chang, JaeWoo
    • Proceedings of the Korea Information Processing Society Conference / 2014.11a / pp.798-801 / 2014
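  • In trajectory-based services such as traveler information systems (ATIS) and traffic management systems (ITS), accurately predicting the travel time for a given trajectory query is essential to improving service quality. A representative spatial data analysis technique for this purpose is rule-based classification, which offers high accuracy in data classification. However, existing rule-based classification techniques consider only a single-machine environment and are therefore unsuitable for processing large-scale spatial data. To address this, we develop a travel time prediction algorithm for trajectory data that uses rule-based classification in a MapReduce environment. First, the proposed algorithm uses MapReduce to analyze large-scale spatial data in parallel and generate highly usable trajectory rules, which reduces rule generation time for large spatial datasets. Second, it partitions the map data with a grid structure to improve search performance during query processing: when searching rule groups for travel time prediction, only the grid cells that contain the query are searched. Finally, it includes a query processing algorithm designed for the MapReduce architecture to support efficient parallel query processing. The map function measures, in parallel, the travel time on the road segments included in the query for the selected grid cells. The reduce function merges the map outputs based on the departure time and the per-segment travel times to produce the final result. Together, these steps improve both the processing time and the accuracy of travel time prediction based on spatial big data analysis.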

A New Importance Measure of Association Rules Using Information Theory (정보이론에 기반한 연관 규칙들의 새로운 중요도 측정 방법)

  • Lee, Chang-Hwan;Bae, Joohyun
    • KIPS Transactions on Software and Data Engineering / v.3 no.1 / pp.37-42 / 2014

Rule Discovery for Cancer Classification using Genetic Programming based on Arithmetic Operators (산술 연산자 기반 유전자 프로그래밍을 이용한 암 분류 규칙 발견)

  • 홍진혁;조성배
    • Journal of KIISE:Software and Applications / v.31 no.8 / pp.999-1009 / 2004
  • As a new approach to the diagnosis of cancers, bioinformatics attracts great interest these days. Machine learning techniques have produced valuable results, but the field of medicine requires not only highly accurate classifiers but also effective analysis and interpretation of them. Since gene expression data in bioinformatics consist of tens of thousands of features, it is nearly impossible to represent their relations directly. In this paper, we propose a method composed of a feature selection method and genetic programming. Rank-based feature selection is adopted to select useful features, and genetic programming based on arithmetic operators is used to generate classification rules from the selected features. Experimental results on a lymphoma cancer dataset, in which the proposed method obtained 96.6% test accuracy as well as useful classification rules, show the validity of the proposed method.

Application of the Rule-Based Image Classification Method to Jeju Island (규칙기반 영상분류 방법의 제주도 지역의 적용)

  • Lee, Jin-A;Lee, Sung-Soon
    • Spatial Information Research / v.21 no.1 / pp.63-73 / 2013
  • Geographic features are reflected in satellite images, which contain characteristic elements. Information on changes can be obtained through a comparison of images taken at different times. If multi-temporal images can be classified through the use of an unsupervised method, this is likely to improve the accuracy of image classification and contribute to various applications. A rule-based image classification algorithm for automatic processing without human involvement has been developed, but it must be verified that its results are not affected by imperfect elements. In this study, Landsat images of Jeju Island were used to carry out a rule-based image classification. The application results were examined for complex cases, including the presence of clouds in the images, different photographed times, and the type of target area, such as city, mountain, or field. The presence of clouds did not affect calculations, and appropriate classification rules were applied, depending on the different photographed times. The expansion of the urban areas of Jeju and the increase of facilities such as vinyl greenhouses in Seoguipo were identified. Furthermore, space information changes and accurate classifications for Jeju Island were obtained. With the goal of performing high-quality unsupervised classifications, measures to generalize and improve the methods employed were searched for. The findings of this study could be used in time-series analyses of images for various applications, including urban development and environmental change monitoring.

Genetics-Based Machine Learning for Generating Classification Rule in Data Mining (데이터 마이닝의 분류 규칙 발견을 위한 유전자알고리즘 학습방법)

  • 김대희;박상호
    • Proceedings of the Korea Multimedia Society Conference / 2001.11a / pp.429-434 / 2001
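  • Amid today's flood of data and poverty of information, there is strong interest in automated data analysis that uses information technology to filter data, analyze it, and interpret the results; data mining is an application of information technology that meets this need. Classification in particular has become an important area of data mining. The core of a classification task is how to define appropriate decision rules, which requires an algorithm capable of learning. In this paper, we present a robust learning method based on a genetic algorithm and, through this learning, propose a classification system for data mining.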


Analyzing the Applicability of Greenhouse Detection Using Image Classification (영상분류에 의한 하우스재배지 탐지 활용성 분석)

  • Sung, Jeung Su;Lee, Sung Soon;Baek, Seung Hee
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography / v.30 no.4 / pp.397-404 / 2012
  • In Jeju, where the economy is concentrated on agriculture and tourism, outdoor cultivation is actively being converted to cultivation under structures in order to increase profit, so continuous monitoring of the greenhouse cultivation area is very important for this region. This paper suggests an effective image classification method that uses high-resolution satellite imagery to detect greenhouses. We classified greenhouses in Formosat-2 images using supervised classification and rule-based classification. We then combined the results of the two classifications to seek improved greenhouse detection accuracy. The accuracy of each classification method was calculated by comparison with the results of visual detection. Among the supervised methods, Mahalanobis distance yielded the highest detection rate. Detection accuracy also improved when the supervised result was combined with the rule-based result. Effective greenhouse detection is therefore expected to be feasible if further study is carried out on combining supervised and rule-based classification.
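For readers unfamiliar with the Mahalanobis distance classifier mentioned in this abstract, the following minimal sketch assigns a pixel to the class whose training statistics give the smallest squared Mahalanobis distance. It is restricted to two spectral bands so the covariance inverse can be written by hand. The class names, means, and covariance values are hypothetical and are not taken from the paper.

```python
def mahalanobis_sq(x, mean, cov):
    """Squared Mahalanobis distance of a 2-band pixel `x` from a class with
    the given mean and 2x2 covariance matrix."""
    (a, b), (c, d) = cov
    det = a * d - b * c
    if det == 0:
        raise ValueError("covariance matrix is singular")
    dx, dy = x[0] - mean[0], x[1] - mean[1]
    # The inverse of [[a, b], [c, d]] is [[d, -b], [-c, a]] / det.
    return (dx * (d * dx - b * dy) + dy * (-c * dx + a * dy)) / det

def classify_pixel(x, class_stats):
    """Return the class name with the smallest squared Mahalanobis distance."""
    return min(class_stats, key=lambda name: mahalanobis_sq(x, *class_stats[name]))

# Hypothetical per-class training statistics: (mean vector, covariance matrix).
class_stats = {
    "greenhouse": ((200.0, 180.0), ((100.0, 20.0), (20.0, 80.0))),
    "field": ((90.0, 120.0), ((400.0, 50.0), (50.0, 300.0))),
}
print(classify_pixel((190.0, 175.0), class_stats))
```

Unlike a plain Euclidean distance, this measure scales each band by the class's own variance and accounts for correlation between bands, so a class with widely spread training pixels does not lose out merely because of its spread.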