• Title/Summary/Keyword: Classifier Clustering

Data mining Algorithms for the Development of Sasang Type Diagnosis (사상체질 진단검사를 위한 데이터마이닝 알고리즘 연구)

  • Hong, Jin-Woo;Kim, Young-In;Park, So-Jung;Kim, Byoung-Chul;Eom, Il-Kyu;Hwang, Min-Woo;Shin, Sang-Woo;Kim, Byung-Joo;Kwon, Young-Kyu;Chae, Han
    • Journal of Physiology & Pathology in Korean Medicine / v.23 no.6 / pp.1234-1240 / 2009
  • This study compared the effectiveness and validity of various data-mining algorithms for the Sasang type diagnostic test. We compared the sensitivity and specificity indices of nine attribute selection algorithms and eleven classification algorithms on 31 data sets characterizing Sasang typology, using the 10-fold validation methods implemented in the Waikato Environment for Knowledge Analysis (WEKA). The highest classification validity scores were as follows: 69.9 as the Percentage Correctly Predicted (PCP) index with the Naive Bayes classifier, 80 as the sensitivity index with LWL for the Tae-Eum type, and 93.5 as the specificity index with the Naive Bayes classifier for the So-Eum type. After attribute selection, the classification algorithm with the highest PCP index, 69.62, was the Naive Bayes classifier. The results indicate that the best-fit algorithm for traditional medicine depends on the case, and that the characteristics of the clinical circumstances, the data-mining algorithms, and the study purpose should all be considered to obtain the highest validity, even with well-defined data sets. They also confirm that there is no one-size-fits-all algorithm and that further studies involving trial and error are needed. This study will serve as a pivotal foundation for the development of medical instruments for Pattern Identification and Sasang type diagnosis based on traditional Korean Medicine.
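
The indices named in this abstract (PCP, sensitivity, specificity) can all be read off a confusion matrix. The following is a minimal sketch of that arithmetic, not the study's WEKA workflow; the function names and the counts are made up for illustration, and the three labels are just a subset of Sasang types chosen to keep the example small.

```python
def per_class_metrics(confusion, labels):
    """Return {label: (sensitivity, specificity)} for a square confusion matrix.

    confusion[i][j] counts samples whose true class is labels[i] and
    whose predicted class is labels[j].
    """
    total = sum(sum(row) for row in confusion)
    metrics = {}
    for k, label in enumerate(labels):
        tp = confusion[k][k]
        fn = sum(confusion[k]) - tp                  # true class k, predicted otherwise
        fp = sum(row[k] for row in confusion) - tp   # predicted k, true class otherwise
        tn = total - tp - fn - fp
        sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
        specificity = tn / (tn + fp) if (tn + fp) else 0.0
        metrics[label] = (sensitivity, specificity)
    return metrics


def percent_correctly_predicted(confusion):
    """Share of all samples on the diagonal, as a percentage."""
    total = sum(sum(row) for row in confusion)
    correct = sum(confusion[k][k] for k in range(len(confusion)))
    return 100.0 * correct / total


# Hypothetical counts for illustration only; these are not the study's data.
labels = ["Tae-Eum", "So-Eum", "So-Yang"]
confusion = [
    [8, 1, 1],
    [2, 6, 2],
    [1, 1, 8],
]
metrics = per_class_metrics(confusion, labels)
pcp = percent_correctly_predicted(confusion)
```

Because sensitivity and specificity are computed per class in a one-versus-rest fashion, different algorithms can lead on different types, which is consistent with the abstract reporting separate winners per index.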

Classification of Forest Vegetation Zone over Southern Part of Korean Peninsula Using Geographic Information Systems (環境因子의 空間分析을 통한 南韓지역의 山林植生帶 구분/지리정보시스템(GIS)에 의한 접근)

  • Lee, Kyu-Sung;Byong-Chun Lee;Joon Hwan Shin
    • The Korean Journal of Ecology / v.19 no.5 / pp.465-476 / 1996
  • Several environmental variables may influence the spatial distribution of forest vegetation. To create a map of forest vegetation zones over the southern part of the Korean Peninsula, digital map layers were produced for each environmental variable, covering topography, geographic location, and climate. In addition, an extensive set of field survey data was collected at relatively undisturbed forests and entered into the GIS database with the exact coordinates of the survey sites. Preliminary statistical analysis of the survey data showed that the environmental variables differed significantly among the five previously defined forest vegetation zones. The six digital map layers representing the environmental variables were classified both by a supervised classifier, using training statistics from the field survey data, and by a clustering algorithm. Although the maps from the two classifiers differed somewhat because of the classification procedures applied, both showed the overall patterns of vertical and horizontal distribution of forest zones. Considering the spatial content of many ecological studies, GIS can be an important tool for managing and analyzing spatial data. This study focuses on the generation of the digital maps and the analysis procedure rather than on the resulting map of forest vegetation zones.

CAD Scheme To Detect Brain Tumour In MR Images using Active Contour Models and Tree Classifiers

  • Helen, R.;Kamaraj, N.
    • Journal of Electrical Engineering and Technology / v.10 no.2 / pp.670-675 / 2015
  • Medical imaging is one of the most powerful tools for gaining information about internal organs and tissues. Developing sophisticated image analysis methods that improve diagnostic accuracy is a challenging task. The objective of this paper is to develop a Computer Aided Diagnostics (CAD) scheme for brain tumour detection from Magnetic Resonance Images (MRI) using active contour models, and to investigate several approaches for improving CAD performance. The clinical problem is the automatic detection of brain tumours with maximum accuracy in minimum time. This work involves the following steps: i) segmentation by Fuzzy Clustering with Level Set Method (FCMLSM), whose performance is compared with snake models based on balloon force and Gradient Vector Flow (GVF) and with Distance Regularized Level Set Evolution (DRLSE); ii) feature extraction using shape and texture based features; iii) brain tumour detection by various tree classifiers. According to this investigation, FCMLSM is a well-suited segmentation method and Random Forest is the best-performing classifier for this problem. The method achieves an accuracy of 97% with minimal classification error. Detecting a tumour takes approximately 2 minutes per examination (30 slices).

Design of Fingerprints Identification Based on RBFNN Using Image Processing Techniques (영상처리 기법을 통한 RBFNN 패턴 분류기 기반 개선된 지문인식 시스템 설계)

  • Bae, Jong-Soo;Oh, Sung-Kwun;Kim, Hyun-Ki
    • The Transactions of The Korean Institute of Electrical Engineers / v.65 no.6 / pp.1060-1069 / 2016
  • In this paper, we introduce a fingerprint recognition system based on a Radial Basis Function Neural Network (RBFNN). Fingerprints are classified into four types (Whorl, Arch, Right loop, Left loop). Preprocessing methods such as the fast Fourier transform, normalization, calculation of ridge direction, filtering with a Gabor filter, binarization, and a rotation algorithm are used to extract features from the fingerprint images, and those features serve as the inputs of the network. The RBFNN uses Fuzzy C-Means (FCM) clustering in the hidden layer, and polynomial functions (linear, quadratic, and modified quadratic) are defined as the connection weights of the network. The Particle Swarm Optimization (PSO) algorithm optimizes a number of essential parameters needed to improve the accuracy of the RBFNN. The optimized parameters include the number of clusters and the fuzzification coefficient used in the FCM algorithm, and the polynomial order of the network. The performance of the proposed fingerprint recognition system is evaluated on fingerprint data sets generated with the Anguli program.
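
To make the role of the number of clusters and the fuzzification coefficient concrete, here is a minimal sketch of the standard textbook FCM update equations. It is not the paper's RBFNN or its PSO tuning; the function names and sample points are hypothetical, and `m` stands for the fuzzification coefficient.

```python
import math


def fcm_memberships(points, centers, m=2.0):
    """Membership of every point in every cluster, for fuzzification coefficient m > 1."""
    power = 2.0 / (m - 1.0)
    memberships = []
    for p in points:
        dists = [math.dist(p, c) for c in centers]
        if any(d == 0.0 for d in dists):
            # The point sits exactly on a centre: share membership among those centres.
            zeros = [1.0 if d == 0.0 else 0.0 for d in dists]
            memberships.append([z / sum(zeros) for z in zeros])
            continue
        memberships.append(
            [1.0 / sum((di / dj) ** power for dj in dists) for di in dists]
        )
    return memberships


def fcm_centers(points, memberships, m=2.0):
    """Cluster centres as means of the points weighted by membership ** m."""
    dim = len(points[0])
    centers = []
    for i in range(len(memberships[0])):
        weights = [u[i] ** m for u in memberships]
        total = sum(weights)
        centers.append(tuple(
            sum(w * p[d] for w, p in zip(weights, points)) / total
            for d in range(dim)
        ))
    return centers


# Hypothetical 2-D feature vectors and two starting centres.
points = [(0.0, 0.0), (1.0, 0.0), (3.0, 0.0)]
centers = [(0.0, 0.0), (4.0, 0.0)]
u = fcm_memberships(points, centers, m=2.0)
new_centers = fcm_centers(points, u, m=2.0)
```

Alternating the two functions until the centres stop moving gives the usual FCM loop. A larger `m` makes memberships softer, which is one reason it is a natural parameter to hand to an optimizer.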

A Robust Method for Automatic Segmentation and Recognition of Apoptosis Cell (Apoptosis 세포의 자동화된 분할 및 인식을 위한 강인한 방법)

  • Liu, Hai-Ling;Shin, Young-Suk
    • Journal of KIISE:Computing Practices and Letters / v.15 no.6 / pp.464-468 / 2009
  • In this paper we propose an image-based approach, different from the traditional flow cytometric method, for detecting the shape of apoptosis cells. This method can overcome the shortcomings of cytometry and recognize apoptosis cells precisely. K-means clustering is used for rough segmentation, and an active contour model, called a 'snake', is used for precise edge detection. Features of the apoptosis cells are then extracted, including physical features, shape descriptors, and texture features. Finally, a Mahalanobis distance classifier labels the segmented images as apoptosis or non-apoptosis cells.
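
The final step, a Mahalanobis distance classifier, assigns a sample to the class whose mean is closest once distances are scaled by that class's covariance. Below is a minimal sketch restricted to two features so the 2x2 inverse can be written out by hand; the feature values and helper names are hypothetical and are not taken from the paper.

```python
def mean_and_cov_2d(samples):
    """Sample mean and covariance (sxx, sxy, syy) of a list of (x, y) pairs."""
    n = len(samples)
    mx = sum(x for x, _ in samples) / n
    my = sum(y for _, y in samples) / n
    sxx = sum((x - mx) ** 2 for x, _ in samples) / (n - 1)
    syy = sum((y - my) ** 2 for _, y in samples) / (n - 1)
    sxy = sum((x - mx) * (y - my) for x, y in samples) / (n - 1)
    return (mx, my), (sxx, sxy, syy)


def mahalanobis_sq(point, mean, cov):
    """Squared Mahalanobis distance d^2 = v^T * inverse(cov) * v, with v = point - mean."""
    sxx, sxy, syy = cov
    det = sxx * syy - sxy * sxy
    if det <= 0:
        raise ValueError("covariance matrix is singular")
    dx, dy = point[0] - mean[0], point[1] - mean[1]
    return (syy * dx * dx - 2 * sxy * dx * dy + sxx * dy * dy) / det


def classify(point, class_stats):
    """Pick the class with the smallest squared Mahalanobis distance."""
    return min(class_stats, key=lambda c: mahalanobis_sq(point, *class_stats[c]))


# Hypothetical (shape, texture) feature pairs for illustration only.
training = {
    "apoptosis": [(1.0, 1.0), (2.0, 1.5), (1.5, 2.5), (2.5, 2.0)],
    "non-apoptosis": [(6.0, 6.0), (7.0, 6.5), (6.5, 7.5), (7.5, 7.0)],
}
class_stats = {name: mean_and_cov_2d(s) for name, s in training.items()}
label = classify((2.0, 2.0), class_stats)
```

With more than two features the same rule applies, but the covariance inverse would normally come from a linear algebra library rather than a hand-written formula.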

Temporal Classification Method for Forecasting Load Patterns from AMR Data

  • Lee, Heon-Gyu;Shin, Jin-Ho;Ryu, Keun-Ho
    • Proceedings of the KSRS Conference / 2007.10a / pp.594-597 / 2007
  • We present a novel mid- and long-term power load prediction method using temporal pattern mining from AMR (Automatic Meter Reading) data. Because power load patterns vary over time and differ greatly by hour, day, week, and so on, traditional data mining alone gives uninformative results. Previous research on data mining for analyzing electric load patterns has focused on cluster analysis and classification methods. However, despite the usefulness of rules that include a temporal dimension and the fact that AMR data have temporal attributes, those methods were limited to static pattern extraction and did not consider temporal attributes. We therefore propose a new classification method for predicting power load patterns. The main tasks are a clustering method and a temporal classification method. Cluster analysis is used to create load pattern classes and the representative load profile for each class. Next, the classification method uses the representative load profiles to build a classifier that can assign different load patterns to the existing classes. The proposed classification method is calendar-based temporal mining, which discovers electric load patterns at multiple time granularities. Lastly, we show that the proposed method, applied to AMR data, discovered more interesting patterns.
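
The two-stage idea in this abstract (clusters yield one representative load profile per class, and new load patterns are then assigned to an existing class) can be sketched as follows. This is a simplified illustration that assumes cluster assignments are already available and uses nearest-profile matching in place of the paper's calendar-based temporal classifier; all names and numbers are hypothetical.

```python
import math
from collections import defaultdict


def representative_profiles(profiles, cluster_ids):
    """Average the load profiles in each cluster, point by point."""
    groups = defaultdict(list)
    for profile, cid in zip(profiles, cluster_ids):
        groups[cid].append(profile)
    return {
        cid: [sum(vals) / len(vals) for vals in zip(*members)]
        for cid, members in groups.items()
    }


def assign_class(profile, representatives):
    """Assign a load pattern to the class with the closest representative profile."""
    return min(
        representatives,
        key=lambda cid: math.dist(profile, representatives[cid]),
    )


# Hypothetical three-point load profiles; real AMR profiles would be much longer.
profiles = [[1.0, 2.0, 3.0], [3.0, 4.0, 5.0], [10.0, 10.0, 10.0], [12.0, 12.0, 12.0]]
cluster_ids = [0, 0, 1, 1]
reps = representative_profiles(profiles, cluster_ids)
assigned = assign_class([9.0, 11.0, 10.0], reps)
```

Building separate sets of representatives per calendar unit (for example per weekday or per month) would be one way to introduce the multiple time granularities the abstract mentions, though the paper's exact procedure is not described here.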

Pattern Classification Based on the Selective Perception Ability of Human Beings (인간 시각의 선택적 지각 능력에 기반한 패턴 분류)

  • Kim Do-Hyeon;Kim Kwang-Baek;Cho Jae-Hyun;Cha Eui-Young
    • Journal of the Korea Institute of Information and Communication Engineering / v.10 no.2 / pp.398-405 / 2006
  • We propose a pattern classification model based on the selective perception ability of human beings. Generally, human beings recognize an object by concentrating selectively on it within the region of interest. Adapting this phenomenon to pattern classification could improve classification and recognition considerably. First, the pattern classification model creates reference cluster patterns in the usual way. Then it generates an SPM (Selective Perception Map) that reflects the mutual relations among the reference cluster patterns. In the recognition phase, the model applies the SPM as a weight when calculating the distance between an input pattern and the reference patterns. Our experiments show that the proposed classifier with the SPM achieved better results than other approaches to pattern classification.
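
The recognition phase described here amounts to a weighted distance between the input and each reference pattern. The sketch below shows only that weighting step; how the SPM itself is derived is not stated in the abstract, so the weight vector, the reference patterns, and the function names are all hypothetical.

```python
def weighted_sq_distance(x, ref, weights):
    """Squared Euclidean distance with a per-feature weight."""
    return sum(w * (a - b) ** 2 for w, a, b in zip(weights, x, ref))


def classify_with_weights(x, references, weights):
    """Return the name of the reference pattern with the smallest weighted distance."""
    return min(
        references,
        key=lambda name: weighted_sq_distance(x, references[name], weights),
    )


# Hypothetical reference patterns and a hand-picked weight map standing in for an SPM.
references = {"A": [0.0, 0.0, 2.0], "B": [4.0, 1.0, 2.0]}
spm_weights = [0.1, 1.0, 0.0]   # emphasise the second feature, ignore the third
uniform = [1.0, 1.0, 1.0]

x = [3.0, 0.0, 2.0]
with_spm = classify_with_weights(x, references, spm_weights)
without_spm = classify_with_weights(x, references, uniform)
```

In this toy case the unweighted distance picks "B" while the weighted one picks "A", which illustrates how a weight map can change a decision by concentrating on the features that matter.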

Power Load Pattern Classification from AMR Data (AMR 데이터에서의 전력 부하 패턴 분류)

  • Piao, Minghao;Park, Jin-Hyung;Lee, Heon-Gyu;Shin, Jin-Ho;Ryu, Keun-Ho
    • Proceedings of the Korea Information Processing Society Conference / 2008.05a / pp.231-234 / 2008
  • This paper presents an automated methodology based on data mining techniques for predicting customer load patterns in load demand data. The main aim of our work is to forecast customers' contract information from the capacity of their daily power consumption patterns. Based on the result, we evaluate the suitability of the contract information. The proposed approach consists of three stages: (i) data preprocessing, in which noise and outliers are detected and removed; (ii) cluster analysis, in which SOM clustering is used to create load patterns and the representative load profiles; and (iii) classification, in which a k-NN classifier predicts the customers' contract information based on power consumption patterns. In the proposed methodology, the power load measured by the AMR (Automatic Meter Reading) system, together with customer indexes, is used as input. The output is the classification of representative load profiles (or classes). Lastly, to evaluate the k-NN classification technique, the proposed methodology was applied to a set of high-voltage customers of the Korean power system, and the experimental results are presented.
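
Stage (iii) relies on a k-nearest-neighbour vote. The following is a minimal sketch of that vote on load profiles, assuming Euclidean distance and majority voting; the contract labels and profile values are hypothetical, and the SOM clustering stage is not reproduced.

```python
import math
from collections import Counter


def knn_predict(query, train_profiles, train_labels, k=3):
    """Predict a label by majority vote among the k closest training profiles."""
    ranked = sorted(
        zip(train_profiles, train_labels),
        key=lambda pair: math.dist(query, pair[0]),
    )
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


# Hypothetical two-point load profiles with made-up contract classes.
train_profiles = [[1.0, 1.0], [1.0, 2.0], [2.0, 1.0], [8.0, 8.0], [9.0, 8.0], [8.0, 9.0]]
train_labels = ["contract_low"] * 3 + ["contract_high"] * 3
prediction = knn_predict([2.0, 2.0], train_profiles, train_labels, k=3)
```

Comparing such a prediction with the contract a customer actually holds is one way to read the abstract's goal of evaluating the suitability of contract information.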

Performance Improvement of Collaborative Filtering System Using Associative User's Clustering Analysis for the Recalculation of Preference and Representative Attribute-Neighborhood (선호도 재계산을 위한 연관 사용자 군집 분석과 Representative Attribute-Neighborhood를 이용한 협력적 필터링 시스템의 성능향상)

  • Jung, Kyung-Yong;Kim, Jin-Su;Kim, Tae-Yong;Lee, Jung-Hyun
    • The KIPS Transactions:PartB / v.10B no.3 / pp.287-296 / 2003
  • Much research has focused on collaborative filtering techniques in recommender systems. However, these studies suffer from the First-Rater Problem and the Sparsity Problem. The main purpose of this paper is to solve these problems. We propose a method for predicting user preference that uses Bayesian estimated values and associative user clustering to recalculate preferences. In addition, to compensate for the shortcoming that item attributes are not taken into account, we use the Representative Attribute-Neighborhood method, which extracts the representative attributes that most affect preference and uses them for prediction when finding similar neighbors. We improved efficiency by using associative user clustering analysis to calculate the preference for a specific item within the cluster item vector of the collaborative filtering algorithm. To address the Sparsity and First-Rater problems, associative users are clustered by genre using the Association Rule Hypergraph Partitioning algorithm, and new users are classified into one of these genres by a Naive Bayes classifier. In addition, to obtain the similarity between users belonging to the classified genre and new users, this paper assigns different estimated values, obtained through Naive Bayes learning, to the items each user has rated. Applying the preferences weighted by these estimated values to the Pearson correlation coefficient yields higher accuracy because errors caused by missing values are reduced. We evaluate our method on a large collaborative filtering database of user ratings, and it significantly outperforms the previously proposed method.
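
The similarity measure named in this abstract, the Pearson correlation coefficient, is usually computed over the items two users have both rated. Below is a minimal sketch of that plain computation; it does not include the paper's Bayesian estimated values or genre clustering, and the user ratings and item names are hypothetical.

```python
import math


def pearson_similarity(ratings_a, ratings_b):
    """Pearson correlation over co-rated items; 0.0 when it is undefined."""
    common = sorted(set(ratings_a) & set(ratings_b))
    if len(common) < 2:
        return 0.0
    a = [ratings_a[item] for item in common]
    b = [ratings_b[item] for item in common]
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    num = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    den = math.sqrt(
        sum((x - mean_a) ** 2 for x in a) * sum((y - mean_b) ** 2 for y in b)
    )
    return num / den if den else 0.0


# Hypothetical ratings keyed by item id.
user_a = {"m1": 5, "m2": 3, "m3": 1}
user_b = {"m1": 4, "m2": 2, "m3": 0, "m4": 5}
user_c = {"m1": 1, "m2": 3, "m3": 5}
sim_ab = pearson_similarity(user_a, user_b)
sim_ac = pearson_similarity(user_a, user_c)
```

When two users share few rated items, the coefficient is either undefined or unreliable, which is the sparsity issue the abstract sets out to reduce by filling in estimated preferences.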

Context-Dependent Classification of Multi-Echo MRI Using Bayes Compound Decision Model (Bayes의 복합 의사결정모델을 이용한 다중에코 자기공명영상의 context-dependent 분류)

  • 전준철;권수일
    • Investigative Magnetic Resonance Imaging / v.3 no.2 / pp.179-187 / 1999
  • Purpose: This paper introduces a computationally inexpensive context-dependent classification of multi-echo MRI using a Bayes compound decision model. To produce accurate region segmentation, especially in homogeneous areas and along region boundaries, we propose a classification method that uses contextual information from the local neighborhood system in the image. Materials and Methods: The performance of a context-free classifier over a statistically heterogeneous image can be improved if the locally stationary regions in the image are dissociated from each other through interaction parameters defined at the local neighborhood level. To improve classification accuracy, we use contextual information, which resolves ambiguities in the class assignment of a pattern based on the labels of the neighboring patterns. Because the data immediately surrounding a given pixel are closely associated with that pixel, knowing the true nature of the surrounding pixels helps to determine the true nature of the given pixel. The proposed context-dependent compound decision model uses the compound Bayes decision rule with this contextual information. The directional transition probabilities estimated from the local neighborhood system serve as the interaction parameters. Results: A context-dependent classification paradigm with a compound Bayesian model for multi-echo MR images is developed. Compared to context-free classification, which does not consider contextual information, the context-dependent classifier shows improved classification results, especially in homogeneous areas and along region boundaries, because contextual information is used during classification. Conclusion: We introduce a new paradigm for classifying multi-echo MRI that uses clustering analysis and a Bayesian compound decision model to improve the classification results.
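
One simplified way to picture a context-dependent decision is to multiply each class's likelihood for a pixel by the transition probabilities linking that class to the labels of its neighbors, then take the best-scoring class. The sketch below assumes equal class priors, independent neighbors, and a two-direction neighborhood; it is an illustration of the general idea rather than the paper's exact compound decision rule, and every name and probability in it is hypothetical.

```python
def context_dependent_label(likelihoods, neighbor_labels, transition):
    """Pick the class maximising likelihood times neighbour transition probabilities.

    likelihoods:     {class: p(pixel value | class)}
    neighbor_labels: {direction: label already assigned to that neighbour}
    transition:      {direction: {class: {neighbour_label: probability}}}
    """
    scores = {}
    for cls, like in likelihoods.items():
        score = like
        for direction, nlabel in neighbor_labels.items():
            score *= transition[direction][cls][nlabel]
        scores[cls] = score
    return max(scores, key=scores.get)


# Hypothetical values: the pixel alone slightly favours tissue_b.
likelihoods = {"tissue_a": 0.45, "tissue_b": 0.55}
context_free = max(likelihoods, key=likelihoods.get)

# Hypothetical directional transition probabilities, identical east and west here.
table = {
    "tissue_a": {"tissue_a": 0.9, "tissue_b": 0.1},
    "tissue_b": {"tissue_a": 0.2, "tissue_b": 0.8},
}
transition = {"east": table, "west": table}
neighbor_labels = {"east": "tissue_a", "west": "tissue_a"}
contextual = context_dependent_label(likelihoods, neighbor_labels, transition)
```

Here the context-free choice is `tissue_b`, but two neighbors labeled `tissue_a` pull the contextual decision to `tissue_a`, which mirrors the abstract's claim that context resolves ambiguous assignments inside homogeneous regions.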