• Title/Summary/Keyword: 논문 분류

Search Result 12,560, Processing Time 0.034 seconds

Assessing Classification Accuracy using Cohen's kappa in Data Mining (데이터 마이닝에서 Cohen의 kappa를 이용한 분류정확도 측정)

  • Um, Yonghwan
    • Journal of the Korea Society of Computer and Information
    • /
    • v.18 no.1
    • /
    • pp.177-183
    • /
    • 2013
  • In this paper, Cohen's kappa and weighted kappa are applied to measuring classification accuracy when performing classification in data minig. Cohen's kappa compensates for classifications that may be due to chance and is used for the data with nominal or ordinal scales. Especially, for the ordinal data, weighted kappa which measures the classification accuracy by quantifying the classification errors as weights is used. We used two weights (linear weight, quadratic weight) for calculations of weighted kappa. Also for the calculation and comparison of kappa and weighted kappa we used a real data set, fat-liver data.

An Implementation of Neuro-Fuzzy Based Land Convert Pattern Classification System for Remote Sensing Image (뉴로-퍼지 알고리즘을 이용한 원격탐사 화상의 지표면 패턴 분류시스템 구현)

  • 이상구
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.9 no.5
    • /
    • pp.472-479
    • /
    • 1999
  • In this paper, we propose a land cover pattern classifier for remote sensing image by using neuro-fuzzy algorithm. The proposed pattem classifier has a 3-layer feed-forward architecture that is derived from generic fuzzy perceptrons, and the weights are con~posed of h u y sets. We also implement a neuro-fuzzy pattern classification system in the Visual C++ environment. To measure the performance of this, we compare it with the conventional neural networks with back-propagation learning and the Maximum-likelihood algorithms. We classified the remote sensing image into the eight classes covered the majority of land cover feature, selected the same training sites. Experimental results show that the proposed classifier performs well especially in the mixed composition area having many classes rather than the conventional systems.

  • PDF

A Design and Implementation of Web Robot by Using Genre-based Categorization and Subject-based Categorization (장르기반 분류와 주제기반 분류를 이용한 웹 로봇의 설계 및 구현)

  • Lee Yong-Bae
    • The KIPS Transactions:PartB
    • /
    • v.12B no.4 s.100
    • /
    • pp.499-506
    • /
    • 2005
  • It still has some restrictions to collect a specialized information with only the function of existing web robot which collect an enormous of data by circulating through the internet. Therefore, in this paper the functions of the current web robot and its application areas are analyzed and the limitations of collecting a specialized information are found out. Also we define what functions are necessary for a web robot in order to collect a specialized information. Then the designed structure is described. There are two critical functions which are applied to web robot. One is a genre-based categorization that classifies the text by the type, and the other is a content-based categorization by the subject. Most of all, genre-based categorization is used as fundamental feature which enables web robot to collect the aimed documents efficiently.

A Hypertext Categorization Method using Incrementally Computable Class Link Information (점진적으로 계산되는 분류정보와 링크정보를 이용한 하이퍼텍스트 문서 분류 방법)

  • Oh, Hyo-Jung;Myaeng, Sung-Hyoun
    • Journal of KIISE:Software and Applications
    • /
    • v.29 no.7
    • /
    • pp.498-509
    • /
    • 2002
  • As WWW grows at an increasing speed, a classifier targeted at hypertext has become in high demand. While document categorization il quite mature, the issue of utilizing hypertext structure and hyperlinks has been relatively unexplored. In this paper, we propose a practical method for enhancing both the speed and the quality of hypertext categorization using hyerlinks. In comparison against a recently proposed technique that appears to be the only one of the kind, we obtained up to 18.5% of improvement in effectiveness while reducing the processing time dramatically. We attempt to explain through experiments what factors contribute to tile improvement.

Sasang Constitution Classification of a Middle-Aged Man Using Speech Signal Analysis (음성 정보 분석값을 통한 장년기 남성의 사상체질 분류)

  • Kim, Bong-Hyun;Lee, Se-Hwan;Park, Sun-Ae;Ka, Min-Kyoung;Cho, Dong-Uk
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2007.11a
    • /
    • pp.117-120
    • /
    • 2007
  • 개인의 체질에 맞춰 의학적 행위를 시행하는 사상의학은 우리나라 고유의 전통의학으로 가치를 인정받고 있다. 이러한 사상의학에서 가장 중요한 것은 사상체질의 정확한 분류이다. 본 논문에서는 기존의 사상체질 분류 방법인 용모사기, 체형기상, QSCCII, 체질침 등이 임상의들의 직관에 의해 행해지고 있다는 문제점을 해결하기 위해 사상체질 분류의 정량화 및 객관화를 위한 연구를 수행하였다. 이를 위해 본 논문에서는 음성 신호 분석에서 발생하는 정보의 출력값에 의해 사상 체질을 분류하는 방법을 제안하였다. 이를 위해 40대 이상의 장년기 남성을 대상으로 사상체질 전문의의 진단표에서 뚜렷한 특징을 보유하고 있는 집단군을 구성하고 이들의 음성 특성을 분류하여 음성학적 요소를 추출하고자 한다. 또한 출력된 결과값을 토대로 체질 집단별 차이점과 유사성을 분류하여 사상 체질 분류를 행하였다.

Weighted Bayesian Automatic Document Categorization Based on Association Word Knowledge Base by Apriori Algorithm (Apriori알고리즘에 의한 연관 단어 지식 베이스에 기반한 가중치가 부여된 베이지만 자동 문서 분류)

  • 고수정;이정현
    • Journal of Korea Multimedia Society
    • /
    • v.4 no.2
    • /
    • pp.171-181
    • /
    • 2001
  • The previous Bayesian document categorization method has problems that it requires a lot of time and effort in word clustering and it hardly reflects the semantic information between words. In this paper, we propose a weighted Bayesian document categorizing method based on association word knowledge base acquired by mining technique. The proposed method constructs weighted association word knowledge base using documents in training set. Then, classifier using Bayesian probability categorizes documents based on the constructed association word knowledge base. In order to evaluate performance of the proposed method, we compare our experimental results with those of weighted Bayesian document categorizing method using vocabulary dictionary by mutual information, weighted Bayesian document categorizing method, and simple Bayesian document categorizing method. The experimental result shows that weighted Bayesian categorizing method using association word knowledge base has improved performance 0.87% and 2.77% and 5.09% over weighted Bayesian categorizing method using vocabulary dictionary by mutual information and weighted Bayesian method and simple Bayesian method, respectively.

  • PDF

A Study of CPC-based Technology Classification Analysis Model of Patents (CPC 기반 특허 기술 분류 분석 모델)

  • Chae, Soo-Hyeon;Gim, Jangwon
    • The Journal of the Korea Contents Association
    • /
    • v.18 no.10
    • /
    • pp.443-452
    • /
    • 2018
  • With the explosively increasing intellectual property rights, securing technological competitiveness of companies is more and more important. In particular, since patents include core technologies and element technologies, patent analysis researches are actively conducted to measure the technological value of companies. Various patent analysis studies have been conducted by the International Patent Classification(IPC), which does not include the latest technical classification, and the technical classification accuracy is low. In order to overcome this problem, the Cooperative Patent Classification(CPC), which includes the latest technology classification and detailed technical classification, has been developed. In this paper, we propose a model to analyze the classification of the technologies included in the patent by using the detailed classification system of CPC. It is possible to analyze the inventor's patents in consideration of the relation, importance, and efficiency between the detailed classification schemes of the CPCs to extract the core technology fields and to analyze the details more accurately than the existing IPC-based methods. Also, we perform the comparative evaluation with the existing IPC based patent analysis method and confirm that the proposed model shows better performance in analyzing the inventor's core technology classification.

Diversity based Ensemble Genetic Programming for Improving Classification Performance (분류 성능 향상을 위한 다양성 기반 앙상블 유전자 프로그래밍)

  • Hong Jin-Hyuk;Cho Sung-Bae
    • Journal of KIISE:Software and Applications
    • /
    • v.32 no.12
    • /
    • pp.1229-1237
    • /
    • 2005
  • Combining multiple classifiers has been actively exploited to improve classification performance. It is required to construct a pool of accurate and diverse base classifier for obtaining a good ensemble classifier. Conventionally ensemble learning techniques such as bagging and boosting have been used and the diversify of base classifiers for the training set has been estimated, but there are some limitations in classifying gene expression profiles since only a few training samples are available. This paper proposes an ensemble technique that analyzes the diversity of classification rules obtained by genetic programming. Genetic programming generates interpretable rules, and a sample is classified by combining the most diverse set of rules. We have applied the proposed method to cancer classification with gene expression profiles. Experiments on lymphoma cancer dataset, prostate cancer dataset and ovarian cancer dataset have illustrated the usefulness of the proposed method. h higher classification accuracy has been obtained with the proposed method than without considering diversity. It has been also confirmed that the diversity increases classification performance.

Comparison of object oriented and pixel based classification of satellite data for effective management of natural resources (천연 자원의 효율적인 관리를 위한 위성자료의 객체 및 픽셀기반의 비교)

  • Jayakumar, S.;Heo, Joon;Sohn, Hong-Gyoo;Lee, Jung-Bin;Kim, Jong-Suk
    • Proceedings of the Korean Society of Surveying, Geodesy, Photogrammetry, and Cartography Conference
    • /
    • 2007.04a
    • /
    • pp.215-218
    • /
    • 2007
  • 이 논문은 고해상도 Quickbird 영상을 이용하여 세부레벨계획을 위한 토지피복분류를 수행하였으며 고해상도 영상을 이용한 토지피복분류를 위하여 객체기반분류와 ISODATA 기법을 적용하였다. 객체기반분류는 eCognition 소프트웨어를 사용하였으며 ISODATA 기법의 토지피복분류 결과와 비교분석을 수행하였다. 연구 대상지역은 인도의 Sukkalampatti이라 하는 작은 유역을 대상으로 연구를 진행하였다. 고해상도 영상의 사용으로 토지피복분류에 있어서 공간 해상도에 따른 토지피복의 세부레벨분류 정확도를 향상 시킬 수 있는 이점을 확인 할 수 있으며 또한, 객체기반분류와 ISODATA 기법의 분류 결과는 eCognition을 사용한 객체기반 토지피복분류결과가 ISODATA의 픽셀기반의 분류방법보다 높은 정확도를 보였다.

  • PDF

A Study on the Construction and Cultural DB Classification for Multimedia Contents. (문화콘텐츠 DB분류 및 구축에 관한 연구)

  • Moon, Byung-Chae
    • Proceedings of the Korea Contents Association Conference
    • /
    • 2004.11a
    • /
    • pp.43-55
    • /
    • 2004
  • 본 논문의 주된 내용은 21세기 지식정보사회의 신경향에 부합하는 새로운 분류체계를 연구한 것으로, 연구된 내용은 다음과 같다. 위의 연구를 통해, 21세기 지식정보사회의 신경향에 부합하는 새로운 분류체계가 되도록 하기 위해 듀이십진분류법을 지양하고 인터넷 검색시스템 정보 분류 방식을 따랐으며, 이용자의 요구에 따라 통합 또는 세분화가 가능한 열린 분류안을 지향했다. 또한, 기획성 분류군을 설정하여 지역 및 문화의 특수성, 사용자의 개인적 요구에 부응할 수 있게 했다. 결론적으로 본 연구에서는 주제별 분류와 기획성 분류를 혼합한 형태로 구성하는 것을 권했다. 주제별 분류안은 학술적으로는 유용하지만, 데이터가 항목별로 균등하게 반영되기 어렵고 일반인의 관심을 다양하게 담아내기 어렵다. 따라서 기획성 분류를 가미함으로써 문화의 특징을 살려내야 한다는 것이다.

  • PDF