• Title/Summary/Keyword: Multi-Class Classification

Search Result 226, Processing Time 0.028 seconds

Land Cover Classification of the Korean Peninsula Using Linear Spectral Mixture Analysis of MODIS Multi-temporal Data (MODIS 다중시기 영상의 선형분광혼합화소분석을 이용한 한반도 토지피복분류도 구축)

  • Jeong, Seung-Gyu;Park, Chong-Hwa;Kim, Sang-Wook
    • Korean Journal of Remote Sensing
    • /
    • v.22 no.6
    • /
    • pp.553-563
    • /
    • 2006
  • This study aims to produce land-cover maps of Korean peninsula using multi-temporal MODIS (Moderate Resolution Imaging Spectroradiometer) imagery. To solve the low spatial resolution of MODIS data and enhance classification accuracy, Linear Spectral Mixture Analysis (LSMA) was employed. LSMA allowed to determine the fraction of each surface type in a pixel and develop vegetation, soil and water fraction images. To eliminate clouds, MVC (Maximum Value Composite) was utilized for vegetation fraction and MinVC (Minimum Value Composite) for soil fraction image respectively. With these images, using ISODATA unsupervised classifier, southern part of Korean peninsula was classified to low and mid level land-cover classes. The results showed that vegetation and soil fraction images reflected phenological characteristics of Korean peninsula. Paddy fields and forest could be easily detected in spring and summer data of the entire peninsula and arable land in North Korea. Secondly, in low level land-cover classification, overall accuracy was 79.94% and Kappa value was 0.70. Classification accuracy of forest (88.12%) and paddy field (85.45%) was higher than that of barren land (60.71%) and grassland (57.14%). In midlevel classification, forest class was sub-divided into deciduous and conifers and field class was sub-divided into paddy and field classes. In mid level, overall accuracy was 82.02% and Kappa value was 0.6986. Classification accuracy of deciduous (86.96%) and paddy (85.38%) were higher than that of conifers (62.50%) and field (77.08%).

Land cover classification of a non-accessible area using multi-sensor images and GIS data (다중센서와 GIS 자료를 이용한 접근불능지역의 토지피복 분류)

  • Kim, Yong-Min;Park, Wan-Yong;Eo, Yang-Dam;Kim, Yong-Il
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.28 no.5
    • /
    • pp.493-504
    • /
    • 2010
  • This study proposes a classification method based on an automated training extraction procedure that may be used with very high resolution (VHR) images of non-accessible areas. The proposed method overcomes the problem of scale difference between VHR images and geographic information system (GIS) data through filtering and use of a Landsat image. In order to automate maximum likelihood classification (MLC), GIS data were used as an input to the MLC of a Landsat image, and a binary edge and a normalized difference vegetation index (NDVI) were used to increase the purity of the training samples. We identified the thresholds of an NDVI and binary edge appropriate to obtain pure samples of each class. The proposed method was then applied to QuickBird and SPOT-5 images. In order to validate the method, visual interpretation and quantitative assessment of the results were compared with products of a manual method. The results showed that the proposed method could classify VHR images and efficiently update GIS data.

Customer Level Classification Model Using Ordinal Multiclass Support Vector Machines

  • Kim, Kyoung-Jae;Ahn, Hyun-Chul
    • Asia pacific journal of information systems
    • /
    • v.20 no.2
    • /
    • pp.23-37
    • /
    • 2010
  • Conventional Support Vector Machines (SVMs) have been utilized as classifiers for binary classification problems. However, certain real world problems, including corporate bond rating, cannot be addressed by binary classifiers because these are multi-class problems. For this reason, numerous studies have attempted to transform the original SVM into a multiclass classifier. These studies, however, have only considered nominal classification problems. Thus, these approaches have been limited by the existence of multiclass classification problems where classes are not nominal but ordinal in real world, such as corporate bond rating and multiclass customer classification. In this study, we adopt a novel multiclass SVM which can address ordinal classification problems using ordinal pairwise partitioning (OPP). The proposed model in our study may use fewer classifiers, but it classifies more accurately because it considers the characteristics of the order of the classes. Although it can be applied to all kinds of ordinal multiclass classification problems, most prior studies have applied it to finance area like bond rating. Thus, this study applies it to a real world customer level classification case for implementing customer relationship management. The result shows that the ordinal multiclass SVM model may also be effective for customer level classification.

Optimization of Input Features for Vegetation Classification Based on Random Forest and Sentinel-2 Image (랜덤포레스트와 Sentinel-2를 이용한 식생 분류의 입력특성 최적화)

  • LEE, Seung-Min;JEONG, Jong-Chul
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.23 no.4
    • /
    • pp.52-67
    • /
    • 2020
  • Recently, the Arctic has been exposed to snow-covered land due to melting permafrost every year, and the Korea Geographic Information Institute(NGII) provides polar spatial information service by establishing spatial information of the polar region. However, there is a lack of spatial information on vegetation sensitive to climate change. This research used a multi-temporal Sentinel-2 image to perform land cover classification of the Ny-Ålesund in Arctic Svalbard. In the pre-processing step, 10 bands and 6 vegetation spectral index were generated from multi-temporal Sentinel-2 images. In image-classification step is consisted of extracting the vegetation area through 8-class land cover classification and performing the vegetation species classification. The image classification algorithm used Random Forest to evaluate the accuracy and calculate feature importance through Out-Of-Bag(OOB). To identify the advantages of multi- temporary Sentinel-2 for vegetation classification, the overall accuracy was compared according to the number of images stacked and vegetation spectral index. Overall accuracy was 77% when using single-time Sentinel-2 images, but improved to 81% when using multi-time Sentinel-2 images. In addition, the overall accuracy improved to about 83% in learning when the vegetation index was used additionally. The most important spectral variables to distinguish between vegetation classes are located in the Red, Green, and short wave infrared-1(SWIR1). This research can be used as a basic study that optimizes input characteristics in performing the classification of vegetation in the polar regions.

Multiple SVM Classifier for Pattern Classification in Data Mining (데이터 마이닝에서 패턴 분류를 위한 다중 SVM 분류기)

  • Kim Man-Sun;Lee Sang-Yong
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.15 no.3
    • /
    • pp.289-293
    • /
    • 2005
  • Pattern classification extracts various types of pattern information expressing objects in the real world and decides their class. The top priority of pattern classification technologies is to improve the performance of classification and, for this, many researches have tried various approaches for the last 40 years. Classification methods used in pattern classification include base classifier based on the probabilistic inference of patterns, decision tree, method based on distance function, neural network and clustering but they are not efficient in analyzing a large amount of multi-dimensional data. Thus, there are active researches on multiple classifier systems, which improve the performance of classification by combining problems using a number of mutually compensatory classifiers. The present study identifies problems in previous researches on multiple SVM classifiers, and proposes BORSE, a model that, based on 1:M policy in order to expand SVM to a multiple class classifier, regards each SVM output as a signal with non-linear pattern, trains the neural network for the pattern and combine the final results of classification performance.

HR-evaluation sentence multi-classification and Analysis post-training effect using unlabeled data (HR-평가 문장 Multi-classification 및 Unlabeled data 를 활용한 Post-training 효과 분석)

  • Choi, Cheol;Lim, HeuiSeok
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2022.05a
    • /
    • pp.424-427
    • /
    • 2022
  • 본 연구는 도메인 특성이 강한 HR 평가문장을 BERT PLM 모델을통해 4 가지 class 로 구분하는 문제를 다룬다. 다양한 PLM 모델 적용과 training data 수에 따른 모델 성능 비교를 통해 특정 도메인에 언어모델을 적용하기 위해서 필요한 기준을 확인하였다. 또한 Unlabeled 된 HR 분야 corpus 를 활용하여 BERT 모델을 post-training 한 HR-BERT 가 PLM 분석모델 정확도 향상에 미치는 결과를 탐구한다. 위와 같은 연구를 통해 HR 이 가지고 있는 가장 큰 text data 에 대한 활용 기반을 마련하고, 특수한 도메인 분야에 PLM 을 적용하기 위한 가이드를 제시하고자 한다

Fault Diagnosis of Rotating Machinery Based on Multi-Class Support Vector Machines

  • Yang Bo-Suk;Han Tian;Hwang Won-Woo
    • Journal of Mechanical Science and Technology
    • /
    • v.19 no.3
    • /
    • pp.846-859
    • /
    • 2005
  • Support vector machines (SVMs) have become one of the most popular approaches to learning from examples and have many potential applications in science and engineering. However, their applications in fault diagnosis of rotating machinery are rather limited. Most of the published papers focus on some special fault diagnoses. This study covers the overall diagnosis procedures on most of the faults experienced in rotating machinery and examines the performance of different SVMs strategies. The excellent characteristics of SVMs are demonstrated by comparing the results obtained by artificial neural networks (ANNs) using vibration signals of a fault simulator.

Classification of Power Quality Disturbances Using Feature Vector Combination and Neural Networks (특징벡터 결합과 신경회로망을 이용한 전력외란 식별)

  • Nam, Sang-Won
    • Proceedings of the KIEE Conference
    • /
    • 1997.11a
    • /
    • pp.671-674
    • /
    • 1997
  • The objective of this paper is to present a new feature-vector extraction method for the automatic detection and classification of power quality(PQ) disturbances, where FIT, DWT(Discrete Wavelet Transform), and Fisher's criterion are utilized to extract an appropriate feature vector. In particular, the proposed classifier consists of three parts: i.e., (i) automatic detection of PQ disturbances, where the wavelet transform and signal power estimation method are utilized to detect each disturbance, (ii) feature vector extraction from the detected disturbance, and (iii) automatic classification, where Multi-Layer Perceptron(MLP) is used to classify each disturbance from the corresponding extracted feature vector. To demonstrate the performance and applicability of the proposed classification algorithm, some test results obtained by analyzing 10-class power quality disturbances are also provided.

  • PDF

Multiclass SVM Model with Order Information

  • Ahn, Hyun-Chul;Kim, Kyoung-Jae
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.6 no.4
    • /
    • pp.331-334
    • /
    • 2006
  • Original Support Vsctor Machines (SVMs) by Vapnik were used for binary classification problems. Some researchers have tried to extend original SVM to multiclass classification. However, their studies have only focused on classifying samples into nominal categories. This study proposes a novel multiclass SVM model in order to handle ordinal multiple classes. Our suggested model may use less classifiers but predict more accurately because it utilizes additional hidden information, the order of the classes. To validate our model, we apply it to the real-world bond rating case. In this study, we compare the results of the model to those of statistical and typical machine learning techniques, and another multi class SVM algorithm. The result shows that proposed model may improve classification performance in comparison to other typical multiclass classification algorithms.

A Study on the Dataset of the Korean Multi-class Emotion Analysis in Radio Listeners' Messages (라디오 청취자 문자 사연을 활용한 한국어 다중 감정 분석용 데이터셋연구)

  • Jaeah, Lee;Gooman, Park
    • Journal of Broadcast Engineering
    • /
    • v.27 no.6
    • /
    • pp.940-943
    • /
    • 2022
  • This study aims to analyze the Korean dataset by performing Korean sentence Emotion Analysis in the radio listeners' text messages collected personally. Currently, in Korea, research on the Emotion Analysis of Korean sentences is variously continuing. However, it is difficult to expect high accuracy of Emotion Analysis due to the linguistic characteristics of Korean. In addition, a lot of research has been done on Binary Sentiment Analysis that allows positive/negative classification only, but Multi-class Emotion Analysis that is classified into three or more emotions requires more research. In this regard, it is necessary to consider and analyze the Korean dataset to increase the accuracy of Multi-class Emotion Analysis for Korean. In this paper, we analyzed why Korean Emotion Analysis is difficult in the process of conducting Emotion Analysis through surveys and experiments, proposed a method for creating a dataset that can improve accuracy and can be used as a basis for Emotion Analysis of Korean sentences.