• Title/Summary/Keyword: multi-classification

Search Result 1,242, Processing Time 0.031 seconds

Feature Extraction and Classification of Multi-temporal SAR Data Using 3D Wavelet Transform (3차원 웨이블렛 변환을 이용한 다중시기 SAR 영상의 특징 추출 및 분류)

  • Yoo, Hee Young;Park, No-Wook;Hong, Sukyoung;Lee, Kyungdo;Kim, Yihyun
    • Korean Journal of Remote Sensing
    • /
    • v.29 no.5
    • /
    • pp.569-579
    • /
    • 2013
  • In this study, land-cover classification was implemented using features extracted from multi-temporal SAR data through 3D wavelet transform and the applicability of the 3D wavelet transform as a feature extraction approach was evaluated. The feature extraction stage based on 3D wavelet transform was first carried out before the classification and the extracted features were used as input for land-cover classification. For a comparison purpose, original image data without the feature extraction stage and Principal Component Analysis (PCA) based features were also classified. Multi-temporal Radarsat-1 data acquired at Dangjin, Korea was used for this experiment and five land-cover classes including paddy fields, dry fields, forest, water, and built up areas were considered for classification. According to the discrimination capability analysis, the characteristics of dry field and forest were similar, so it was very difficult to distinguish these two classes. When using wavelet-based features, classification accuracy was generally improved except built-up class. Especially the improvement of accuracy for dry field and forest classes was achieved. This improvement may be attributed to the wavelet transform procedure decomposing multi-temporal data not only temporally but also spatially. This experiment result shows that 3D wavelet transform would be an effective tool for feature extraction from multi-temporal data although this procedure should be tested to other sensors or other areas through extensive experiments.

A Study on the Classification of Ultrasonic Liver Image Feature Vectors and the Design of Diagnosis System (초음파 간영상의 특징벡터 분류 및 진단시스템 구현에 관한 연구)

  • Jeong, Jeong-Won;Kim, Dong-Youn
    • Proceedings of the KOSOMBE Conference
    • /
    • v.1995 no.11
    • /
    • pp.177-182
    • /
    • 1995
  • Since one property(i.e. coarseness, orientation, regularity, granularity etc.) of ultrasound liver images was not sufficiently enough to classify the characteristics of livers, we used the multi-feature vectors from ultrasound images to diagnose the liver disease. The proposed classifier, which uses the multi-feature vectors and Bayes decision rule, performed well for the classification of normal, fat and cirrhosis liver. In our simulation, we used the Battacharyya distance and Hotelling Trace Criterion to select the best multi-feature vectors for the classifier and obtained less classification errors than other methods using single feature vector.

  • PDF

Incremental Multi-classification by Least Squares Support Vector Machine

  • Oh, Kwang-Sik;Shim, Joo-Yong;Kim, Dae-Hak
    • Journal of the Korean Data and Information Science Society
    • /
    • v.14 no.4
    • /
    • pp.965-974
    • /
    • 2003
  • In this paper we propose an incremental classification of multi-class data set by LS-SVM. By encoding the output variable in the training data set appropriately, we obtain a new specific output vectors for the training data sets. Then, online LS-SVM is applied on each newly encoded output vectors. Proposed method will enable the computation cost to be reduced and the training to be performed incrementally. With the incremental formulation of an inverse matrix, the current information and new input data are used for building another new inverse matrix for the estimation of the optimal bias and lagrange multipliers. Computational difficulties of large scale matrix inversion can be avoided. Performance of proposed method are shown via numerical studies and compared with artificial neural network.

  • PDF

LAND COVER CLASSIFICATION BY USING SAR COHERENCE IMAGES

  • Yoon, Bo-Yeol;Kim, Youn-Soo
    • Proceedings of the KSRS Conference
    • /
    • 2008.10a
    • /
    • pp.76-79
    • /
    • 2008
  • This study presents the use of multi-temporal JERS-1 SAR images to the land cover classification. So far, land cover classified by high resolution aerial photo and field survey and so on. The study site was located in Non-san area. This study developed on multi-temporal land cover status monitoring and coherence information mapping can be processing by L band SAR image. From July, 1997 to October, 1998 JERS SAR images (9 scenes) coherence values are analyzed and then classified land cover. This technique which forms the basis of what is called SAR Interferometry or InSAR for short has also been employed in spaceborne systems. In such systems the separation of the antennas, called the baseline is obtained by utilizing a single antenna in a repeat pass

  • PDF

HANDWRITTEN HANGUL RECOGNITION MODEL USING MULTI-LABEL CLASSIFICATION

  • HANA CHOI
    • Journal of the Korean Society for Industrial and Applied Mathematics
    • /
    • v.27 no.2
    • /
    • pp.135-145
    • /
    • 2023
  • Recently, as deep learning technology has developed, various deep learning technologies have been introduced in handwritten recognition, greatly contributing to performance improvement. The recognition accuracy of handwritten Hangeul recognition has also improved significantly, but prior research has focused on recognizing 520 Hangul characters or 2,350 Hangul characters using SERI95 data or PE92 data. In the past, most of the expressions were possible with 2,350 Hangul characters, but as globalization progresses and information and communication technology develops, there are many cases where various foreign words need to be expressed in Hangul. In this paper, we propose a model that recognizes and combines the consonants, medial vowels, and final consonants of a Korean syllable using a multi-label classification model, and achieves a high recognition accuracy of 98.38% as a result of learning with the public data of Korean handwritten characters, PE92. In addition, this model learned only 2,350 Hangul characters, but can recognize the characters which is not included in the 2,350 Hangul characters

The Effect of Meta-Features of Multiclass Datasets on the Performance of Classification Algorithms (다중 클래스 데이터셋의 메타특징이 판별 알고리즘의 성능에 미치는 영향 연구)

  • Kim, Jeonghun;Kim, Min Yong;Kwon, Ohbyung
    • Journal of Intelligence and Information Systems
    • /
    • v.26 no.1
    • /
    • pp.23-45
    • /
    • 2020
  • Big data is creating in a wide variety of fields such as medical care, manufacturing, logistics, sales site, SNS, and the dataset characteristics are also diverse. In order to secure the competitiveness of companies, it is necessary to improve decision-making capacity using a classification algorithm. However, most of them do not have sufficient knowledge on what kind of classification algorithm is appropriate for a specific problem area. In other words, determining which classification algorithm is appropriate depending on the characteristics of the dataset was has been a task that required expertise and effort. This is because the relationship between the characteristics of datasets (called meta-features) and the performance of classification algorithms has not been fully understood. Moreover, there has been little research on meta-features reflecting the characteristics of multi-class. Therefore, the purpose of this study is to empirically analyze whether meta-features of multi-class datasets have a significant effect on the performance of classification algorithms. In this study, meta-features of multi-class datasets were identified into two factors, (the data structure and the data complexity,) and seven representative meta-features were selected. Among those, we included the Herfindahl-Hirschman Index (HHI), originally a market concentration measurement index, in the meta-features to replace IR(Imbalanced Ratio). Also, we developed a new index called Reverse ReLU Silhouette Score into the meta-feature set. Among the UCI Machine Learning Repository data, six representative datasets (Balance Scale, PageBlocks, Car Evaluation, User Knowledge-Modeling, Wine Quality(red), Contraceptive Method Choice) were selected. The class of each dataset was classified by using the classification algorithms (KNN, Logistic Regression, Nave Bayes, Random Forest, and SVM) selected in the study. For each dataset, we applied 10-fold cross validation method. 10% to 100% oversampling method is applied for each fold and meta-features of the dataset is measured. The meta-features selected are HHI, Number of Classes, Number of Features, Entropy, Reverse ReLU Silhouette Score, Nonlinearity of Linear Classifier, Hub Score. F1-score was selected as the dependent variable. As a result, the results of this study showed that the six meta-features including Reverse ReLU Silhouette Score and HHI proposed in this study have a significant effect on the classification performance. (1) The meta-features HHI proposed in this study was significant in the classification performance. (2) The number of variables has a significant effect on the classification performance, unlike the number of classes, but it has a positive effect. (3) The number of classes has a negative effect on the performance of classification. (4) Entropy has a significant effect on the performance of classification. (5) The Reverse ReLU Silhouette Score also significantly affects the classification performance at a significant level of 0.01. (6) The nonlinearity of linear classifiers has a significant negative effect on classification performance. In addition, the results of the analysis by the classification algorithms were also consistent. In the regression analysis by classification algorithm, Naïve Bayes algorithm does not have a significant effect on the number of variables unlike other classification algorithms. This study has two theoretical contributions: (1) two new meta-features (HHI, Reverse ReLU Silhouette score) was proved to be significant. (2) The effects of data characteristics on the performance of classification were investigated using meta-features. The practical contribution points (1) can be utilized in the development of classification algorithm recommendation system according to the characteristics of datasets. (2) Many data scientists are often testing by adjusting the parameters of the algorithm to find the optimal algorithm for the situation because the characteristics of the data are different. In this process, excessive waste of resources occurs due to hardware, cost, time, and manpower. This study is expected to be useful for machine learning, data mining researchers, practitioners, and machine learning-based system developers. The composition of this study consists of introduction, related research, research model, experiment, conclusion and discussion.

Multi-Label Combination for Prediction of Protein Subcellular Localization (다중레이블 조합을 사용한 단백질 세포내 위치 예측)

  • Chi, Sang-Mun
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.18 no.7
    • /
    • pp.1749-1756
    • /
    • 2014
  • Knowledge about protein subcellular localization provides important information about protein function. This paper improves a label power-set multi-label classification for the accurate prediction of subcellular localization of proteins which simultaneously exist at multiple subcellular locations. Among multi-label classification methods, label power-set method can effectively model the correlation between subcellular locations of proteins performing certain biological function. With constrained optimization, this paper calculates combination weights which are used in the linear combination representation of a multi-label by other multi-labels. Using these weights, the prediction probabilities of multi-labels are combined to give final prediction results. Experimental results on human protein dataset show that the proposed method achieves higher performance than other prediction methods for protein subcellular localization. This shows that the proposed method can successfully enrich the prediction probability of multi-labels by exploiting the overlapping information between multi-labels.

An Intelligent System of Marker Gene Selection for Classification of Cancers using Microarray Data (마이크로어레이 데이터를 이용한 암 분류 표지 유전자 선별 시스템)

  • Park, Su-Young;Jung, Chai-Yeoung
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.14 no.10
    • /
    • pp.2365-2370
    • /
    • 2010
  • The method of cancer classification based on microarray could contribute to being accurate cancer classification by finding differently expressing gene pattern statistically according to a cancer type. Therefore, the process to select a closely related informative gene with a particular cancer classification to classify cancer using present microarray technology with effect is essential. In this paper, the system can detect marker genes to likely express the most differentially explaining the effects of cancer using ovarian cancer microarray data. And it compare and analyze a performance of classification of the proposed system with it of established microarray system using multi-perceptron neural network layer. Microarray data set including marker gene that are selected using ANOVA method represent the highest classification accuracy of 98.61%, which show that it improve classification performance than established microarray system.

Accuracy Improvement of Vegetation Classification Using High Resolution Imagery and OOC Technique (고해상도 영상자료 및 객체지향분류기법을 이용한 식생분류 정확도 향상 방안 연구)

  • Hong, Chang-Hee;Park, Jong-Hwa
    • Journal of Environmental Impact Assessment
    • /
    • v.18 no.6
    • /
    • pp.387-392
    • /
    • 2009
  • As Our society's environmental awareness and concern the significant increases, the importance of the legal system for environmental conservation such as the Prior Environmental Review System, Environmental Impact Assessment is growing increasingly. but, still critical issues are present such as reliability. Though there could be various causes such as the system or procedures etc. Above all, basically the environmental data problem is the critical cause. Therefore, this study was trying to improve the environmental data accuracy using the high-resolution color aerial photography, LiDAR data and Object Oriented Classification method. And in this study, classification based on coverage percentage of a particular species was attempted through the multi-resolution segmentation and multi-level classification method. The classification result was verified by comparison with 11 points local survey data. All 11 points were classified correctly. And even though the exact coverage percentage of the particular species did not be measured, It was confirmed that the species was occupied similar portion. It is important that the environmental data which can be used for the conservation value assessment could be acquired.

A Study of Land-Cover Classification Technique for Merging Image Using Fuzzy C-Mean Algorithm (Fuzzy C-Mean 알고리즘을 이용한 중합 영상의 토지피복분류기법 연구)

  • 신석효;안기원;양경주
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.22 no.2
    • /
    • pp.171-178
    • /
    • 2004
  • The advantage of the remote sensing is extraction the information of wide area rapidly. Such advantage is the resource and environment are quick and efficient method to grasps accurately method through the land cover classification of wide area. Accordingly this study was presented more better land cover classification method through an algorithm development. We accomplished FCM(Fuzzy C-Mean) classification technique with MLC (Maximum Likelihood classification) technique to be general land cover classification method in the content of research. And evaluated the accuracy assessment of two classification method. This study is used to the high-resolution(6.6m) Electro-Optical Camera(EOC) panchromatic image of the first Korea Multi-Purpose Satellite 1(KOMPSAT-1) and the multi-spectral Moderate Resolution Imaging Spectroradiometer(MODIS) image data(36 bands).