• Title/Summary/Keyword: Classification Performance


A Study on the Relationship between Class Similarity and the Performance of Hierarchical Classification Method in a Text Document Classification Problem (텍스트 문서 분류에서 범주간 유사도와 계층적 분류 방법의 성과 관계 연구)

  • Jang, Soojung;Min, Daiki
    • The Journal of Society for e-Business Studies
    • /
    • v.25 no.3
    • /
    • pp.77-93
    • /
    • 2020
  • The literature has reported that hierarchical classification methods generally outperform flat classification methods for multi-class document classification problems. Unlike prior studies that constructed a class hierarchy, this paper evaluates the performance of hierarchical and flat classification methods in a situation where the class hierarchy is predefined. We conducted numerical evaluations on two data sets: research papers on climate change adaptation technologies in the water sector and the 20NewsGroup open data set. The evaluation results show that the hierarchical classification method outperforms the flat classification methods only under a certain condition, which differs from the literature. The advantage of the hierarchical classification method over the flat classification method depends on the class similarities at each level of the class structure. More importantly, the hierarchical classification method works better when the upper-level similarity is less than the lower-level similarity.
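The similarity condition in this abstract can be illustrated with a small sketch. The abstract does not say how class similarity is measured, so the cosine similarity between class centroids used here is an assumption, and the toy hierarchy and term-count vectors are hypothetical.

```python
import math
from itertools import combinations

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def centroid(vectors):
    """Component-wise mean of a list of vectors."""
    return [sum(col) / len(vectors) for col in zip(*vectors)]

def mean_pairwise_similarity(centroids):
    """Average cosine similarity over all pairs of centroids."""
    pairs = list(combinations(centroids, 2))
    return sum(cosine(u, v) for u, v in pairs) / len(pairs)

# Hypothetical two-level hierarchy: parent class -> child class -> document vectors.
hierarchy = {
    "sports": {"baseball": [[5, 1, 0, 0], [4, 2, 0, 0]],
               "hockey": [[4, 1, 1, 0], [5, 0, 1, 0]]},
    "science": {"space": [[0, 0, 5, 1], [0, 1, 4, 1]],
                "medicine": [[0, 0, 4, 2], [1, 0, 5, 2]]},
}

def upper_level_similarity(h):
    """Similarity among parent classes, each pooled over all of its documents."""
    parents = [centroid([d for docs in children.values() for d in docs])
               for children in h.values()]
    return mean_pairwise_similarity(parents)

def lower_level_similarity(h):
    """Similarity among sibling classes, averaged over the parents."""
    sims = [mean_pairwise_similarity([centroid(docs) for docs in children.values()])
            for children in h.values()]
    return sum(sims) / len(sims)

upper = upper_level_similarity(hierarchy)
lower = lower_level_similarity(hierarchy)
# The abstract's condition: hierarchical classification is favoured when upper < lower.
hierarchy_favoured = upper < lower
```

In this toy data the parent classes share few terms while siblings share many, so the condition holds; with real documents the vectors would come from whatever text representation the classifier uses.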

A Feature Selection-based Ensemble Method for Arrhythmia Classification

  • Namsrai, Erdenetuya;Munkhdalai, Tsendsuren;Li, Meijing;Shin, Jung-Hoon;Namsrai, Oyun-Erdene;Ryu, Keun Ho
    • Journal of Information Processing Systems
    • /
    • v.9 no.1
    • /
    • pp.31-40
    • /
    • 2013
  • In this paper, a novel method is proposed to build an ensemble of classifiers by using a feature selection schema. The feature selection schema identifies the best feature sets that affect arrhythmia classification. First, a number of feature subsets are extracted by applying the feature selection schema to the original dataset. Then classification models are built by using each feature subset. Finally, we combine the classification models by adopting a voting approach to form a classification ensemble. The voting approach in our method uses both the classification error rate and the feature selection rate to calculate the score of each classifier in the ensemble. In our method, the feature selection rate depends on the order in which the feature subsets are extracted. In the experiment, we applied our method to an arrhythmia dataset and generated the top three disjoint feature sets. We then built three classifiers based on the top three feature subsets and formed the classifier ensemble by using the voting approach. Our method can improve classification accuracy on high-dimensional datasets. The performance of each classifier and the performance of their ensemble were higher than the performance of the classifier based on the whole feature space of the dataset. The classification performance was improved, and a more stable classification model could be constructed with the proposed approach.
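The voting step described above can be sketched as a weighted vote. The abstract does not give the scoring formula, so the score used here (accuracy multiplied by a selection rate of 1/order) is a hypothetical stand-in, as are the function names and the example error rates.

```python
from collections import defaultdict

def classifier_score(error_rate, extraction_order):
    """Score one ensemble member.

    Assumption: the selection rate is 1 / extraction_order, so subsets
    extracted earlier weigh more. The paper's actual formula is not given
    in the abstract.
    """
    selection_rate = 1.0 / extraction_order
    return (1.0 - error_rate) * selection_rate

def weighted_vote(predictions, scores):
    """Return the label with the largest total score across classifiers."""
    totals = defaultdict(float)
    for label, score in zip(predictions, scores):
        totals[label] += score
    return max(totals, key=totals.get)

# Three classifiers built on the first, second, and third extracted subsets.
scores = [classifier_score(0.20, 1),
          classifier_score(0.25, 2),
          classifier_score(0.30, 3)]
predicted = weighted_vote(["normal", "arrhythmia", "arrhythmia"], scores)
```

With these illustrative numbers the first classifier outweighs the other two combined, which shows how a score-weighted vote can differ from a simple majority vote.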

Learning Networks for Learning the Pattern Vectors causing Classification Error (분류오차유발 패턴벡터 학습을 위한 학습네트워크)

  • Lee Yong-Gu;Choi Woo-Seung
    • Journal of the Korea Society of Computer and Information
    • /
    • v.10 no.5 s.37
    • /
    • pp.77-86
    • /
    • 2005
  • In this paper, we design an LVQ learning algorithm that extracts the pattern vectors causing classification errors, learns them, and thereby improves classification performance. The proposed learning network uses SOM to learn the initial reference vectors and the out-star learning algorithm to determine the class of the output neurons of LVQ. To extract the pattern vectors that cause classification errors, we propose an error-cause condition, use it to construct a pattern vector space consisting of the input pattern vectors that cause classification errors, and learn these pattern vectors to improve pattern classification performance. To verify the performance of the proposed learning algorithm, simulations are performed using training and test vectors from Fisher's Iris data and EMG data, and the classification performance of the proposed learning method is compared with that of the conventional LVQ. The results confirm that the proposed learning method classifies more successfully than the conventional method.
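For orientation, the conventional LVQ1 update that the proposed method builds on can be sketched as follows. The abstract does not define the error-cause condition, so this sketch simply collects misclassified samples as a proxy; the SOM initialisation and out-star labelling are omitted, and all names and values are hypothetical.

```python
def squared_distance(u, v):
    """Squared Euclidean distance between two vectors."""
    return sum((a - b) ** 2 for a, b in zip(u, v))

def nearest(refs, x):
    """Index of the reference vector closest to x."""
    return min(range(len(refs)), key=lambda i: squared_distance(refs[i][0], x))

def lvq1_epoch(refs, samples, lr=0.1):
    """Run one pass of conventional LVQ1 and return the misclassified samples.

    refs is a list of (vector, class_label) pairs and is updated in place.
    The winning vector moves toward a sample of its own class and away
    from a sample of a different class.
    """
    errors = []
    for x, label in samples:
        i = nearest(refs, x)
        w, w_label = refs[i]
        sign = 1.0 if w_label == label else -1.0
        refs[i] = ([wi + sign * lr * (xi - wi) for wi, xi in zip(w, x)], w_label)
        if w_label != label:
            errors.append((x, label))
    return errors

refs = [([0.0, 0.0], "A"), ([1.0, 1.0], "B")]
samples = [([0.1, 0.2], "A"), ([0.9, 0.8], "B"), ([0.4, 0.4], "B")]
errors = lvq1_epoch(refs, samples)
# A further pass over only the error-causing patterns, in the spirit of the abstract.
remaining = lvq1_epoch(refs, errors)
```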


Robust Face Recognition under Limited Training Sample Scenario using Linear Representation

  • Iqbal, Omer;Jadoon, Waqas;ur Rehman, Zia;Khan, Fiaz Gul;Nazir, Babar;Khan, Iftikhar Ahmed
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.12 no.7
    • /
    • pp.3172-3193
    • /
    • 2018
  • Recently, several studies have shown that linear-representation-based approaches are very effective and efficient for image classification. One of these approaches is the collaborative representation (CR) method. Existing CR-based algorithms have two major problems that degrade their classification performance. The first problem arises from the limited number of available training samples: large variations between query and training samples, caused by illumination and expression changes, lead to poor classification performance. The second problem occurs when an image is partially corrupted by noise (contiguous occlusion): as part of the given image becomes corrupt, the classification performance also degrades. We aim to extend the collaborative representation framework to the face recognition problem with limited training samples. Our proposed solution generates virtual samples and intra-class variations from the training data to model the variations between query and training samples effectively. For robust classification, image patches are used to compute the representation and address partial occlusion, as this leads to more accurate classification results. The proposed method computes the representation based on local regions in the images, as opposed to CR, which computes the representation based on a global solution involving entire images. Furthermore, the proposed solution integrates the locality structure into CR by using the Euclidean distance between the query and training samples. Intuitively, if the query sample can be represented by its nearest neighbours, which lie on the same linear subspace, then the resulting representation will be more discriminative and will classify the query sample more accurately. Hence our proposed framework turns the limited-sample face recognition problem into a sufficient-training-sample problem by using virtual samples and intra-class variations generated from the training samples, which results in improved classification accuracy, as is evident from the experimental results. Moreover, it computes the representation based on local image patches for robust classification and is expected to greatly increase the classification performance for the face recognition task.
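As background for the CR baseline mentioned in this abstract, the commonly used regularised least-squares form of collaborative representation can be written as below. This is the standard global formulation, not the authors' patch-based or locality-weighted extension; the symbols $X$ (matrix of all training samples), $y$ (query sample), $\lambda$ (regularisation weight), and $X_c$, $\hat{\alpha}_c$ (columns and coefficients belonging to class $c$) are notation assumed here.

```latex
\[
\hat{\alpha}
  = \arg\min_{\alpha} \; \lVert y - X\alpha \rVert_2^2 + \lambda \lVert \alpha \rVert_2^2
  = \left( X^{\top} X + \lambda I \right)^{-1} X^{\top} y
\]
\[
\operatorname{identity}(y)
  = \arg\min_{c} \;
    \frac{\lVert y - X_c \hat{\alpha}_c \rVert_2}{\lVert \hat{\alpha}_c \rVert_2}
\]
```

Because the coefficients come from a closed-form ridge solution over all classes, the quality of the representation depends directly on how well the training samples span the variations in the query, which is the limitation the virtual samples are meant to address.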

Aircraft Classification with Fusion of HRRP and JEM Based on the Confidence of a Classifier (구분기 신뢰도에 기반한 HRRP 및 JEM 융합 항공기 식별)

  • Kim, Si-Ho;Lee, Sang-In;Chae, Dae-Young
    • The Journal of Korean Institute of Electromagnetic Engineering and Science
    • /
    • v.28 no.3
    • /
    • pp.217-224
    • /
    • 2017
  • In this paper, we propose a fusion classification method for aircraft that combines HRRP and JEM classifiers, which have complementary properties. The fusion method is based on the confidence of a classifier in its classification result, in order to improve performance over a single classifier in various situations. The confidence is defined as the posterior probability estimated from the classification performance of a classifier, and it depends on the aspect angle and the certainty of a classification result. Through a classification test using simulation data, we verify that the proposed fusion method shows good performance by fusing the classifiers effectively.
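A confidence-based fusion of this kind can be sketched as follows. The sketch assumes the confidence is the fraction of correct validation decisions per classifier and aspect-angle bin, and it picks the decision of the most confident classifier; the dependence on the certainty of an individual result, which the abstract also mentions, is left out. The counts, bin names, and labels are hypothetical.

```python
# Hypothetical validation counts: (classifier, aspect-angle bin) -> (correct, total).
validation_counts = {
    ("HRRP", "nose-on"): (60, 100),
    ("HRRP", "broadside"): (90, 100),
    ("JEM", "nose-on"): (85, 100),
    ("JEM", "broadside"): (40, 100),
}

def confidence(classifier, aspect_bin):
    """Estimated probability that this classifier is correct in this bin."""
    correct, total = validation_counts[(classifier, aspect_bin)]
    return correct / total

def fuse(decisions, aspect_bin):
    """Return (label, classifier) for the most confident classifier.

    decisions maps a classifier name to the label it predicted.
    """
    best = max(decisions, key=lambda name: confidence(name, aspect_bin))
    return decisions[best], best

decisions = {"HRRP": "type A", "JEM": "type B"}
nose_on_result = fuse(decisions, "nose-on")
broadside_result = fuse(decisions, "broadside")
```

The same pair of conflicting decisions resolves differently in the two bins, which is the complementary behaviour the fusion is meant to exploit.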

Performance of an ML Modulation Classification of QAM Signals with Single-Sample Observation (단일표본관측을 이용한 직교진폭변조 신호의 치운 변조분류 성능)

  • Kang Seog Geun
    • The KIPS Transactions:PartC
    • /
    • v.12C no.1 s.97
    • /
    • pp.63-68
    • /
    • 2005
  • In this paper, the performance of maximum-likelihood modulation classification for quadrature amplitude modulation (QAM) is studied. Unlike previous works, the relative classification performance with respect to the available modulations and the performance limit with single-sample observation are presented. For those purposes, all constellations are set to have the same minimum Euclidean distance between symbols, so that a smaller constellation is a subset of the larger ones, and only one sample of the received waveform is used for the multiple hypothesis test. As a result, classification performance improves as the signal-to-noise ratio increases in all the experiments. In particular, when the true modulation format used in the transmitter is 4 QAM, almost perfect classification can be achieved without any additional information or observation samples. Although the possibility of false classification due to the symbols shared by subset constellations always exists, a correct classification ratio of 80% can be obtained with single-sample observation when the true modulation formats are 16 and 64 QAM.
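The single-sample hypothesis test can be sketched under some assumptions not stated in the abstract: equiprobable symbols, complex additive white Gaussian noise with known variance, and square constellations on a common grid with minimum distance 2. The function names are hypothetical.

```python
import math

def square_qam(order, d_min=2.0):
    """Square QAM constellation with a fixed minimum distance.

    With a common d_min, 4-QAM is a subset of 16-QAM, which is a subset
    of 64-QAM, as in the setup the abstract describes.
    """
    side = math.isqrt(order)
    levels = [d_min * (k - (side - 1) / 2) for k in range(side)]
    return [complex(i, q) for i in levels for q in levels]

def likelihood(r, constellation, noise_var):
    """Likelihood of one received sample r, averaged over equiprobable symbols."""
    total = sum(math.exp(-abs(r - s) ** 2 / noise_var) for s in constellation)
    return total / (math.pi * noise_var * len(constellation))

def classify(r, noise_var, orders=(4, 16, 64)):
    """Pick the modulation order whose constellation best explains r."""
    return max(orders, key=lambda m: likelihood(r, square_qam(m), noise_var))

inner = classify(complex(1, 1), noise_var=0.1)   # symbol shared by all three
middle = classify(complex(3, 3), noise_var=0.1)  # symbol absent from 4-QAM
outer = classify(complex(7, 7), noise_var=0.1)   # symbol only in 64-QAM
```

A sample near a shared symbol is attributed to the smallest constellation, because its symbols carry the largest prior weight. That is consistent with the near-perfect result for 4 QAM and the unavoidable false classifications for 16 and 64 QAM noted in the abstract.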

A Novel Thresholding for Prediction Analytics with Machine Learning Techniques

  • Shakir, Khan;Reemiah Muneer, Alotaibi
    • International Journal of Computer Science & Network Security
    • /
    • v.23 no.1
    • /
    • pp.33-40
    • /
    • 2023
  • Machine-learning techniques are showing effective performance in data analytics. Classification and regression support prediction on different kinds of data, and various kinds of classification techniques are used depending on the nature of the data. Threshold determination is essential for building a better model for unlabelled data. In this paper, threshold values are applied as ranges, based on the min-max normalization technique, to create labels, and multiclass classification is performed on rainfall data. Binary classification is applied to autism data, and classification techniques are applied to child abuse data. The performance of each technique is analysed with evaluation metrics.
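The labelling step can be sketched as min-max normalization followed by range thresholds. The abstract does not give the threshold values or label names, so the cut points at one third and two thirds, the three labels, and the rainfall figures below are hypothetical.

```python
def min_max_normalize(values):
    """Rescale values linearly to the range [0, 1]."""
    lo, hi = min(values), max(values)
    span = hi - lo
    return [(v - lo) / span if span else 0.0 for v in values]

def label_by_range(values, thresholds=(1 / 3, 2 / 3),
                   labels=("low", "medium", "high")):
    """Assign a label according to the threshold range each value falls into.

    Assumption: thresholds are ascending cut points on the normalized scale,
    with one more label than there are thresholds.
    """
    result = []
    for x in min_max_normalize(values):
        index = sum(x > t for t in thresholds)
        result.append(labels[index])
    return result

rainfall_mm = [0.0, 12.0, 30.0, 45.0, 60.0]
rainfall_labels = label_by_range(rainfall_mm)
```

The resulting labels can then serve as targets for a multiclass classifier trained on the other features of the data.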

One-dimensional CNN Model of Network Traffic Classification based on Transfer Learning

  • Lingyun Yang;Yuning Dong;Zaijian Wang;Feifei Gao
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.18 no.2
    • /
    • pp.420-437
    • /
    • 2024
  • Network traffic classification (NTC) suffers from problems such as complicated statistical features and insufficient training samples, which may lead to poor classification results. An NTC architecture based on a one-dimensional Convolutional Neural Network (CNN) and transfer learning is proposed to tackle these problems and improve fine-grained classification performance. The key points of the proposed architecture are: (1) model classification, in which a normalized rate feature set extracted from the original data is combined with existing statistical features to optimize the CNN NTC model; and (2) the application of transfer learning to the classification to improve NTC performance. We collect two typical network flow data sets from Youku and YouTube and verify the proposed method through extensive experiments. The results show that, compared with existing methods, our method improves the classification accuracy by around 3-5% for Youku and by about 7-27% for YouTube.

Awareness and Performance for Standard Precautions among Health Care Workers in a General Hospital (일개 종합병원 의료종사자 직종별 표준주의 인지도와 수행도 비교)

  • Kim, Ja Young;Kim, Bog Ja
    • Journal of Korean Critical Care Nursing
    • /
    • v.5 no.2
    • /
    • pp.49-60
    • /
    • 2012
  • Purpose: The purpose of this study was to explore health care workers' awareness and performance of standard precautions. Methods: Participants were 296 health care workers including nurses, physicians, and medical technicians. Awareness and performance of standard precautions were measured with 4-point Likert scales. The data were analyzed with t-tests and one-way ANOVA using SPSS 18.0. Results: The mean awareness scores were 3.72 for nurses, 3.62 for physicians, and 3.47 for medical technicians. Awareness differed significantly by occupational classification (F=12.39, p<.001). The mean scores for performance of standard precautions were 3.45 for nurses, 3.19 for physicians, and 3.23 for medical technicians. Performance differed significantly by occupational classification (F=10.98, p<.001). In addition, the performance score for standard precautions was significantly lower than the awareness score (t=11.89, p<.001). Conclusion: The results of this study indicate that awareness and performance of standard precautions differ by occupational classification. To improve performance of standard precautions in hospitals, it is necessary to provide infection control programs tailored to each occupational classification.


Evaluation Standard for Performance of Artificial Intelligence Systems: ISO/IEC TR 24029-1 (인공지능 시스템의 성능 평가 표준: ISO/IEC TR 24029-1)

  • Seongsoo Lee
    • Journal of IKEEE
    • /
    • v.27 no.3
    • /
    • pp.350-354
    • /
    • 2023
  • This paper describes ISO/IEC TR 24029-1, an international standard for evaluating the performance of artificial intelligence systems. ISO/IEC TR 24029-1 defines the performance measures of artificial intelligence systems in two categories, i.e. interpolation and classification. Performance measures in the interpolation category indicate how close the predicted values of the artificial intelligence system are to the real values. Performance measures in the classification category indicate how often the predicted classes of the artificial intelligence system match the real classes. Based on these performance measures, the performance of artificial intelligence systems can be evaluated and the performance of different artificial intelligence systems can be compared.
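The two categories of measures can be illustrated with one simple metric each. The abstract does not list the specific measures the standard defines, so mean absolute error and accuracy are used here only as familiar examples of the two ideas, and the sample values are hypothetical.

```python
def mean_absolute_error(predicted, actual):
    """Interpolation-style measure: average distance between predicted and real values."""
    return sum(abs(p - a) for p, a in zip(predicted, actual)) / len(actual)

def accuracy(predicted, actual):
    """Classification-style measure: fraction of predicted classes equal to the real classes."""
    return sum(p == a for p, a in zip(predicted, actual)) / len(actual)

mae = mean_absolute_error([2.5, 0.0, 2.0], [3.0, -0.5, 2.0])
acc = accuracy(["cat", "dog", "cat", "dog"], ["cat", "dog", "dog", "dog"])
```

Computing the same measure for two systems on the same test data gives the kind of comparison the abstract refers to.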