• Title/Summary/Keyword: classifiers

Search Result 743, Processing Time 0.032 seconds

A Chi-Square-Based Decision for Real-Time Malware Detection Using PE-File Features

  • Belaoued, Mohamed;Mazouzi, Smaine
    • Journal of Information Processing Systems
    • /
    • v.12 no.4
    • /
    • pp.644-660
    • /
    • 2016
  • The real-time detection of malware remains an open issue, since most of the existing approaches for malware categorization focus on improving the accuracy rather than the detection time. Therefore, finding a proper balance between these two characteristics is very important, especially for such sensitive systems. In this paper, we present a fast portable executable (PE) malware detection system, which is based on the analysis of the set of Application Programming Interfaces (APIs) called by a program and some technical PE features (TPFs). We used an efficient feature selection method, which first selects the most relevant APIs and TPFs using the chi-square ($KHI^2$) measure, and then the Phi (${\varphi}$) coefficient was used to classify the features in different subsets, based on their relevance. We evaluated our method using different classifiers trained on different combinations of feature subsets. We obtained very satisfying results with more than 98% accuracy. Our system is adequate for real-time detection since it is able to categorize a file (Malware or Benign) in 0.09 seconds.

Classification of Cancer-related Gene Expression Data Using Neural Network Classifiers (신경망 분류기를 이용한 암 관련 유전자 발현정보를 분류)

  • 권영준;류중원;조성배
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2001.04b
    • /
    • pp.295-297
    • /
    • 2001
  • 최근 생물 유전자 정보를 효과적으로 분석하기 위한 적절한 도구의 필요성이 대두되고 있다. 본 논문에서는 백혈병 환자의 골수로부터 얻어낸 DNA Microarray 유전 정보를 분류하여 환자가 가지고 있는 암의 종류를 예측하기 위한 최적의 특징추출방법과 분류 방법을 찾고자 한다. 이를 위해 피어슨 상관관계, 유클리디안 거리, 코사인 계수, 스피어맨 상관관계, 정보 이득, 상호 정보, 신호 대잡음비의 7가지 특징 추출 방법을 사용하였으며, 역전과 신경망, 의사결정 트리, 구조 적응형 자기구성 지도, $textsc{k}$-최근접 이웃 등 가지의 기계학습 분류기를 이용하여 분류 실험을 하였다. 실험결과, 피어슨 상관관계와 역전파 신경망을 이용한 분류 방법이 97.1%의 인식률을 보임을 알 수 있었다.

  • PDF

Evolvable Cellular Classifiers for pattern Recognition (패턴 인식을 위한 진화 셀룰라 분류기)

  • Ju, Jae-Ho;Shin, Yoon-Cheol;Kang, Hoon
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.10 no.4
    • /
    • pp.379-389
    • /
    • 2000
  • A cellular automaton is well-known for self-organizing and dynamic behavions in the filed of artifial life. This paper addresses a new neuronic architecture called an evolvable celluar classifier which evolves with the genetic rules (chromosomes) in the non-uniform cellular automata. An evolvable cellular classifier is primarily based on cellular programming, but its mechanism is simpler becaise it utilizes only mutations for the main genetic operators and resmbles the Hopfield network. Therefore, the desirable bit-patterns could be obtained through evolutionary processes for just one individual agent, As a rusult, an evolvable hardware is derived which is applicable to clessification of bit-string information.

  • PDF

An Experimental Study on Text Categorization using an SVM Classifier (SVM 분류기를 이용한 문서 범주화 연구)

  • 정영미;임혜영
    • Journal of the Korean Society for information Management
    • /
    • v.17 no.4
    • /
    • pp.229-248
    • /
    • 2000
  • Among several learning algorithms for lexl calegoriration. SVM(Snpport Vsctor Machines) has been provcd to ouq~e~fotm other classifiers. Th~study e~~aluales the categarizalion ability of en SVM classifier using the ModApte split of the Reutcrs-21578 dataset. First. an experiment 1s perlormed to test a few feature wetghtlng schemes that will be used in thc calegarization tasks. Second, (he categorization periarrnances of the lulear SVM and the non-linear SVM are compared. Finally. the binary SVM classifier is expanded into a multi-class classifier and thek pcrforrnnnces are comparativcly evaluated.

  • PDF

Discrimination of Acoustic Emission Signals using Pattern Recognition Analysis (형상인식법을 이용한 음향방출신호의 분류)

  • Joo, Y.S.;Jung, H.K.;Sim, C.M.;Lim, H.T.
    • Journal of the Korean Society for Nondestructive Testing
    • /
    • v.10 no.2
    • /
    • pp.23-31
    • /
    • 1990
  • Acoustic Emission(AE) signals obtained during fracture toughness test and fatigue test for nuclear pressure vessel material(SA 508 cl.3) and artificial AE signals from pencil break and ultrasonic pulser were classified using pattern recognition methods. Three different classifiers ; namely Minimum Distance Classifier, Linear Discriminant Classifier and Maximum Likelihood Classifier were used for pattern recognition. In this study, the performance of each classifier was compared. The discrimination of AE signals from cracking and crack surface rubbing was possible and the analysis for crack propagation was applicable by pattern recognition methods.

  • PDF

Vocabulary Expansion Technique for Advertisement Classification

  • Jung, Jin-Yong;Lee, Jung-Hyun;Ha, Jong-Woo;Lee, Sang-Keun
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.6 no.5
    • /
    • pp.1373-1387
    • /
    • 2012
  • Contextual advertising is an important revenue source for major service providers on the Web. Ads classification is one of main tasks in contextual advertising, and it is used to retrieve semantically relevant ads with respect to the content of web pages. However, it is difficult for traditional text classification methods to achieve satisfactory performance in ads classification due to scarce term features in ads. In this paper, we propose a novel ads classification method that handles the lack of term features for classifying ads with short text. The proposed method utilizes a vocabulary expansion technique using semantic associations among terms learned from large-scale search query logs. The evaluation results show that our methodology achieves 4.0% ~ 9.7% improvements in terms of the hierarchical f-measure over the baseline classifiers without vocabulary expansion.

A Neuro-Fuzzy Model Approach for the Land Cover Classification

  • Han, Jong-Gyu;Chi, Kwang-Hoon;Suh, Jae-Young
    • Proceedings of the KSRS Conference
    • /
    • 1998.09a
    • /
    • pp.122-127
    • /
    • 1998
  • This paper presents the neuro-fuzzy classifier derived from the generic model of a 3-layer fuzzy perceptron and developed the classification software based on the neuro-fuzzl model. Also, a comparison of the neuro-fuzzy and maximum-likelihood classifiers is presented in this paper. The Airborne Multispectral Scanner(AMS) imagery of Tae-Duk Science Complex Town were used for this comparison. The neuro-fuzzy classifier was more considerably accurate in the mixed composition area like "bare soil" , "dried grass" and "coniferous tree", however, the "cement road" and "asphalt road" classified more correctly with the maximum-likelihood classifier than the neuro-fuzzy classifier. Thus, the neuro-fuzzy model can be used to classify the mixed composition area like the natural environment of korea peninsula. From this research we conclude that the neuro-fuzzy classifier was superior in suppression of mixed pixel classification errors, and more robust to training site heterogeneity and the use of class labels for land use that are mixtures of land cover signatures.

  • PDF

Comparing Machine Learning Classifiers for Movie WOM Opinion Mining

  • Kim, Yoosin;Kwon, Do Young;Jeong, Seung Ryul
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.9 no.8
    • /
    • pp.3169-3181
    • /
    • 2015
  • Nowadays, online word-of-mouth has become a powerful influencer to marketing and sales in business. Opinion mining and sentiment analysis is frequently adopted at market research and business analytics field for analyzing word-of-mouth content. However, there still remain several challengeable areas for 1) sentiment analysis aiming for Korean word-of-mouth content in film market, 2) availability of machine learning models only using linguistic features, 3) effect of the size of the feature set. This study took a sample of 10,000 movie reviews which had posted extremely negative/positive rating in a movie portal site, and conducted sentiment analysis with four machine learning algorithms: naïve Bayesian, decision tree, neural network, and support vector machines. We found neural network and support vector machine produced better accuracy than naïve Bayesian and decision tree on every size of the feature set. Besides, the performance of them was boosting with increasing of the feature set size.

Bare Numeral Constructions and the Discourse Representation of Partitivity

  • Hong, Min-Pyo
    • Language and Information
    • /
    • v.5 no.1
    • /
    • pp.17-34
    • /
    • 2001
  • Kamp & Reyle's (1993) proposal to represent split antecedents to a plural pronoun in terms of sum- mation and abstraction is critically reviewed in this paper, In point our some weak- nesses of their analysis as well as wrong predictions they make. In propose to treat the partitive reading found in bare numeral constructions by separating the conven-tional DRS construction rules from the cognitively motivated DRS-operations at a different level. A preference rule is also proposed that would constrain the sortal structure of discourse referents when such operations as summation and abstraction are enforced at the DRS's of relevant levels. Evidence for the separate treatment of linguistically motivated processes apart from cognitively motivated ones comes from both English and Korean constructions involving definite plural pronouns and numeral classifiers.

  • PDF

Gait Recognition Based on GF-CNN and Metric Learning

  • Wen, Junqin
    • Journal of Information Processing Systems
    • /
    • v.16 no.5
    • /
    • pp.1105-1112
    • /
    • 2020
  • Gait recognition, as a promising biometric, can be used in video-based surveillance and other security systems. However, due to the complexity of leg movement and the difference of external sampling conditions, gait recognition still faces many problems to be addressed. In this paper, an improved convolutional neural network (CNN) based on Gabor filter is therefore proposed to achieve gait recognition. Firstly, a gait feature extraction layer based on Gabor filter is inserted into the traditional CNNs, which is used to extract gait features from gait silhouette images. Then, in the process of gait classification, using the output of CNN as input, we utilize metric learning techniques to calculate distance between two gaits and achieve gait classification by k-nearest neighbors classifiers. Finally, several experiments are conducted on two open-accessed gait datasets and demonstrate that our method reaches state-of-the-art performances in terms of correct recognition rate on the OULP and CASIA-B datasets.