• Title/Summary/Keyword: feature ranking

Search Result 48, Processing Time 0.032 seconds

Cross-architecture Binary Function Similarity Detection based on Composite Feature Model

  • Xiaonan Li;Guimin Zhang;Qingbao Li;Ping Zhang;Zhifeng Chen;Jinjin Liu;Shudan Yue
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.17 no.8
    • /
    • pp.2101-2123
    • /
    • 2023
  • Recent studies have shown that the neural network-based binary code similarity detection technology performs well in vulnerability mining, plagiarism detection, and malicious code analysis. However, existing cross-architecture methods still suffer from insufficient feature characterization and low discrimination accuracy. To address these issues, this paper proposes a cross-architecture binary function similarity detection method based on composite feature model (SDCFM). Firstly, the binary function is converted into vector representation according to the proposed composite feature model, which is composed of instruction statistical features, control flow graph structural features, and application program interface calling behavioral features. Then, the composite features are embedded by the proposed hierarchical embedding network based on a graph neural network. In which, the block-level features and the function-level features are processed separately and finally fused into the embedding. In addition, to make the trained model more accurate and stable, our method utilizes the embeddings of predecessor nodes to modify the node embedding in the iterative updating process of the graph neural network. To assess the effectiveness of composite feature model, we contrast SDCFM with the state of art method on benchmark datasets. The experimental results show that SDCFM has good performance both on the area under the curve in the binary function similarity detection task and the vulnerable candidate function ranking in vulnerability search task.

A new feature ranking and feature selection framework for realtime IDS (실시간 침입탐지 시스템을 위한 새로운 특징랭킹과 특징선택 프레임워크에 대한 연구)

  • Lee, Sang-Jae;Kim, Se-Heon
    • Proceedings of the Korean Operations and Management Science Society Conference
    • /
    • 2008.10a
    • /
    • pp.514-518
    • /
    • 2008
  • 인터넷의 보급에 따라 네트워크를 통한 공격에 피해가 급증하고 있다. 이러한 네트워크 침해를 막기위해 여러 연구자들은 침입탐지 시스템(IDS)을 제안하였으나, 시스템의 탐지율에만 초점을 맞추고 있기 때문에 실시간(Realtime)으로 동작하지 못하고 있다. 실시간 IDS를 위하여 최근 다양한 특징선택(Feature selection)들이 제안되고 있다. 본1) 논문에서는 특징들을 중요도의 순위를 정하는 새로운 랭킹 방법과 이 방법에 따라서 특징을 선택하는 특징 선택 알고리즘을 제안한다. 또한 제안된 알고리즘을 통하여 선택된 특징을 사용할 경우 탐지결과가 우수함을 실험으로 보여주고 있다.

  • PDF

Feature Parameter Analysis for Rotor Fault Diagnosis (회전체 결함 진단을 위한 특징 파라미터 분석)

  • Jeoung, Rae-Hycuk;Chai, Jang-Bom;Lee, Byoung-Hak;Lee, Do-Hwan;Lee, Byung-Kon
    • The KSFM Journal of Fluid Machinery
    • /
    • v.15 no.6
    • /
    • pp.31-38
    • /
    • 2012
  • Rotor of rotating machinery is the highly damaged part. Fault of 7 different types was confirmed as the main causes of rotor damage from the pump failure history data in domestic and U.S. nuclear. For each fault types, simulation testing was performed and fault signals were collected form the sensors. To calculate the statistical parameters of time-domain & frequency-domain, measured signals were analyzed by using the discrete wavelet transform, fast fourier transform, statistical analysis. Total 84 parameters were obtained. And Effectiveness factor were used to evaluate the discrimination capacity of each parameter. From the effectiveness factor, RAW-P4/RAW-P7/WT2-NNL/WT2-EE/WT1-P1 showed high ranking. Finally, these parameters were selected as the feature parameters of intelligent fault diagnostics for rotor.

Feature Selection Method by Information Theory and Particle S warm Optimization (상호정보량과 Binary Particle Swarm Optimization을 이용한 속성선택 기법)

  • Cho, Jae-Hoon;Lee, Dae-Jong;Song, Chang-Kyu;Chun, Myung-Geun
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.19 no.2
    • /
    • pp.191-196
    • /
    • 2009
  • In this paper, we proposed a feature selection method using Binary Particle Swarm Optimization(BPSO) and Mutual information. This proposed method consists of the feature selection part for selecting candidate feature subset by mutual information and the optimal feature selection part for choosing optimal feature subset by BPSO in the candidate feature subsets. In the candidate feature selection part, we computed the mutual information of all features, respectively and selected a candidate feature subset by the ranking of mutual information. In the optimal feature selection part, optimal feature subset can be found by BPSO in the candidate feature subset. In the BPSO process, we used multi-object function to optimize both accuracy of classifier and selected feature subset size. DNA expression dataset are used for estimating the performance of the proposed method. Experimental results show that this method can achieve better performance for pattern recognition problems than conventional ones.

A characteristic-based technology measurement with market factor considered (시장요인이 고려된 특성치 준거 기술측정)

  • 김성철;유평일
    • Korean Management Science Review
    • /
    • v.11 no.2
    • /
    • pp.237-253
    • /
    • 1994
  • Technology measurement is related with how to construct indicators of technological change and relative ranking of technological sophistication. Many attempts have been made to understand the measurement of technology. However, technology measurement still remains little understood problem in spite of its importance. This article is concerned with improving the measurement of technology by introducing market factors into the model. It illustrate a simple approach to the measurement of technology. This approach is based on the characteristic-space paradigm of technology. A relative ranking of technological sophistication for a product is measurable as a set of characteristics. The main feature of the proposed approach is the combination of technical factors and market factors. Technical factors are reflected in the definition of technological sophistication. Market factors are embraced in the determination of the relative importance assigned to each technology defining characteristics. Thus, the weight is determined by technical factors and market factors, which differentiates the study from the past based on judgmental technique such as experts' opinion.

  • PDF

Analyzing empirical performance of correlation based feature selection with company credit rank score dataset - Emphasis on KOSPI manufacturing companies -

  • Nam, Youn Chang;Lee, Kun Chang
    • Journal of the Korea Society of Computer and Information
    • /
    • v.21 no.4
    • /
    • pp.63-71
    • /
    • 2016
  • This paper is about applying efficient data mining method which improves the score calculation and proper building performance of credit ranking score system. The main idea of this data mining technique is accomplishing such objectives by applying Correlation based Feature Selection which could also be used to verify the properness of existing rank scores quickly. This study selected 2047 manufacturing companies on KOSPI market during the period of 2009 to 2013, which have their own credit rank scores given by NICE information service agency. Regarding the relevant financial variables, total 80 variables were collected from KIS-Value and DART (Data Analysis, Retrieval and Transfer System). If correlation based feature selection could select more important variables, then required information and cost would be reduced significantly. Through analysis, this study show that the proposed correlation based feature selection method improves selection and classification process of credit rank system so that the accuracy and credibility would be increased while the cost for building system would be decreased.

A Feature Selection Method Based on Fuzzy Cluster Analysis (퍼지 클러스터 분석 기반 특징 선택 방법)

  • Rhee, Hyun-Sook
    • The KIPS Transactions:PartB
    • /
    • v.14B no.2
    • /
    • pp.135-140
    • /
    • 2007
  • Feature selection is a preprocessing technique commonly used on high dimensional data. Feature selection studies how to select a subset or list of attributes that are used to construct models describing data. Feature selection methods attempt to explore data's intrinsic properties by employing statistics or information theory. The recent developments have involved approaches like correlation method, dimensionality reduction and mutual information technique. This feature selection have become the focus of much research in areas of applications with massive and complex data sets. In this paper, we provide a feature selection method considering data characteristics and generalization capability. It provides a computational approach for feature selection based on fuzzy cluster analysis of its attribute values and its performance measures. And we apply it to the system for classifying computer virus and compared with heuristic method using the contrast concept. Experimental result shows the proposed approach can give a feature ranking, select the features, and improve the system performance.

A machine learning informed prediction of severe accident progressions in nuclear power plants

  • JinHo Song;SungJoong Kim
    • Nuclear Engineering and Technology
    • /
    • v.56 no.6
    • /
    • pp.2266-2273
    • /
    • 2024
  • A machine learning platform is proposed for the diagnosis of a severe accident progression in a nuclear power plant. To predict the key parameters for accident management including lost signals, a long short term memory (LSTM) network is proposed, where multiple accident scenarios are used for training. Training and test data were produced by MELCOR simulation of the Fukushima Daiichi Nuclear Power Plant (FDNPP) accident at unit 3. Feature variables were selected among plant parameters, where the importance ranking was determined by a recursive feature elimination technique using RandomForestRegressor. To answer the question of whether a reduced order ML model could predict the complex transient response, we performed a systematic sensitivity study for the choices of target variables, the combination of training and test data, the number of feature variables, and the number of neurons to evaluate the performance of the proposed ML platform. The number of sensitivity cases was chosen to guarantee a 95 % tolerance limit with a 95 % confidence level based on Wilks' formula to quantify the uncertainty of predictions. The results of investigations indicate that the proposed ML platform consistently predicts the target variable. The median and mean predictions were close to the true value.

Optimization Design of Log-periodic Dipole Antenna Arrays Via Multiobjective Genetic Algorithms

  • Wang, H.J.
    • Proceedings of the KSRS Conference
    • /
    • 2003.11a
    • /
    • pp.1353-1355
    • /
    • 2003
  • Genetic algorithms (GA) is a well known technique that is capable of handling multiobjective functions and discrete constraints in the process of numerical optimization. Together with the Pareto ranking scheme, more than one possible solution can be obtained despite the imposed constraints and multi-criteria design functions. In view of this unique capability, the design of the log-periodic dipole antenna array (LPDA) using this special feature is proposed in this paper. This method also provides gain, front-back level and S parameter design tradeoff for the LPDA design in broadband application at no extra computational cost.

  • PDF

Comparative Study of GDPA and Hough Transformation for Linear Feature Extraction using Space-borne Imagery (위성 영상정보를 이용한 선형 지형지물 추출에서의 GDPA와 Hough 변환 처리결과 비교연구)

  • Lee Kiwon;Ryu Hee-Young;Kwon Byung-Doo
    • Korean Journal of Remote Sensing
    • /
    • v.20 no.4
    • /
    • pp.261-274
    • /
    • 2004
  • The feature extraction using remotely sensed imagery has been recognized one of the important tasks in remote sensing applications. As the high-resolution imagery are widely used to the engineering purposes, need of more accurate feature information also is increasing. Especially, in case of the automatic extraction of linear feature such as road using mid or low-resolution imagery, several techniques was developed and applied in the mean time. But quantitatively comparative analysis of techniques and case studies for high-resolution imagery is rare. In this study, we implemented a computer program to perform and compare GDPA (Gradient Direction Profile Analysis) algorithm and Hough transformation. Also the results of applying two techniques to some images were compared with road centerline layers and boundary layers of digital map and presented. For quantitative comparison, the ranking method using commission error and omission error was used. As results, Hough transform had high accuracy over 20% on the average. As for execution speed, GDPA shows main advantage over Hough transform. But the accuracy was not remarkable difference between GDPA and Hough transform, when the noise removal was app]ied to the result of GDPA. In conclusion, it is expected that GDPA have more advantage than Hough transform in the application side.