• Title/Summary/Keyword: feature model validation

Search Result 111, Processing Time 0.025 seconds

A Pre-processing Study to Solve the Problem of Rare Class Classification of Network Traffic Data (네트워크 트래픽 데이터의 희소 클래스 분류 문제 해결을 위한 전처리 연구)

  • Ryu, Kyung Joon;Shin, DongIl;Shin, DongKyoo;Park, JeongChan;Kim, JinGoog
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.9 no.12
    • /
    • pp.411-418
    • /
    • 2020
  • In the field of information security, IDS(Intrusion Detection System) is normally classified in two different categories: signature-based IDS and anomaly-based IDS. Many studies in anomaly-based IDS have been conducted that analyze network traffic data generated in cyberspace by machine learning algorithms. In this paper, we studied pre-processing methods to overcome performance degradation problems cashed by rare classes. We experimented classification performance of a Machine Learning algorithm by reconstructing data set based on rare classes and semi rare classes. After reconstructing data into three different sets, wrapper and filter feature selection methods are applied continuously. Each data set is regularized by a quantile scaler. Depp neural network model is used for learning and validation. The evaluation results are compared by true positive values and false negative values. We acquired improved classification performances on all of three data sets.

Fault Diagnosis of Bearing Based on Convolutional Neural Network Using Multi-Domain Features

  • Shao, Xiaorui;Wang, Lijiang;Kim, Chang Soo;Ra, Ilkyeun
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.15 no.5
    • /
    • pp.1610-1629
    • /
    • 2021
  • Failures frequently occurred in manufacturing machines due to complex and changeable manufacturing environments, increasing the downtime and maintenance costs. This manuscript develops a novel deep learning-based method named Multi-Domain Convolutional Neural Network (MDCNN) to deal with this challenging task with vibration signals. The proposed MDCNN consists of time-domain, frequency-domain, and statistical-domain feature channels. The Time-domain channel is to model the hidden patterns of signals in the time domain. The frequency-domain channel uses Discrete Wavelet Transformation (DWT) to obtain the rich feature representations of signals in the frequency domain. The statistic-domain channel contains six statistical variables, which is to reflect the signals' macro statistical-domain features, respectively. Firstly, in the proposed MDCNN, time-domain and frequency-domain channels are processed by CNN individually with various filters. Secondly, the CNN extracted features from time, and frequency domains are merged as time-frequency features. Lastly, time-frequency domain features are fused with six statistical variables as the comprehensive features for identifying the fault. Thereby, the proposed method could make full use of those three domain-features for fault diagnosis while keeping high distinguishability due to CNN's utilization. The authors designed massive experiments with 10-folder cross-validation technology to validate the proposed method's effectiveness on the CWRU bearing data set. The experimental results are calculated by ten-time averaged accuracy. They have confirmed that the proposed MDCNN could intelligently, accurately, and timely detect the fault under the complex manufacturing environments, whose accuracy is nearly 100%.

Comparison of radiomics prediction models for lung metastases according to four semiautomatic segmentation methods in soft-tissue sarcomas of the extremities

  • Heesoon Sheen;Han-Back Shin;Jung Young Kim
    • Journal of the Korean Physical Society
    • /
    • v.80
    • /
    • pp.247-256
    • /
    • 2022
  • Our objective was to investigate radiomics signatures and prediction models defined by four segmentation methods in using 2-[18F]fluoro-2-deoxy-d-glucose positron emission tomography (18F-FDG PET) imaging of lung metastases of soft-tissue sarcomas (STSs). For this purpose, three fixed threshold methods using the standardized uptake value (SUV) and gradient-based edge detection (ED) were used for tumor delineation on the PET images of STSs. The Dice coefficients (DCs) of the segmentation methods were compared. The least absolute shrinkage and selection operator (LASSO) regression and Spearman's rank, and Friedman's ANOVA test were used for selection and validation of radiomics features. The developed radiomics models were assessed using ROC (receiver operating characteristics) curve and confusion matrices. According to the results, the DC values showed the biggest difference between SUV40% and other segmentation methods (DC: 0.55 and 0.59). Grey-level run-length matrix_run-length nonuniformity (GLRLM_RLNU) was a common radiomics signature extracted by all segmentation methods. The multivariable logistic regression of ED showed the highest area under the ROC (receiver operating characteristic) curve (AUC), sensitivity, specificity, and accuracy (AUC: 0.88, sensitivity: 0.85, specificity: 0.74, accuracy: 0.81). In our research, the ED method was able to derive a significant model of radiomics. GLRLM_RLNU which was selected from all segmented methods as a meaningful feature was considered the obvious radiomics feature associated with the heterogeneity and the aggressiveness. Our results have apparently showed that radiomics signatures have the potential to uncover tumor characteristics.

Air Passenger Demand Forecasting and Baggage Carousel Expansion: Application to Incheon International Airport (항공 수요예측 및 고객 수하물 컨베이어 확장 모형 연구 : 인천공항을 중심으로)

  • Yoon, Sung Wook;Jeong, Suk Jae
    • Journal of Korean Society of Transportation
    • /
    • v.32 no.4
    • /
    • pp.401-409
    • /
    • 2014
  • This study deals with capacity expansion planning of airport infrastructure in view of economic validation that reflect construction costs and social benefits according to the reduction of passengers' delay time. We first forecast the airport peak-demand which has a seasonal and cyclical feature with ARIMA model that has been one of the most widely used linear models in time series forecasting. A discrete event simulation model is built for estimating actual delay time of passengers that consider the passenger's dynamic flow within airport infrastructure after arriving at the airport. With the trade-off relationship between cost and benefit, we determine an economic quantity of conveyor that will be expanded. Through the experiment performed with the case study of Incheon international airport, we demonstrate that our approach can be an effective method to solve the airport expansion problem with seasonal passenger arrival and dynamic operational aspects in airport infrastructure.

Dynamic Bayesian Network based Two-Hand Gesture Recognition (동적 베이스망 기반의 양손 제스처 인식)

  • Suk, Heung-Il;Sin, Bong-Kee
    • Journal of KIISE:Software and Applications
    • /
    • v.35 no.4
    • /
    • pp.265-279
    • /
    • 2008
  • The idea of using hand gestures for human-computer interaction is not new and has been studied intensively during the last dorado with a significant amount of qualitative progress that, however, has been short of our expectations. This paper describes a dynamic Bayesian network or DBN based approach to both two-hand gestures and one-hand gestures. Unlike wired glove-based approaches, the success of camera-based methods depends greatly on the image processing and feature extraction results. So the proposed method of DBN-based inference is preceded by fail-safe steps of skin extraction and modeling, and motion tracking. Then a new gesture recognition model for a set of both one-hand and two-hand gestures is proposed based on the dynamic Bayesian network framework which makes it easy to represent the relationship among features and incorporate new information to a model. In an experiment with ten isolated gestures, we obtained the recognition rate upwards of 99.59% with cross validation. The proposed model and the related approach are believed to have a strong potential for successful applications to other related problems such as sign languages.

Dynamic RNN-CNN malware classifier correspond with Random Dimension Input Data (임의 차원 데이터 대응 Dynamic RNN-CNN 멀웨어 분류기)

  • Lim, Geun-Young;Cho, Young-Bok
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.23 no.5
    • /
    • pp.533-539
    • /
    • 2019
  • This study proposes a malware classification model that can handle arbitrary length input data using the Microsoft Malware Classification Challenge dataset. We are based on imaging existing data from malware. The proposed model generates a lot of images when malware data is large, and generates a small image of small data. The generated image is learned as time series data by Dynamic RNN. The output value of the RNN is classified into malware by using only the highest weighted output by applying the Attention technique, and learning the RNN output value by Residual CNN again. Experiments on the proposed model showed a Micro-average F1 score of 92% in the validation data set. Experimental results show that the performance of a model capable of learning and classifying arbitrary length data can be verified without special feature extraction and dimension reduction.

Simulation Research on the Thermal Effects in Dipolar Illuminated Lithography

  • Yao, Changcheng;Gong, Yan
    • Journal of the Optical Society of Korea
    • /
    • v.20 no.2
    • /
    • pp.251-256
    • /
    • 2016
  • The prediction of thermal effects in lithography projection objective plays a significant role in the real-time dynamic compensation of thermal aberrations. For the illuminated lithography projection objective, this paper applies finite element analysis to get the temperature distribution, surface deformation and stress data. To improve the efficiency, a temperature distribution function model is proposed to use for the simulation of thermal aberrations with the help of optical analysis software CODE V. SigFit is approved integrated optomechanical analysis software with the feature of calculating OPD effects due to temperature change, and it is utilized to prove the validation of the temperature distribution function. Results show that the impact of surface deformation and stress is negligible compared with the refractive index change; astigmatisms and 4-foil aberrations dominate in the thermal aberration, about 1.7 λ and 0.45 λ. The system takes about one hour to reach thermal equilibrium and the contrast of the imaging of dense lines get worse as time goes on.

Multi-Task FaceBoxes: A Lightweight Face Detector Based on Channel Attention and Context Information

  • Qi, Shuaihui;Yang, Jungang;Song, Xiaofeng;Jiang, Chen
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.14 no.10
    • /
    • pp.4080-4097
    • /
    • 2020
  • In recent years, convolutional neural network (CNN) has become the primary method for face detection. But its shortcomings are obvious, such as expensive calculation, heavy model, etc. This makes CNN difficult to use on the mobile devices which have limited computing and storage capabilities. Therefore, the design of lightweight CNN for face detection is becoming more and more important with the popularity of smartphones and mobile Internet. Based on the CPU real-time face detector FaceBoxes, we propose a multi-task lightweight face detector, which has low computing cost and higher detection precision. First, to improve the detection capability, the squeeze and excitation modules are used to extract attention between channels. Then, the textual and semantic information are extracted by shallow networks and deep networks respectively to get rich features. Finally, the landmark detection module is used to improve the detection performance for small faces and provide landmark data for face alignment. Experiments on AFW, FDDB, PASCAL, and WIDER FACE datasets show that our algorithm has achieved significant improvement in the mean average precision. Especially, on the WIDER FACE hard validation set, our algorithm outperforms the mean average precision of FaceBoxes by 7.2%. For VGA-resolution images, the running speed of our algorithm can reach 23FPS on a CPU device.

Micro-Expression Recognition Base on Optical Flow Features and Improved MobileNetV2

  • Xu, Wei;Zheng, Hao;Yang, Zhongxue;Yang, Yingjie
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.15 no.6
    • /
    • pp.1981-1995
    • /
    • 2021
  • When a person tries to conceal emotions, real emotions will manifest themselves in the form of micro-expressions. Research on facial micro-expression recognition is still extremely challenging in the field of pattern recognition. This is because it is difficult to implement the best feature extraction method to cope with micro-expressions with small changes and short duration. Most methods are based on hand-crafted features to extract subtle facial movements. In this study, we introduce a method that incorporates optical flow and deep learning. First, we take out the onset frame and the apex frame from each video sequence. Then, the motion features between these two frames are extracted using the optical flow method. Finally, the features are inputted into an improved MobileNetV2 model, where SVM is applied to classify expressions. In order to evaluate the effectiveness of the method, we conduct experiments on the public spontaneous micro-expression database CASME II. Under the condition of applying the leave-one-subject-out cross-validation method, the recognition accuracy rate reaches 53.01%, and the F-score reaches 0.5231. The results show that the proposed method can significantly improve the micro-expression recognition performance.

Several models for tunnel boring machine performance prediction based on machine learning

  • Mahmoodzadeh, Arsalan;Nejati, Hamid Reza;Ibrahim, Hawkar Hashim;Ali, Hunar Farid Hama;Mohammed, Adil Hussein;Rashidi, Shima;Majeed, Mohammed Kamal
    • Geomechanics and Engineering
    • /
    • v.30 no.1
    • /
    • pp.75-91
    • /
    • 2022
  • This paper aims to show how to use several Machine Learning (ML) methods to estimate the TBM penetration rate systematically (TBM-PR). To this end, 1125 datasets including uniaxial compressive strength (UCS), Brazilian tensile strength (BTS), punch slope index (PSI), distance between the planes of weakness (DPW), orientation of discontinuities (alpha angle-α), rock fracture class (RFC), and actual/measured TBM-PRs were established. To evaluate the ML methods' ability to perform, the 5-fold cross-validation was taken into consideration. Eventually, comparing the ML outcomes and the TBM monitoring data indicated that the ML methods have a very good potential ability in the prediction of TBM-PR. However, the long short-term memory model with a correlation coefficient of 0.9932 and a route mean square error of 2.68E-6 outperformed the remaining six ML algorithms. The backward selection method showed that PSI and RFC were more and less significant parameters on the TBM-PR compared to the others.