• 제목/요약/키워드: SVM Model

검색결과 698건 처리시간 0.024초

HR 데이터 기반의 퇴사 예측 모델 개발 (Development of a Resignation Prediction Model using HR Data)

  • 박연정;이도길
    • 한국정보처리학회:학술대회논문집
    • /
    • 한국정보처리학회 2021년도 춘계학술발표대회
    • /
    • pp.297-300
    • /
    • 2021
  • 대부분의 기업에서는 우수한 인적 자원의 유출을 방지하기 위해 직원들이 이직 및 퇴사하는 이유를 연구한다. 이에 기업은 직원이 퇴사하기 전에 면담을 하거나 설문조사를 통해서 연구에 필요한 데이터를 얻는다. 하지만 설문조사에서는 직원들이 직장 생활을 하는 데에 불리할 수도 있는 의견을 드러내려고 하지 않아 정확한 결과를 얻기 힘든 것이 현실이다. 한편, 한국노동연구원에서 발표한 자료에 따르면 기업이 요구하는 최소 학력 수준과 직원의 학력 수준 간의 차이가 클수록 이직 경향이 커진다. 따라서 본 연구에서는 한국노동연구원의 자료에 착안하여, 직원이 가지고 있는 객관적 데이터인 전공, 교육수준, 재직 중인 회사 유형 등의 데이터를 기반으로 직원의 퇴사 여부를 예측하고자 한다. 퇴사 예측 모델을 생성하기 위해 Decision Tree, XGBoost, kNN, SVM을 활용하였으며 각각의 성능을 비교했다. 이 결과, 지금까지 설문조사로 진행되었던 연구에서 파악하지 못한 다양한 요인을 알아낼 수 있었다. 이를 통해 기업이 퇴사 예측 모델을 이용하여 직원이 퇴사하기 전에 미리 이를 인지하고 방지하는 데에 도움을 줄 수 있을 것으로 예상된다.

특징 추출 기법을 이용한 사용자 행동 인식 모델 (Human Action Recognition Model using Feature Engineering)

  • 김다혜;한예찬;정영섭;김재윤
    • 한국컴퓨터정보학회:학술대회논문집
    • /
    • 한국컴퓨터정보학회 2021년도 제64차 하계학술대회논문집 29권2호
    • /
    • pp.47-48
    • /
    • 2021
  • 사용자 행동 인식(HAR)은 사용자의 행동을 분석하여 사용자의 현재 행동을 추측하는 것이며, 센서 데이터에서 특성을 추출하는 것이 중요하다. 본 연구에서는 다양한 특징 추출 기법을 사용하여 기계학습 모델을 비교한다. 변수마다 특성에 맞는 기법을 사용했으며, 정확도와 Kappa 통계량, F1 score 모두 SVM 모델에서 95.2%, 94.2%, 95.1%로 가장 높았다. 이는 기계학습 모델에서 특징 추출 기법을 사용하여 우수한 정확도를 달성할 수 있음을 보인다.

  • PDF

STATISTICAL MODELLING USING DATA MINING TOOLS IN MERGERS AND ACQUISITION WITH REGARDS TO MANUFACTURE & SERVICE SECTOR

  • KALAIVANI, S.;SIVAKUMAR, K.;VIJAYARANGAM, J.
    • Journal of applied mathematics & informatics
    • /
    • 제40권3_4호
    • /
    • pp.563-575
    • /
    • 2022
  • Many organizations seek statistical modelling facilitated by data analytics technologies for determining the prediction models associated with M&A (Merger and Acquisition). By combining these data analytics tool alongside with data collection approaches aids organizations towards M&A decision making, followed by achieving profitable insights as well. It promotes for better visibility, overall improvements and effective negotiation strategies for post-M&A integration. This paper explores on the impact of pre and post integration of M&A in a standard organizational setting via devising a suitable statistical model via employing techniques such as Naïve Bayes, K-nearest neighbour (KNN), and Decision Tree & Support Vector Machine (SVM).

IP카메라의 DoS 공격 탐지 머신러닝 모델에 대한 연구 (A Study on Machine Learning model for detection of DoS Attack)

  • 정웅교;김동영;곽병일
    • 한국정보처리학회:학술대회논문집
    • /
    • 한국정보처리학회 2022년도 추계학술발표대회
    • /
    • pp.709-711
    • /
    • 2022
  • ICT 기술의 빠른 발전과 함께 Internet of Things (IoT) 환경에서의 Internet Protocol (IP) 카메라의 사용률이 증가하면서, IP 카메라에 대한 개인정보 이슈와 제품의 보안성 검토 관련 소비자의 개인정보 유출 우려가 증가하고 있다. 본 논문에서는, IP 카메라에 대한 4개 종류의 Denial of Service (DoS) 공격을 통해 IP 카메라 이상 반응을 확인했다. 또한, 이 과정에서 수집한 공격 패킷 데이터를 기반으로, DoS 공격을 탐지하는 간단한 피쳐 구성과 머신러닝 모델을 제안하였다. 최종적으로, DoS 공격을 통해 실제 IP 카메라에 대한 가용성 테스트를 수행하였으며 머신러닝 알고리즘 4개 Decision Tree, Random Forest, Multilayer Perceptron, SVM에서의 DoS 공격 탐지 성능을 비교하였다.

SIFT 및 HSV 특징 추출 기반 폐기물 객체 유사도 측정 모델 (The SIFT and HSV feature extraction-based waste Object similarity measurement model)

  • 고준혁 ;최혁순 ;김진아 ;문남미
    • 한국정보처리학회:학술대회논문집
    • /
    • 한국정보처리학회 2023년도 추계학술발표대회
    • /
    • pp.1220-1223
    • /
    • 2023
  • 폐기물을 처리하는데 있어 배출과 수거에 대한 프로세스 자동화를 위해 폐기물 객체 유사도 판별이 요구된다. 이를 위해 본 연구에서는 폐기물 데이터셋에서 SIFT(Scale-Invariant Feature Transform)와 HSV(Hue, Saturation, Value)기반으로 두 이미지의 공통된 특징을 추출해 융합하고, 기계학습을 통해 이미지 객체 간의 유사도를 측정하는 모델을 제안한다. 실험을 위해 수집된 폐기물 데이터셋 81,072 장을 활용하여 이미지를 학습시키고, 전통적인 임계치 기반 유사도 측정과 본 논문에서 제시하는 유사도 측정을 비교하여 성능을 확인하였다. 임계치 기반 측정에서 SIFT 와 HSV 는 각각 0.82, 0.89(Acc)가 측정되었고, 본 논문에서 제시한 특징 추출 방법을 사용한 기계학습의 성능은 DT(Decision Tree)와 SVM(Support Vector Machine) 모두 0.93 (Acc)로 4%의 정확도가 향상되었다.

Forecasting realized volatility using data normalization and recurrent neural network

  • Yoonjoo Lee;Dong Wan Shin;Ji Eun Choi
    • Communications for Statistical Applications and Methods
    • /
    • 제31권1호
    • /
    • pp.105-127
    • /
    • 2024
  • We propose recurrent neural network (RNN) methods for forecasting realized volatility (RV). The data are RVs of ten major stock price indices, four from the US, and six from the EU. Forecasts are made for relative ratio of adjacent RVs instead of the RV itself in order to avoid the out-of-scale issue. Forecasts of RV ratios distribution are first constructed from which those of RVs are computed which are shown to be better than forecasts constructed directly from RV. The apparent asymmetry of RV ratio is addressed by the Piecewise Min-max (PM) normalization. The serial dependence of the ratio data renders us to consider two architectures, long short-term memory (LSTM) and gated recurrent unit (GRU). The hyperparameters of LSTM and GRU are tuned by the nested cross validation. The RNN forecast with the PM normalization and ratio transformation is shown to outperform other forecasts by other RNN models and by benchmarking models of the AR model, the support vector machine (SVM), the deep neural network (DNN), and the convolutional neural network (CNN).

Anomaly-Based Network Intrusion Detection: An Approach Using Ensemble-Based Machine Learning Algorithm

  • Kashif Gul Chachar;Syed Nadeem Ahsan
    • International Journal of Computer Science & Network Security
    • /
    • 제24권1호
    • /
    • pp.107-118
    • /
    • 2024
  • With the seamless growth of the technology, network usage requirements are expanding day by day. The majority of electronic devices are capable of communication, which strongly requires a secure and reliable network. Network-based intrusion detection systems (NIDS) is a new method for preventing and alerting computers and networks from attacks. Machine Learning is an emerging field that provides a variety of ways to implement effective network intrusion detection systems (NIDS). Bagging and Boosting are two ensemble ML techniques, renowned for better performance in the learning and classification process. In this paper, the study provides a detailed literature review of the past work done and proposed a novel ensemble approach to develop a NIDS system based on the voting method using bagging and boosting ensemble techniques. The test results demonstrate that the ensemble of bagging and boosting through voting exhibits the highest classification accuracy of 99.98% and a minimum false positive rate (FPR) on both datasets. Although the model building time is average which can be a tradeoff by processor speed.

역삼투압 해수담수화(SWRO) 플랜트에서 독립변수의 다중공선성을 고려한 예측모델에 관한 연구 (A Study on the Prediction Model Considering the Multicollinearity of Independent Variables in the Seawater Reverse Osmosis)

  • 한인섭;윤연아;장태우;김용수
    • 품질경영학회지
    • /
    • 제48권1호
    • /
    • pp.171-186
    • /
    • 2020
  • Purpose: The purpose of this study is conducting of predictive models that considered multicollinearity of independent variables in order to carry out more efficient and reliable predictions about differential pressure in seawater reverse osmosis. Methods: The main variables of each RO system are extracted through factor analysis. Common variables are derived through comparison of RO system # 1 and RO system # 2. In order to carry out the prediction modeling about the differential pressure, which is the target variable, we constructed the prediction model reflecting the regression analysis, the artificial neural network, and the support vector machine in R package, and figured out the superiority of the model by comparing RMSE. Results: The number of factors extracted from factor analysis of RO system #1 and RO system #2 is same. And the value of variability(% Var) increased as step proceeds according to the analysis procedure. As a result of deriving the average RMSE of the models, the overall prediction of the SVM was superior to the other models. Conclusion: This study is meaningful in that it has been conducting a demonstration study of considering the multicollinearity of independent variables. Before establishing a predictive model for a target variable, it would be more accurate predictive model if the relevant variables are derived and reflected.

지도학습 알고리즘 기반 3D 노지 작물 구분 모델 개발 (Development of 3D Crop Segmentation Model in Open-field Based on Supervised Machine Learning Algorithm)

  • 정영준;이종혁;이상익;오부영;;서병훈;김동수;서예진;최원
    • 한국농공학회논문집
    • /
    • 제64권1호
    • /
    • pp.15-26
    • /
    • 2022
  • 3D open-field farm model developed from UAV (Unmanned Aerial Vehicle) data could make crop monitoring easier, also could be an important dataset for various fields like remote sensing or precision agriculture. It is essential to separate crops from the non-crop area because labeling in a manual way is extremely laborious and not appropriate for continuous monitoring. We, therefore, made a 3D open-field farm model based on UAV images and developed a crop segmentation model using a supervised machine learning algorithm. We compared performances from various models using different data features like color or geographic coordinates, and two supervised learning algorithms which are SVM (Support Vector Machine) and KNN (K-Nearest Neighbors). The best approach was trained with 2-dimensional data, ExGR (Excess of Green minus Excess of Red) and z coordinate value, using KNN algorithm, whose accuracy, precision, recall, F1 score was 97.85, 96.51, 88.54, 92.35% respectively. Also, we compared our model performance with similar previous work. Our approach showed slightly better accuracy, and it detected the actual crop better than the previous approach, while it also classified actual non-crop points (e.g. weeds) as crops.

Using Keystroke Dynamics for Implicit Authentication on Smartphone

  • Do, Son;Hoang, Thang;Luong, Chuyen;Choi, Seungchan;Lee, Dokyeong;Bang, Kihyun;Choi, Deokjai
    • 한국멀티미디어학회논문지
    • /
    • 제17권8호
    • /
    • pp.968-976
    • /
    • 2014
  • Authentication methods on smartphone are demanded to be implicit to users with minimum users' interaction. Existing authentication methods (e.g. PINs, passwords, visual patterns, etc.) are not effectively considering remembrance and privacy issues. Behavioral biometrics such as keystroke dynamics and gait biometrics can be acquired easily and implicitly by using integrated sensors on smartphone. We propose a biometric model involving keystroke dynamics for implicit authentication on smartphone. We first design a feature extraction method for keystroke dynamics. And then, we build a fusion model of keystroke dynamics and gait to improve the authentication performance of single behavioral biometric on smartphone. We operate the fusion at both feature extraction level and matching score level. Experiment using linear Support Vector Machines (SVM) classifier reveals that the best results are achieved with score fusion: a recognition rate approximately 97.86% under identification mode and an error rate approximately 1.11% under authentication mode.