Search | Korea Science

Network Traffic Measurement Analysis using Machine Learning

Hae-Duck Joshua Jeong
- Korean Journal of Artificial Intelligence
- /
- v.11 no.2
- /
- pp.19-27
- /
- 2023
In recent times, an exponential increase in Internet traffic has been observed as a result of advancing development of the Internet of Things, mobile networks with sensors, and communication functions within various devices. Further, the COVID-19 pandemic has inevitably led to an explosion of social network traffic. Within this context, considerable attention has been drawn to research on network traffic analysis based on machine learning. In this paper, we design and develop a new machine learning framework for network traffic analysis whereby normal and abnormal traffic is distinguished from one another. To achieve this, we combine together well-known machine learning algorithms and network traffic analysis techniques. Using one of the most widely used datasets KDD CUP'99 in the Weka and Apache Spark environments, we compare and investigate results obtained from time series type analysis of various aspects including malicious codes, feature extraction, data formalization, network traffic measurement tool implementation. Experimental analysis showed that while both the logistic regression and the support vector machine algorithm were excellent for performance evaluation, among these, the logistic regression algorithm performs better. The quantitative analysis results of our proposed machine learning framework show that this approach is reliable and practical, and the performance of the proposed system and another paper is compared and analyzed. In addition, we determined that the framework developed in the Apache Spark environment exhibits a much faster processing speed in the Spark environment than in Weka as there are more datasets used to create and classify machine learning models.
https://doi.org/10.24225/kjai.2023.11.2.19 인용 PDF

Deep LS-SVM for regression

Hwang, Changha;Shim, Jooyong
- Journal of the Korean Data and Information Science Society
- /
- v.27 no.3
- /
- pp.827-833
- /
- 2016
In this paper, we propose a deep least squares support vector machine (LS-SVM) for regression problems, which consists of the input layer and the hidden layer. In the hidden layer, LS-SVMs are trained with the original input variables and the perturbed responses. For the final output, the main LS-SVM is trained with the outputs from LS-SVMs of the hidden layer as input variables and the original responses. In contrast to the multilayer neural network (MNN), LS-SVMs in the deep LS-SVM are trained to minimize the penalized objective function. Thus, the learning dynamics of the deep LS-SVM are entirely different from MNN in which all weights and biases are trained to minimize one final error function. When compared to MNN approaches, the deep LS-SVM does not make use of any combination weights, but trains all LS-SVMs in the architecture. Experimental results from real datasets illustrate that the deep LS-SVM significantly outperforms state of the art machine learning methods on regression problems.
https://doi.org/10.7465/jkdi.2016.27.3.827 인용 PDF KSCI

A Study on the Short-term Load Forecasting using Support Vector Machine (지원벡터머신을 이용한 단기전력 수요예측에 관한 연구)

Jo, Nam-Hoon;Song, Kyung-Bin;Roh, Young-Su;Kang, Dae-Seung
- The Transactions of the Korean Institute of Electrical Engineers A
- /
- v.55 no.7
- /
- pp.306-312
- /
- 2006
Support Vector Machine(SVM), of which the foundations have been developed by Vapnik (1995), is gaining popularity thanks to many attractive features and promising empirical performance. In this paper, we propose a new short-term load forecasting technique based on SVM. We discuss the input vector selection of SVM for load forecasting and analyze the prediction performance for various SVM parameters such as kernel function, cost coefficient C, and $\varepsilon$ (the width of 8 $\varepsilon-tube$). The computer simulation shows that the prediction performance of the proposed method is superior to that of the conventional neural networks.
PDF KSCI

Kernel Adatron Algorithm of Support Vector Machine for Function Approximation (함수근사를 위한 서포트 벡터 기계의 커널 애더트론 알고리즘)

Seok, Kyung-Ha;Hwang, Chang-Ha
- The Transactions of the Korea Information Processing Society
- /
- v.7 no.6
- /
- pp.1867-1873
- /
- 2000
Function approximation from a set of input-output pairs has numerous applications in scientific and engineering areas. Support vector machine (SVM) is a new and very promising classification, regression and function approximation technique developed by Vapnik and his group at AT&TG Bell Laboratories. However, it has failed to establish itself as common machine learning tool. This is partly due to the fact that this is not easy to implement, and its standard implementation requires the use of optimization package for quadratic programming (QP). In this appear we present simple iterative Kernel Adatron (KA) algorithm for function approximation and compare it with standard SVM algorithm using QP.
PDF

Two Machine Learning Models for Mobile Phone Battery Discharge Rate Prediction Based on Usage Patterns

Chantrapornchai, Chantana;Nusawat, Paingruthai
- Journal of Information Processing Systems
- /
- v.12 no.3
- /
- pp.436-454
- /
- 2016
This research presents the battery discharge rate models for the energy consumption of mobile phone batteries based on machine learning by taking into account three usage patterns of the phone: the standby state, video playing, and web browsing. We present the experimental design methodology for collecting data, preprocessing, model construction, and parameter selections. The data is collected based on the HTC One X hardware platform. We considered various setting factors, such as Bluetooth, brightness, 3G, GPS, Wi-Fi, and Sync. The battery levels for each possible state vector were measured, and then we constructed the battery prediction model using different regression functions based on the collected data. The accuracy of the constructed models using the multi-layer perceptron (MLP) and the support vector machine (SVM) were compared using varying kernel functions. Various parameters for MLP and SVM were considered. The measurement of prediction efficiency was done by the mean absolute error (MAE) and the root mean squared error (RMSE). The experiments showed that the MLP with linear regression performs well overall, while the SVM with the polynomial kernel function based on the linear regression gives a low MAE and RMSE. As a result, we were able to demonstrate how to apply the derived model to predict the remaining battery charge.
https://doi.org/10.3745/JIPS.03.0048 인용 PDF KSCI

Purchase Prediction Model using the Support Vector Machine (Support Vector Machine을 이용한 고객구매예측모형)

Ahn, Hyun-Chul;Han, In-Goo;Kim, Kyoung-Jae
- Journal of Intelligence and Information Systems
- /
- v.11 no.3
- /
- pp.69-81
- /
- 2005
As the competition in business becomes severe, companies are focusing their capacity on customer relationship management (CRM) for survival. One of the important issues in CRM is to build a purchase prediction model, which classifies customers into either purchasing or non-purchasing groups. Until now, various techniques for building purchase prediction models have been proposed. However, they have been criticized because their performances are generally low, or it requires much effort to build and maintain them. Thus, in this study, we propose the support vector machine (SVM) a tool for building a purchase prediction model. The SVM is known as the technique that not only produces accurate prediction results but also enables training with the small sample size. To validate the usefulness of SVM, we apply it and some of other comparative techniques to a real-world purchase prediction case. Experimental results show that SVM outperforms all the comparative models including logistic regression and artificial neural networks.
PDF

Quantile regression using asymmetric Laplace distribution (비대칭 라플라스 분포를 이용한 분위수 회귀)

Park, Hye-Jung
- Journal of the Korean Data and Information Science Society
- /
- v.20 no.6
- /
- pp.1093-1101
- /
- 2009
Quantile regression has become a more widely used technique to describe the distribution of a response variable given a set of explanatory variables. This paper proposes a novel modelfor quantile regression using doubly penalized kernel machine with support vector machine iteratively reweighted least squares (SVM-IRWLS). To make inference about the shape of a population distribution, the widely popularregression, would be inadequate, if the distribution is not approximately Gaussian. We present a likelihood-based approach to the estimation of the regression quantiles that uses the asymmetric Laplace density.
PDF

Seismic response of soil-structure interaction using the support vector regression

Mirhosseini, Ramin Tabatabaei
- Structural Engineering and Mechanics
- /
- v.63 no.1
- /
- pp.115-124
- /
- 2017
In this paper, a different technique to predict the effects of soil-structure interaction (SSI) on seismic response of building systems is investigated. The technique use a machine learning algorithm called Support Vector Regression (SVR) with technical and analytical results as input features. Normally, the effects of SSI on seismic response of existing building systems can be identified by different types of large data sets. Therefore, predicting and estimating the seismic response of building is a difficult task. It is possible to approximate a real valued function of the seismic response and make accurate investing choices regarding the design of building system and reduce the risk involved, by giving the right experimental and/or numerical data to a machine learning regression, such as SVR. The seismic response of both single-degree-of-freedom system and six-storey RC frame which can be represent of a broad range of existing structures, is estimated using proposed SVR model, while allowing flexibility of the soil-foundation system and SSI effects. The seismic response of both single-degree-of-freedom system and six-storey RC frame which can be represent of a broad range of existing structures, is estimated using proposed SVR model, while allowing flexibility of the soil-foundation system and SSI effects. The results show that the performance of the technique can be predicted by reducing the number of real data input features. Further, performance enhancement was achieved by optimizing the RBF kernel and SVR parameters through grid search.
https://doi.org/10.12989/sem.2017.63.1.115 인용 KSCI

Sparse Kernel Regression using IRWLS Procedure

Park, Hye-Jung
- Journal of the Korean Data and Information Science Society
- /
- v.18 no.3
- /
- pp.735-744
- /
- 2007
Support vector machine(SVM) is capable of providing a more complete description of the linear and nonlinear relationships among random variables. In this paper we propose a sparse kernel regression(SKR) to overcome a weak point of SVM, which is, the steep growth of the number of support vectors with increasing the number of training data. The iterative reweighted least squares(IRWLS) procedure is used to solve the optimal problem of SKR with a Laplacian prior. Furthermore, the generalized cross validation(GCV) function is introduced to select the hyper-parameters which affect the performance of SKR. Experimental results are then presented which illustrate the performance of the proposed procedure.
PDF

Hourly Steel Industry Energy Consumption Prediction Using Machine Learning Algorithms

Sathishkumar, VE;Lee, Myeong-Bae;Lim, Jong-Hyun;Shin, Chang-Sun;Park, Chang-Woo;Cho, Yong Yun
- Annual Conference of KIPS
- /
- 2019.10a
- /
- pp.585-588
- /
- 2019
Predictions of Energy Consumption for Industries gain an important place in energy management and control system, as there are dynamic and seasonal changes in the demand and supply of energy. This paper presents and discusses the predictive models for energy consumption of the steel industry. Data used includes lagging and leading current reactive power, lagging and leading current power factor, carbon dioxide (tCO2) emission and load type. In the test set, four statistical models are trained and evaluated: (a) Linear regression (LR), (b) Support Vector Machine with radial kernel (SVM RBF), (c) Gradient Boosting Machine (GBM), (d) random forest (RF). Root Mean Squared Error (RMSE), Mean Absolute Error (MAE) and Mean Absolute Percentage Error (MAPE) are used to measure the prediction efficiency of regression designs. When using all the predictors, the best model RF can provide RMSE value 7.33 in the test set.
https://doi.org/10.3745/PKIPS.y2019m10a.585 인용 PDF

Search Result 386, Processing Time 0.034 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)