• Title/Summary/Keyword: Linear Regression Algorithm

Search Result 285, Processing Time 0.026 seconds

Computer Architecture Execution Time Optimization Using Swarm in Machine Learning

  • Sarah AlBarakati;Sally AlQarni;Rehab K. Qarout;Kaouther Laabidi
    • International Journal of Computer Science & Network Security
    • /
    • v.23 no.10
    • /
    • pp.49-56
    • /
    • 2023
  • Computer architecture serves as a link between application requirements and underlying technology capabilities such as technical, mathematical, medical, and business applications' computational and storage demands are constantly increasing. Machine learning these days grown and used in many fields and it performed better than traditional computing in applications that need to be implemented by using mathematical algorithms. A mathematical algorithm requires more extensive and quicker calculations, higher computer architecture specification, and takes longer execution time. Therefore, there is a need to improve the use of computer hardware such as CPU, memory, etc. optimization has a main role to reduce the execution time and improve the utilization of computer recourses. And for the importance of execution time in implementing machine learning supervised module linear regression, in this paper we focus on optimizing machine learning algorithms, for this purpose we write a (Diabetes prediction program) and applying on it a Practical Swarm Optimization (PSO) to reduce the execution time and improve the utilization of computer resources. Finally, a massive improvement in execution time were observed.

A Dynamic Piecewise Prediction Model of Solar Insolation for Efficient Photovoltaic Systems (효율적인 태양광 발전량 예측을 위한 Dynamic Piecewise 일사량 예측 모델)

  • Yang, Dong Hun;Yeo, Na Young;Mah, Pyeongsoo
    • KIISE Transactions on Computing Practices
    • /
    • v.23 no.11
    • /
    • pp.632-640
    • /
    • 2017
  • Although solar insolation is the weather factor with the greatest influence on power generation in photovoltaic systems, the Meterological Agency does not provide solar insolation data for future dates. Therefore, it is essential to research prediction methods for solar insolation to efficiently manage photovoltaic systems. In this study, we propose a Dynamic Piecewise Prediction Model that can be used to predict solar insolation values for future dates based on information from the weather forecast. To improve the predictive accuracy, we dynamically divide the entire data set based on the sun altitude and cloudiness at the time of prediction. The Dynamic Piecewise Prediction Model is developed by applying a polynomial linear regression algorithm on the divided data set. To verify the performance of our proposed model, we compared our model to previous approaches. The result of the comparison shows that the proposed model is superior to previous approaches in that it produces a lower prediction error.

Adaptive Macroblock Quantization Method for H.264 Codec (H.264 코덱을 위한 적응적 매크로블록 양자화 방법)

  • Park, Sang-Hyun
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.14 no.5
    • /
    • pp.1193-1200
    • /
    • 2010
  • This paper presents a new adaptive macroblock quantization algorithm which generates the output bits corresponding to the target bit budget. The H.264 standard uses various coding modes and optimization methods to improve the compression performance, which makes it difficult to control the amount of the generated traffic accurately. In the proposed scheme, linear regression analysis is used to analyze the relationship between the bit rate of each macroblock and the quantization parameter and to predict the MAD values. Using the predicted values, the quantization parameter of each macroblock is determined by the Lagrange multiplier method and then modified according to the difference between the bit budget and the generated bits. It is shown by experimental results that the new algorithm can generate output bits accurately corresponding to the target bit rates.

Collection Fusion Algorithm in Distributed Multimedia Databases (분산 멀티미디어 데이터베이스에 대한 수집 융합 알고리즘)

  • Kim, Deok-Hwan;Lee, Ju-Hong;Lee, Seok-Lyong;Chung, Chin-Wan
    • Journal of KIISE:Databases
    • /
    • v.28 no.3
    • /
    • pp.406-417
    • /
    • 2001
  • With the advances in multimedia databases on the World Wide Web, it becomes more important to provide users with the search capability of distributed multimedia data. While there have been many studies about the database selection and the collection fusion for text databases. The multimedia databases on the Web have autonomous and heterogeneous properties and they use mainly the content based retrieval. The collection fusion problem of multimedia databases is concerned with the merging of results retrieved by content based retrieval from heterogeneous multimedia databases on the Web. This problem is crucial for the search in distributed multimedia databases, however, it has not been studied yet. This paper provides novel algorithms for processing the collection fusion of heterogeneous multimedia databases on the Web. We propose two heuristic algorithms for estimating the number of objects to be retrieved from local databases and an algorithm using the linear regression. Extensive experiments show the effectiveness and efficiency of these algorithms. These algorithms can provide the basis for the distributed content based retrieval algorithms for multimedia databases on the Web.

  • PDF

Conformity Assessment of Machine Learning Algorithm for Particulate Matter Prediction (미세먼지 예측을 위한 기계 학습 알고리즘의 적합성 평가)

  • Cho, Kyoung-woo;Jung, Yong-jin;Kang, Chul-gyu;Oh, Chang-heon
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.23 no.1
    • /
    • pp.20-26
    • /
    • 2019
  • Due to the human influence of particulate matter, various studies are being conducted to predict it using past data measured in the atmospheric environment monitoring network. However, it is difficult to precisely set the measurement environment and detailed conditions of the previously designed predictive model, and it is necessary to design a new predictive model based on the existing research results because of the problems such as the missing of the weather data. In this paper, as a previous study for particulate matter prediction, the conformity of the algorithm for particulate matter prediction was evaluated by designing the prediction model through the multiple linear regression and the artificial neural network, which are machine learning algorithms. As a result of the prediction performance comparison through RMSE, 18.13 for the MLR model and 14.31 for the MLP model, and the artificial neural network model was more conformable for predicting the particulate matter concentration.

A study on the prediction of korean NPL market return (한국 NPL시장 수익률 예측에 관한 연구)

  • Lee, Hyeon Su;Jeong, Seung Hwan;Oh, Kyong Joo
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.2
    • /
    • pp.123-139
    • /
    • 2019
  • The Korean NPL market was formed by the government and foreign capital shortly after the 1997 IMF crisis. However, this market is short-lived, as the bad debt has started to increase after the global financial crisis in 2009 due to the real economic recession. NPL has become a major investment in the market in recent years when the domestic capital market's investment capital began to enter the NPL market in earnest. Although the domestic NPL market has received considerable attention due to the overheating of the NPL market in recent years, research on the NPL market has been abrupt since the history of capital market investment in the domestic NPL market is short. In addition, decision-making through more scientific and systematic analysis is required due to the decline in profitability and the price fluctuation due to the fluctuation of the real estate business. In this study, we propose a prediction model that can determine the achievement of the benchmark yield by using the NPL market related data in accordance with the market demand. In order to build the model, we used Korean NPL data from December 2013 to December 2017 for about 4 years. The total number of things data was 2291. As independent variables, only the variables related to the dependent variable were selected for the 11 variables that indicate the characteristics of the real estate. In order to select the variables, one to one t-test and logistic regression stepwise and decision tree were performed. Seven independent variables (purchase year, SPC (Special Purpose Company), municipality, appraisal value, purchase cost, OPB (Outstanding Principle Balance), HP (Holding Period)). The dependent variable is a bivariate variable that indicates whether the benchmark rate is reached. This is because the accuracy of the model predicting the binomial variables is higher than the model predicting the continuous variables, and the accuracy of these models is directly related to the effectiveness of the model. In addition, in the case of a special purpose company, whether or not to purchase the property is the main concern. Therefore, whether or not to achieve a certain level of return is enough to make a decision. For the dependent variable, we constructed and compared the predictive model by calculating the dependent variable by adjusting the numerical value to ascertain whether 12%, which is the standard rate of return used in the industry, is a meaningful reference value. As a result, it was found that the hit ratio average of the predictive model constructed using the dependent variable calculated by the 12% standard rate of return was the best at 64.60%. In order to propose an optimal prediction model based on the determined dependent variables and 7 independent variables, we construct a prediction model by applying the five methodologies of discriminant analysis, logistic regression analysis, decision tree, artificial neural network, and genetic algorithm linear model we tried to compare them. To do this, 10 sets of training data and testing data were extracted using 10 fold validation method. After building the model using this data, the hit ratio of each set was averaged and the performance was compared. As a result, the hit ratio average of prediction models constructed by using discriminant analysis, logistic regression model, decision tree, artificial neural network, and genetic algorithm linear model were 64.40%, 65.12%, 63.54%, 67.40%, and 60.51%, respectively. It was confirmed that the model using the artificial neural network is the best. Through this study, it is proved that it is effective to utilize 7 independent variables and artificial neural network prediction model in the future NPL market. The proposed model predicts that the 12% return of new things will be achieved beforehand, which will help the special purpose companies make investment decisions. Furthermore, we anticipate that the NPL market will be liquidated as the transaction proceeds at an appropriate price.

Development of State Diagnosis Algorithm for Performance Improvement of PV System (태양광전원의 성능향상을 위한 상태진단 알고리즘 개발)

  • Choi, Sungsik;Kim, Taeyoun;Park, Jaebeom;Kim, Byungki;Rho, Daeseok
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.15 no.2
    • /
    • pp.1036-1043
    • /
    • 2014
  • The installation of PV system to the power distribution system is being increased as one of solutions for environmental pollution and energy crisis. Because the output efficiency of PV system is getting decreased because of the aging phenomenon and several operation obstacles, the technology development of output prediction and state diagnosis of PV modules are required in order to improve operation performance of PV modules. The conventional methods for output prediction by considering various parameters and standard test condition values of PV modules may have difficult and complex computation procedure and also their prediction values may produce large error. To overcome these problems, this paper proposes an optimal prediction algorithm and state diagnosis algorithm of PV modules by using least square methods of linear regression analysis. In addition, this paper presents a state diagnosis evaluation system of PV modules based on the proposed optimal algorithms of PV modules. From the simulation results of proposed evaluation system, it is confirmed that the proposed algorithms is a practical tool for state diagnosis of PV modules.

2D-QSAR analysis for hERG ion channel inhibitors (hERG 이온채널 저해제에 대한 2D-QSAR 분석)

  • Jeon, Eul-Hye;Park, Ji-Hyeon;Jeong, Jin-Hee;Lee, Sung-Kwang
    • Analytical Science and Technology
    • /
    • v.24 no.6
    • /
    • pp.533-543
    • /
    • 2011
  • The hERG (human ether-a-go-go related gene) ion channel is a main factor for cardiac repolarization, and the blockade of this channel could induce arrhythmia and sudden death. Therefore, potential hERG ion channel inhibitors are now a primary concern in the drug discovery process, and lots of efforts are focused on the minimizing the cardiotoxic side effect. In this study, $IC_{50}$ data of 202 organic compounds in HEK (human embryonic kidney) cell from literatures were used to develop predictive 2D-QSAR model. Multiple linear regression (MLR), Support Vector Machine (SVM), and artificial neural network (ANN) were utilized to predict inhibition concentration of hERG ion channel as machine learning methods. Population based-forward selection method with cross-validation procedure was combined with each learning method and used to select best subset descriptors for each learning algorithm. The best model was ANN model based on 14 descriptors ($R^2_{CV}$=0.617, RMSECV=0.762, MAECV=0.583) and the MLR model could describe the structural characteristics of inhibitors and interaction with hERG receptors. The validation of QSAR models was evaluated through the 5-fold cross-validation and Y-scrambling test.

Muscle Fatigue Assessment using Hilbert-Huang Transform and an Autoregressive Model during Repetitive Maximum Isokinetic Knee Extensions (슬관절의 등속성 최대 반복 신전시 Hilbert-Huang 변환과 AR 모델을 이용한 근피로 평가)

  • Kim, H.S.;Choi, S.W.;Yun, A.R.;Lee, S.E.;Shin, K.Y.;Choi, J.I.;Mun, J.H.
    • Journal of Biosystems Engineering
    • /
    • v.34 no.2
    • /
    • pp.127-132
    • /
    • 2009
  • In the working population, muscle fatigue and musculoskeletal discomfort are common, which, in the case of insufficient recovery may lead to musculoskeletal pain. Workers suffering from musculoskeletal pains need to be rehabilitated for recovery. Isokinetic testing has been used in physical strengthening, rehabilitation and post-operative orthopedic surgery. Frequency analysis of electromyography (EMG) signals using the mean frequency (MNF) has been widely used to characterize muscle fatigue. During isokinetic contractions, EMG signals present strong nonstationarities. Hilbert-Haung transform (HHT) and autoregressive (AR) model have been known more suitable than Fourier or wavelet transform for nonstationary signals. Moreover, several analyses have been performed within each active phase during isokinetic contractions. Thus, the aims of this study were i) to determine which one was better suitable for the analysis of MNF between HHT and AR model during repetitive maximum isokinetic extensions and ii) to investigate whether the analysis could be repeated for sequential fixed epoch lengths. Seven healthy volunteers (five males and two females) performed isokinetic knee extensions at $60^{\circ}/s$ and $240^{\circ}/s$ until 50% of the maximum peak torque was reached. Surface EMG signals were recorded from the rectus femoris of the right thigh. An algorithm detecting the onset and offset of EMG signals was applied to extract each active phase of the muscle. Following the results, slopes from the least-square error linear regression of MNF values showed that muscle fatigue of all subjects occurred. The AR model is better suited than HHT for estimating MNF from nonstationary EMG signals during isokinetic knee extensions. Moreover, the linear regression can be extracted from MNF values calculated by sequential fixed epoch lengths (p> 0.0I).

Real-Time, Simultaneous and Proportional Myoelectric Control for Robotic Rehabilitation Therapy of Stroke Survivors (뇌졸중 환자의 로봇 재활 치료를 위한 실시간, 동시 및 비례형 근전도 제어)

  • Jung, YoungJin;Park, Hae Yean;Maitra, Kinsuk;Prabakar, Nagarajan;Kim, Jong-Hoon
    • Therapeutic Science for Rehabilitation
    • /
    • v.7 no.1
    • /
    • pp.79-88
    • /
    • 2018
  • Objective : Conventional therapy approaches for stroke survivors have required considerable demands on therapist's effort and patient's expense. Thus, new robotics rehabilitation therapy technologies have been proposed but they have suffered from less than optimal control algorithms. This article presents a novel technical healthcare solution for the real-time, simultaneous and propositional myoelectric control for stroke survivors' upper limb robotic rehabilitation therapy. Methods : To implement an appropriate computational algorithm for controlling a portable rehabilitative robot, a linear regression model was employed, and a simple game experiment was conducted to identify its potential of clinical utilization. Results : The results suggest that the proposed device and computational algorithm can be used for stroke robot rehabilitation. Conclusion : Moreover, we believe that these techniques will be used as a prominent tool in making a device or finding new therapy approaches in robot-assisted rehabilitation for stroke survivors.