• Title/Summary/Keyword: Linear Regression Algorithm

Search Result 285, Processing Time 0.025 seconds

Predicting compressive strength of bended cement concrete with ANNs

  • Gazder, Uneb;Al-Amoudi, Omar Saeed Baghabara;Khan, Saad Muhammad Saad;Maslehuddin, Mohammad
    • Computers and Concrete
    • /
    • v.20 no.6
    • /
    • pp.627-634
    • /
    • 2017
  • Predicting the compressive strength of concrete is important to assess the load-carrying capacity of a structure. However, the use of blended cements to accrue the technical, economic and environmental benefits has increased the complexity of prediction models. Artificial Neural Networks (ANNs) have been used for predicting the compressive strength of ordinary Portland cement concrete, i.e., concrete produced without the addition of supplementary cementing materials. In this study, models to predict the compressive strength of blended cement concrete prepared with a natural pozzolan were developed using regression models and single- and 2-phase learning ANNs. Back-propagation (BP), Levenberg-Marquardt (LM) and Conjugate Gradient Descent (CGD) methods were used for training the ANNs. A 2-phase learning algorithm is proposed for the first time in this study for predictive modeling of the compressive strength of blended cement concrete. The output of these predictive models indicates that the use of a 2-phase learning algorithm will provide better results than the linear regression model or the traditional single-phase ANN models.

Malicious URL Detection by Visual Characteristics with Machine Learning: Roles of HTTPS (시각적 특징과 머신 러닝으로 악성 URL 구분: HTTPS의 역할)

  • Sung-Won HONG;Min-Soo KANG
    • Journal of Korea Artificial Intelligence Association
    • /
    • v.1 no.2
    • /
    • pp.1-6
    • /
    • 2023
  • In this paper, we present a new method for classifying malicious URLs to reduce cases of learning difficulties due to unfamiliar and difficult terms related to information protection. This study plans to extract only visually distinguishable features within the URL structure and compare them through map learning algorithms, and to compare the contribution values of the best map learning algorithm methods to extract features that have the most impact on classifying malicious URLs. As research data, Kaggle used data that classified 7,046 malicious URLs and 7.046 normal URLs. As a result of the study, among the three supervised learning algorithms used (Decision Tree, Support Vector Machine, and Logistic Regression), the Decision Tree algorithm showed the best performance with 83% accuracy, 83.1% F1-score and 83.6% Recall values. It was confirmed that the contribution value of https is the highest among whether to use https, sub domain, and prefix and suffix, which can be visually distinguished through the feature contribution of Decision Tree. Although it has been difficult to learn unfamiliar and difficult terms so far, this study will be able to provide an intuitive judgment method without explanation of the terms and prove its usefulness in the field of malicious URL detection.

Structural design of Optimized Interval Type-2 FCM Based RBFNN : Focused on Modeling and Pattern Classifier (최적화된 Interval Type-2 FCM based RBFNN 구조 설계 : 모델링과 패턴분류기를 중심으로)

  • Kim, Eun-Hu;Song, Chan-Seok;Oh, Sung-Kwun;Kim, Hyun-Ki
    • The Transactions of The Korean Institute of Electrical Engineers
    • /
    • v.66 no.4
    • /
    • pp.692-700
    • /
    • 2017
  • In this paper, we propose the structural design of Interval Type-2 FCM based RBFNN. Proposed model consists of three modules such as condition, conclusion and inference parts. In the condition part, Interval Type-2 FCM clustering which is extended from FCM clustering is used. In the conclusion part, the parameter coefficients of the consequence part are estimated through LSE(Least Square Estimation) and WLSE(Weighted Least Square Estimation). In the inference part, final model outputs are acquired by fuzzy inference method from linear combination of both polynomial and activation level obtained through Interval Type-2 FCM and acquired activation level through Interval Type-2 FCM. Additionally, The several parameters for the proposed model are identified by using differential evolution. Final model outputs obtained through benchmark data are shown and also compared with other already studied models' performance. The proposed algorithm is performed by using Iris and Vehicle data for pattern classification. For the validation of regression problem modeling performance, modeling experiments are carried out by using MPG and Boston Housing data.

The FPNN Algorithm combined with fuzzy inference rules and PNN structure (퍼지추론규칙과 PNN 구조를 융합한 FPNN 알고리즘)

  • Park, Ho-Sung;Park, Byoung-Jun;Ahn, Tae-Chon;Oh, Sung-Kwun
    • Proceedings of the KIEE Conference
    • /
    • 1999.07g
    • /
    • pp.2856-2858
    • /
    • 1999
  • In this paper, the FPNN(Fuzzy Polynomial Neural Networks) algorithm with multi-layer fuzzy inference structure is proposed for the model identification of a complex nonlinear system. The FPNN structure is generated from the mutual combination of PNN (Polynomial Neural Network) structure and fuzzy inference method. The PNN extended from the GMDH(Group Method of Data Handling) uses several types of polynomials such as linear, quadratic and modifled quadratic besides the biquadratic polynomial used in the GMDH. In the fuzzy inference method, simplified and regression polynomial inference method which is based on the consequence of fuzzy rule expressed with a polynomial such as linear, quadratic and modified quadratic equation are used Each node of the FPNN is defined as a fuzzy rule and its structure is a kind of fuzzy-neural networks. Gas furnace data used to evaluate the performance of our proposed model.

  • PDF

A Study on the PRC Generation Algorithms for Virtual Reference Stations Using a Network of DGNSS Reference Stations (DGNSS 기준국 네트워크를 활용한 가상기준국 보정정보 생성 알고리즘에 관한 연구)

  • Kim, Hye-In;Park, Kwan-Dong
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.29 no.3
    • /
    • pp.221-228
    • /
    • 2011
  • For service-area-widening and commercialization of DGNSS service, Ministry of Land, Transport and Maritime Affairs is developing a DGNSS service based on VRS using T-DMB. In this study, three PRC generation algorithms are developed for VRS DGNSS and their accuracies were evaluated. Three DGNSS correction generation algorithms are based on inverse distance weighting, 1st- and 2nd- multiple linear regression, and their positioning accuracies were compared in terms of the number of reference stations used for network composition and the algorithm type. As a result, the positioning accuracy of the case of using 16 sites is better than that of 6 sites. And the algorithm using the multiple linear regression showed the best performance. When the positioning accuracy of VRS DGNSS was compared with the traditional single-reference DGNSS, the improvement ratio was 20-23% and 20-36% for the horizontal and vertical directions, respectively.

Optimization of Data Recovery using Non-Linear Equalizer in Cellular Mobile Channel (셀룰라 이동통신 채널에서 비선형 등화기를 이용한 최적의 데이터 복원)

  • Choi, Sang-Ho;Ho, Kwang-Chun;Kim, Yung-Kwon
    • Journal of IKEEE
    • /
    • v.5 no.1 s.8
    • /
    • pp.1-7
    • /
    • 2001
  • In this paper, we have investigated the CDMA(Code Division Multiple Access) Cellular System with non-linear equalizer in reverse link channel. In general, due to unknown characteristics of channel in the wireless communication, the distribution of the observables cannot be specified by a finite set of parameters; instead, we partitioned the m-dimensional sample space Into a finite number of disjointed regions by using quantiles and a vector quantizer based on training samples. The algorithm proposed is based on a piecewise approximation to regression function based on quantiles and conditional partition moments which are estimated by Robbins Monro Stochastic Approximation (RMSA) algorithm. The resulting equalizers and detectors are robust in the sense that they are insensitive to variations in noise distributions. The main idea is that the robust equalizers and robust partition detectors yield better performance in equiprobably partitioned subspace of observations than the conventional equalizer in unpartitioned observation space under any condition. And also, we apply this idea to the CDMA system and analyze the BER performance.

  • PDF

Estimation of the number of discontinuity points based on likelihood (가능도함수를 이용한 불연속점 수의 추정)

  • Huh, Jib
    • Journal of the Korean Data and Information Science Society
    • /
    • v.21 no.1
    • /
    • pp.51-59
    • /
    • 2010
  • In the case that the regression function has a discontinuity point in generalized linear model, Huh (2009) estimated the location and jump size using the log-likelihood weighted the one-sided kernel function. In this paper, we consider estimation of the unknown number of the discontinuity points in the regression function. The proposed algorithm is based on testing of the existence of a discontinuity point coming from the asymptotic distribution of the estimated jump size described in Huh (2009). The finite sample performance is illustrated by simulated example.

Predicting the Number of Movie Audiences Through Variable Selection Based on Information Gain Measure (정보 소득율 기반의 변수 선택을 통한 영화 관객 수 예측)

  • Park, Hyeon-Mock;Choi, Sang Hyun
    • Journal of Information Technology Applications and Management
    • /
    • v.26 no.3
    • /
    • pp.19-27
    • /
    • 2019
  • In this study, we propose a methodology for predicting the movie audience based on movie information that can be easily acquired before opening and effectively distinguishing qualitative variables. In addition, we constructed a model to estimate the number of movie audiences at the time of data acquisition through the configured variables. Another purpose of this study is to provide a criterion for categorizing success of movies with qualitative characteristics. As an evaluation criterion, we used information gain ratio which is the node selection criterion of C4.5 algorithm. Through the procedure we have selected 416 movie data features. As a result of the multiple linear regression model, the performance of the regression model using the variables selection method based on the information gain ratio was excellent.

Clustering with Adaptive weighting of Context-aware Linear regression (상황인식기반 선형회귀의 적응적 가중치를 적용한 클러스터링)

  • Lee, Kang-whan
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2021.05a
    • /
    • pp.271-273
    • /
    • 2021
  • 본 논문은 이동노드의 클러스터링내에서 보다 효율적인클러스터링을 제공하고 유지하기위한 딥러닝의 선형회귀적 적응적 보정가중치에 따른 군집적 알고리즘을 제안한다. 대부분의 클러스터링 군집데이터를 처리함에 있어 상호관계에 따른 분류체계가 제공된다. 이러한 경우 이웃한 이동노드중 목적노드와는 연결가능성이 가장높은 이동노드를 클러스터내에서 중계노드로 선택해야 한다. 본 연구에서는 이러한 상황정보를 이해하고 동적이동노드간 속도와 방향속성정보간의 상관관계의 친밀도를 고려한 자율학습기반의 회귀적 모델에서 적응적 가중치에 따른 분류를 제시한다. 본 논문에서는 이러한 상황정보를 이해하고 클러스터링을 유지할 수 있는 자율학습기반의 적응적 가중치에 따른 딥러닝 모델을 제시 한다.

  • PDF

Performance Improvement of Classification Between Pathological and Normal Voice Using HOS Parameter (HOS 특징 벡터를 이용한 장애 음성 분류 성능의 향상)

  • Lee, Ji-Yeoun;Jeong, Sang-Bae;Choi, Hong-Shik;Hahn, Min-Soo
    • MALSORI
    • /
    • no.66
    • /
    • pp.61-72
    • /
    • 2008
  • This paper proposes a method to improve pathological and normal voice classification performance by combining multiple features such as auditory-based and higher-order features. Their performances are measured by Gaussian mixture models (GMMs) and linear discriminant analysis (LDA). The combination of multiple features proposed by the frame-based LDA method is shown to be an effective method for pathological and normal voice classification, with a 87.0% classification rate. This is a noticeable improvement of 17.72% compared to the MFCC-based GMM algorithm in terms of error reduction.

  • PDF