• Title/Summary/Keyword: Machine Learning Models

Search Result 1,395, Processing Time 0.028 seconds

Using Mechanical Learning Analysis of Determinants of Housing Sales and Establishment of Forecasting Model (기계학습을 활용한 주택매도 결정요인 분석 및 예측모델 구축)

  • Kim, Eun-mi;Kim, Sang-Bong;Cho, Eun-seo
    • Journal of Cadastre & Land InformatiX
    • /
    • v.50 no.1
    • /
    • pp.181-200
    • /
    • 2020
  • This study used the OLS model to estimate the determinants affecting the tenure of a home and then compared the predictive power of each model with SVM, Decision Tree, Random Forest, Gradient Boosting, XGBooest and LightGBM. There is a difference from the preceding study in that the Stacking model, one of the ensemble models, can be used as a base model to establish a more predictable model to identify the volume of housing transactions in the housing market. OLS analysis showed that sales profits, housing prices, the number of household members, and the type of residential housing (detached housing, apartments) affected the period of housing ownership, and compared the predictability of the machine learning model with RMSE, the results showed that the machine learning model had higher predictability. Afterwards, the predictive power was compared by applying each machine learning after rebuilding the data with the influencing variables, and the analysis showed the best predictive power of Random Forest. In addition, the most predictable Random Forest, Decision Tree, Gradient Boosting, and XGBooost models were applied as individual models, and the Stacking model was constructed using Linear, Ridge, and Lasso models as meta models. As a result of the analysis, the RMSE value in the Ridge model was the lowest at 0.5181, thus building the highest predictive model.

Performance comparison of SVM and ANN models for solar energy prediction (태양광 에너지 예측을 위한 SVM 및 ANN 모델의 성능 비교)

  • Jung, Wonseok;Jeong, Young-Hwa;Park, Moon-Ghu;Lee, Chang-Kyo;Seo, Jeongwook
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2018.10a
    • /
    • pp.626-628
    • /
    • 2018
  • In this paper, we compare the performances of SVM (Support Vector Machine) and ANN (Artificial Neural Network) machine learning models for predicting solar energy by using meteorological data. Two machine learning models were built by using fifteen kinds of weather data such as long and short wave radiation average, precipitation and temperature. Then the RBF (Radial Basis Function) parameters in the SVM model and the number of hidden layers/nodes and the regularization parameter in the ANN model were found by experimental studies. MAPE (Mean Absolute Percentage Error) and MAE (Mean Absolute Error) were considered as metrics for evaluating the performances of the SVM and ANN models. Sjoem Simulation results showed that the SVM model achieved the performances of MAPE=21.11 and MAE=2281417.65, and the ANN model did the performances of MAPE=19.54 and MAE=2155345.10776.

  • PDF

Estimation of regional flow duration curve applicable to ungauged areas using machine learning technique (머신러닝 기법을 이용한 미계측 유역에 적용 가능한 지역화 유황곡선 산정)

  • Jeung, Se Jin;Lee, Seung Pil;Kim, Byung Sik
    • Journal of Korea Water Resources Association
    • /
    • v.54 no.spc1
    • /
    • pp.1183-1193
    • /
    • 2021
  • Low flow affects various fields such as river water supply management and planning, and irrigation water. A sufficient period of flow data is required to calculate the Flow Duration Curve. However, in order to calculate the Flow Duration Curve, it is essential to secure flow data for more than 30 years. However, in the case of rivers below the national river unit, there is no long-term flow data or there are observed data missing for a certain period in the middle, so there is a limit to calculating the Flow Duration Curve for each river. In the past, statistical-based methods such as Multiple Regression Analysis and ARIMA models were used to predict sulfur in the unmeasured watershed, but recently, the demand for machine learning and deep learning models is increasing. Therefore, in this study, we present the DNN technique, which is a machine learning technique that fits the latest paradigm. The DNN technique is a method that compensates for the shortcomings of the ANN technique, such as difficult to find optimal parameter values in the learning process and slow learning time. Therefore, in this study, the Flow Duration Curve applicable to the unmeasured watershed is calculated using the DNN model. First, the factors affecting the Flow Duration Curve were collected and statistically significant variables were selected through multicollinearity analysis between the factors, and input data were built into the machine learning model. The effectiveness of machine learning techniques was reviewed through statistical verification.

Predicting and Interpreting Quality of CMP Process for Semiconductor Wafers Using Machine Learning (머신러닝을 이용한 반도체 웨이퍼 평탄화 공정품질 예측 및 해석 모형 개발)

  • Ahn, Jeong-Eon;Jung, Jae-Yoon
    • The Journal of Bigdata
    • /
    • v.4 no.2
    • /
    • pp.61-71
    • /
    • 2019
  • Chemical Mechanical Planarization (CMP) process that planarizes semiconductor wafer's surface by polishing is difficult to manage reliably since it is under various chemicals and physical machinery. In CMP process, Material Removal Rate (MRR) is often used for a quality indicator, and it is important to predict MRR in managing CMP process stably. In this study, we introduce prediction models using machine learning techniques of analyzing time-series sensor data collected in CMP process, and the classification models that are used to interpret process quality conditions. In addition, we find meaningful variables affecting process quality and explain process variables' conditions to keep process quality high by analyzing classification result.

  • PDF

A Study on Prediction Model Performance of Scaffold Pore Size Using Machine Learning Regression Method (머신 러닝 회귀 방안을 이용한 인공지지체 기공 크기 예측모델 성능에 관한 연구)

  • Lee, Song-Yeon;Huh, Yong Jeong
    • Journal of the Semiconductor & Display Technology
    • /
    • v.19 no.1
    • /
    • pp.36-41
    • /
    • 2020
  • In this paper, We need to change all print factors when which print scaffold with 400 ㎛ pore using FDM 3d printer. Therefore the print quantity is 10 billion times, So we are difficult to print on workplace. To solve the problem, we used the prediction model based machine learning regression. We preprocessed and learned the securing print condition data, and we produced different kinds of prediction models. We predicted the pore size of scaffolds not securing with new print condition data using prediction models. We have derived the print conditions that satisfy the pore size of 400 ㎛ among the predicted print conditions of pore size. We printed the scaffolds 5 times on the condition. We measured the pore size of the printed scaffold and compared the average pore size with the predicted pore size. We confirmed that error was less than 1%, and we were identify the model with the highest pore size prediction performance of scaffold.

Prediction of Significant Wave Height in Korea Strait Using Machine Learning

  • Park, Sung Boo;Shin, Seong Yun;Jung, Kwang Hyo;Lee, Byung Gook
    • Journal of Ocean Engineering and Technology
    • /
    • v.35 no.5
    • /
    • pp.336-346
    • /
    • 2021
  • The prediction of wave conditions is crucial in the field of marine and ocean engineering. Hence, this study aims to predict the significant wave height through machine learning (ML), a soft computing method. The adopted metocean data, collected from 2012 to 2020, were obtained from the Korea Institute of Ocean Science and Technology. We adopted the feedforward neural network (FNN) and long-short term memory (LSTM) models to predict significant wave height. Input parameters for the input layer were selected by Pearson correlation coefficients. To obtain the optimized hyperparameter, we conducted a sensitivity study on the window size, node, layer, and activation function. Finally, the significant wave height was predicted using the FNN and LSTM models, by varying the three input parameters and three window sizes. Accordingly, FNN (W48) (i.e., FNN with window size 48) and LSTM (W48) (i.e., LSTM with window size 48) were superior outcomes. The most suitable model for predicting the significant wave height was FNN(W48) owing to its accuracy and calculation time. If the metocean data were further accumulated, the accuracy of the ML model would have improved, and it will be beneficial to predict added resistance by waves when conducting a sea trial test.

Prediction of Electric Power on Distribution Line Using Machine Learning and Actual Data Considering Distribution Plan (배전계획을 고려한 실데이터 및 기계학습 기반의 배전선로 부하예측 기법에 대한 연구)

  • Kim, Junhyuk;Lee, Byung-Sung
    • KEPCO Journal on Electric Power and Energy
    • /
    • v.7 no.1
    • /
    • pp.171-177
    • /
    • 2021
  • In terms of distribution planning, accurate electric load prediction is one of the most important factors. The future load prediction has manually been performed by calculating the maximum electric load considering loads transfer/switching and multiplying it with the load increase rate. In here, the risk of human error is inherent and thus an automated maximum electric load forecasting system is required. Although there are many existing methods and techniques to predict future electric loads, such as regression analysis, many of them have limitations in reflecting the nonlinear characteristics of the electric load and the complexity due to Photovoltaics (PVs), Electric Vehicles (EVs), and etc. This study, therefore, proposes a method of predicting future electric loads on distribution lines by using Machine Learning (ML) method that can reflect the characteristics of these nonlinearities. In addition, predictive models were developed based on actual data collected at KEPCO's existing distribution lines and the adequacy of developed models was verified as well. Also, as the distribution planning has a direct bearing on the investment, and amount of investment has a direct bearing on the maximum electric load, various baseline such as maximum, lowest, median value that can assesses the adequacy and accuracy of proposed ML based electric load prediction methods were suggested.

A Multi-category Task for Bitrate Interval Prediction with the Target Perceptual Quality

  • Yang, Zhenwei;Shen, Liquan
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.15 no.12
    • /
    • pp.4476-4491
    • /
    • 2021
  • Video service providers tend to face user network problems in the process of transmitting video streams. They strive to provide user with superior video quality in a limited bitrate environment. It is necessary to accurately determine the target bitrate range of the video under different quality requirements. Recently, several schemes have been proposed to meet this requirement. However, they do not take the impact of visual influence into account. In this paper, we propose a new multi-category model to accurately predict the target bitrate range with target visual quality by machine learning. Firstly, a dataset is constructed to generate multi-category models by machine learning. The quality score ladders and the corresponding bitrate-interval categories are defined in the dataset. Secondly, several types of spatial-temporal features related to VMAF evaluation metrics and visual factors are extracted and processed statistically for classification. Finally, bitrate prediction models trained on the dataset by RandomForest classifier can be used to accurately predict the target bitrate of the input videos with target video quality. The classification prediction accuracy of the model reaches 0.705 and the encoded video which is compressed by the bitrate predicted by the model can achieve the target perceptual quality.

Steel-UHPC composite dowels' pull-out performance studies using machine learning algorithms

  • Zhihua Xiong;Zhuoxi Liang;Xuyao Liu;Markus Feldmann;Jiawen Li
    • Steel and Composite Structures
    • /
    • v.48 no.5
    • /
    • pp.531-545
    • /
    • 2023
  • Composite dowels are implemented as a powerful alternative to headed studs for the efficient combination of Ultra High-Performance Concrete (UHPC) with high-strength steel in novel composite structures. They are required to provide sufficient shear resistance and ensure the transmission of tensile forces in the composite connection in order to prevent lifting of the concrete slab. In this paper, the load bearing capacity of puzzle-shaped and clothoidal-shaped dowels encased in UHPC specimen were investigated based on validated experimental test data. Considering the influence of the embedment depth and the spacing width of shear dowels, the characteristics of UHPC square plate on the load bearing capacity of composite structure, 240 numeric models have been constructed and analyzed. Three artificial intelligence approaches have been implemented to learn the discipline from collected experimental data and then make prediction, which includes Artificial Neural Network-Particle Swarm Optimization (ANN-PSO), Adaptive Neuro-Fuzzy Inference System (ANFIS) and an Extreme Learning Machine (ELM). Among the factors, the embedment depth of composite dowel is proved to be the most influential parameter on the load bearing capacity. Furthermore, the results of the prediction models reveal that ELM is capable to achieve more accurate prediction.

Causality, causal discovery, causal inference and counterfactuals in Civil Engineering: Causal machine learning and case studies for knowledge discovery

  • M.Z. Naser;Arash Teymori Gharah Tapeh
    • Computers and Concrete
    • /
    • v.31 no.4
    • /
    • pp.277-292
    • /
    • 2023
  • Much of our experiments are designed to uncover the cause(s) and effect(s) behind a phenomenon (i.e., data generating mechanism) we happen to be interested in. Uncovering such relationships allows us to identify the true workings of a phenomenon and, most importantly, to realize and articulate a model to explore the phenomenon on hand and/or allow us to predict it accurately. Fundamentally, such models are likely to be derived via a causal approach (as opposed to an observational or empirical mean). In this approach, causal discovery is required to create a causal model, which can then be applied to infer the influence of interventions, and answer any hypothetical questions (i.e., in the form of What ifs? Etc.) that commonly used prediction- and statistical-based models may not be able to address. From this lens, this paper builds a case for causal discovery and causal inference and contrasts that against common machine learning approaches - all from a civil and structural engineering perspective. More specifically, this paper outlines the key principles of causality and the most commonly used algorithms and packages for causal discovery and causal inference. Finally, this paper also presents a series of examples and case studies of how causal concepts can be adopted for our domain.