• 제목/요약/키워드: Bayesian additive model

검색결과 19건 처리시간 0.022초

Comparison of tree-based ensemble models for regression

  • Park, Sangho;Kim, Chanmin
    • Communications for Statistical Applications and Methods
    • /
    • 제29권5호
    • /
    • pp.561-589
    • /
    • 2022
  • When multiple classifications and regression trees are combined, tree-based ensemble models, such as random forest (RF) and Bayesian additive regression trees (BART), are produced. We compare the model structures and performances of various ensemble models for regression settings in this study. RF learns bootstrapped samples and selects a splitting variable from predictors gathered at each node. The BART model is specified as the sum of trees and is calculated using the Bayesian backfitting algorithm. Throughout the extensive simulation studies, the strengths and drawbacks of the two methods in the presence of missing data, high-dimensional data, or highly correlated data are investigated. In the presence of missing data, BART performs well in general, whereas RF provides adequate coverage. The BART outperforms in high dimensional, highly correlated data. However, in all of the scenarios considered, the RF has a shorter computation time. The performance of the two methods is also compared using two real data sets that represent the aforementioned situations, and the same conclusion is reached.

PDC Intelligent control-based theory for structure system dynamics

  • Chen, Tim;Lohnash, Megan;Owens, Emmanuel;Chen, C.Y.J.
    • Smart Structures and Systems
    • /
    • 제25권4호
    • /
    • pp.401-408
    • /
    • 2020
  • This paper deals with the problem of global stabilization for a class of nonlinear control systems. An effective approach is proposed for controlling the system interaction of structures through a combination of parallel distributed compensation (PDC) intelligent controllers and fuzzy observers. An efficient approximate inference algorithm using expectation propagation and a Bayesian additive model is developed which allows us to predict the total number of control systems, thereby contributing to a more adaptive trajectory for the closed-loop system and that of its corresponding model. The closed-loop fuzzy system can be made as close as desired, so that the behavior of the closed-loop system can be rigorously predicted by establishing that of the closed-loop fuzzy system.

지구 통계 모형을 이용한 양파 재배지 농업기상정보 생성 방법 (Production of Agrometeorological Information in Onion Fields using Geostatistical Models)

  • 임지은;윤상후
    • 한국환경과학회지
    • /
    • 제27권7호
    • /
    • pp.509-518
    • /
    • 2018
  • Weather is the most influential factor for crop cultivation. Weather information for cultivated areas is necessary for growth and production forecasting of agricultural crops. However, there are limitations in the meteorological observations in cultivated areas because weather equipment is not installed. This study tested methods of predicting the daily mean temperature in onion fields using geostatistical models. Three models were considered: inverse distance weight method, generalized additive model, and Bayesian spatial linear model. Data were collected from the AWS (automatic weather system), ASOS (automated synoptic observing system), and an agricultural weather station between 2013 and 2016. To evaluate the prediction performance, data from AWS and ASOS were used as the modeling data, and data from the agricultural weather station were used as the validation data. It was found that the Bayesian spatial linear regression performed better than other models. Consequently, high-resolution maps of the daily mean temperature of Jeonnam were generated using all observed weather information.

Analyzing effect and importance of input predictors for urban streamflow prediction based on a Bayesian tree-based model

  • Nguyen, Duc Hai;Bae, Deg-Hyo
    • 한국수자원학회:학술대회논문집
    • /
    • 한국수자원학회 2022년도 학술발표회
    • /
    • pp.134-134
    • /
    • 2022
  • Streamflow forecasting plays a crucial role in water resource control, especially in highly urbanized areas that are very vulnerable to flooding during heavy rainfall event. In addition to providing the accurate prediction, the evaluation of effects and importance of the input predictors can contribute to water manager. Recently, machine learning techniques have applied their advantages for modeling complex and nonlinear hydrological processes. However, the techniques have not considered properly the importance and uncertainty of the predictor variables. To address these concerns, we applied the GA-BART, that integrates a genetic algorithm (GA) with the Bayesian additive regression tree (BART) model for hourly streamflow forecasting and analyzing input predictors. The Jungrang urban basin was selected as a case study and a database was established based on 39 heavy rainfall events during 2003 and 2020 from the rain gauges and monitoring stations. For the goal of this study, we used a combination of inputs that included the areal rainfall of the subbasins at current time step and previous time steps and water level and streamflow of the stations at time step for multistep-ahead streamflow predictions. An analysis of multiple datasets including different input predictors was performed to define the optimal set for streamflow forecasting. In addition, the GA-BART model could reasonably determine the relative importance of the input variables. The assessment might help water resource managers improve the accuracy of forecasts and early flood warnings in the basin.

  • PDF

Using Bayesian tree-based model integrated with genetic algorithm for streamflow forecasting in an urban basin

  • Nguyen, Duc Hai;Bae, Deg-Hyo
    • 한국수자원학회:학술대회논문집
    • /
    • 한국수자원학회 2021년도 학술발표회
    • /
    • pp.140-140
    • /
    • 2021
  • Urban flood management is a crucial and challenging task, particularly in developed cities. Therefore, accurate prediction of urban flooding under heavy precipitation is critically important to address such a challenge. In recent years, machine learning techniques have received considerable attention for their strong learning ability and suitability for modeling complex and nonlinear hydrological processes. Moreover, a survey of the published literature finds that hybrid computational intelligent methods using nature-inspired algorithms have been increasingly employed to predict or simulate the streamflow with high reliability. The present study is aimed to propose a novel approach, an ensemble tree, Bayesian Additive Regression Trees (BART) model incorporating a nature-inspired algorithm to predict hourly multi-step ahead streamflow. For this reason, a hybrid intelligent model was developed, namely GA-BART, containing BART model integrating with Genetic algorithm (GA). The Jungrang urban basin located in Seoul, South Korea, was selected as a case study for the purpose. A database was established based on 39 heavy rainfall events during 2003 and 2020 that collected from the rain gauges and monitoring stations system in the basin. For the goal of this study, the different step ahead models will be developed based in the methods, including 1-hour, 2-hour, 3-hour, 4-hour, 5-hour, and 6-hour step ahead streamflow predictions. In addition, the comparison of the hybrid BART model with a baseline model such as super vector regression models is examined in this study. It is expected that the hybrid BART model has a robust performance and can be an optional choice in streamflow forecasting for urban basins.

  • PDF

통계적모형을 통한 고해상도 일별 평균기온 산정 (Generating high resolution of daily mean temperature using statistical models)

  • 윤상후
    • Journal of the Korean Data and Information Science Society
    • /
    • 제27권5호
    • /
    • pp.1215-1224
    • /
    • 2016
  • 고해상도 격자 단위 기후정보는 농업, 관광학, 생태학, 질병학 등 다양한 분야의 현상을 설명하는 중요 요인이다. 고해상도 기후정보는 동적 모형과 통계적 모형을 통해 얻을 수 있다. 통계적 모형은 동적 모형에 비해 계산 시간이 저렴하여 시공간 해상도가 높은 기후자료 생성에 주로 이용한다. 본 연구에서는 2003년부터 2012년까지 1월에 관측된 일 평균기온자료를 토대로 통계적 모형의 일 평균 기온을 생성하였다. 통계적 모형으로 선형모형을 기반으로한 일반선형모형, 일반화가법모형, 공간선형모형, 베이지안공간선형모형을 고려하였다. 예측성능평가를 위해 60개소의 지상관측소에서 관측된 일 평균기온을 모형적합 자료로 사용하여 352개소의 자동기상관측의 일 평균기온을 검증하였다. 평균제곱오차와 상관계수를 보면 베이지안공간모형의 예측성능이 다른 모형에 비해 상대적으로 우수하였다. 최종적으로 $1km{\times}1km$ 격자 단위 일 평균기온 지도를 생성하였다.

Adaptive Iterative Depeckling of SAR Imagery

  • Lee, Sang-Hoon
    • 대한원격탐사학회지
    • /
    • 제23권5호
    • /
    • pp.455-464
    • /
    • 2007
  • Lee(2007) suggested the Point-Jacobian iteration MAP estimation(PJIMAP) for noise removal of the images that are corrupted by multiplicative speckle noise. It is to find a MAP estimation of noisy-free imagery based on a Bayesian model using the lognormal distribution for image intensity and an MRF for image texture. When the image intensity is logarithmically transformed, the speckle noise is approximately Gaussian additive noise, and it tends to a normal probability much faster than the intensity distribution. The MRF is incorporated into digital image analysis by viewing pixel types as states of molecules in a lattice-like physical system. In this study, the MAP estimation is computed by the Point-Jacobian iteration using adaptive parameters. At each iteration, the parameters related to the Bayesian model are adaptively estimated using the updated information. The results of the proposed scheme were compared to them of PJIMAP with SAR simulation data generated by the Monte Carlo method. The experiments demonstrated an improvement in relaxing speckle noise and estimating noise-free intensity by using the adaptive parameters for the Ponit-Jacobian iteration.

Optimized machine learning algorithms for predicting the punching shear capacity of RC flat slabs

  • Huajun Yan;Nan Xie;Dandan Shen
    • Advances in concrete construction
    • /
    • 제17권1호
    • /
    • pp.27-36
    • /
    • 2024
  • Reinforced concrete (RC) flat slabs should be designed based on punching shear strength. As part of this study, machine learning (ML) algorithms were developed to accurately predict the punching shear strength of RC flat slabs without shear reinforcement. It is based on Bayesian optimization (BO), combined with four standard algorithms (Support vector regression, Decision trees, Random forests, Extreme gradient boosting) on 446 datasets that contain six design parameters. Furthermore, an analysis of feature importance is carried out by Shapley additive explanation (SHAP), in order to quantify the effect of design parameters on punching shear strength. According to the results, the BO method produces high prediction accuracy by selecting the optimal hyperparameters for each model. With R2 = 0.985, MAE = 0.0155 MN, RMSE = 0.0244 MN, the BO-XGBoost model performed better than the original XGBoost prediction, which had R2 = 0.917, MAE = 0.064 MN, RMSE = 0.121 MN in total dataset. Additionally, recommendations are provided on how to select factors that will influence punching shear resistance of RC flat slabs without shear reinforcement.

Genetic evaluation of eggshell color based on additive and dominance models in laying hens

  • Guo, Jun;Wang, Kehua;Qu, Liang;Dou, Taocun;Ma, Meng;Shen, Manman;Hu, Yuping
    • Asian-Australasian Journal of Animal Sciences
    • /
    • 제33권8호
    • /
    • pp.1217-1223
    • /
    • 2020
  • Objective: Eggshells with a uniform color and intensity are important for egg production because many consumers assess the quality of an egg according to the shell color. In the present study, we evaluated the influence of dominant effects on the variations in eggshell color after 32 weeks in a crossbred population. Methods: This study was conducted using 7,878 eggshell records from 2,626 hens. Heritability was estimated using a univariate animal model, which included inbreeding coefficients as a fixed effect and animal additive genetic, dominant genetic, and residuals as random effects. Genetic correlations were obtained using a bivariate animal model. The optimal diagnostic criteria identified in this study were: L🟉 value (lightness) using a dominance model, and a🟉 (redness), and b🟉 (yellowness) value using an additive model. Results: The estimated heritabilities were 0.65 for shell lightness, 0.42 for redness, and 0.60 for yellowness. The dominance heritability was 0.23 for lightness. The estimated genetic correlations were 0.61 between lightness and redness, -0.84 between lightness and yellowness, and -0.39 between redness and yellowness. Conclusion: These results indicate that dominant genetic effects could help to explain the phenotypic variance in eggshell color, especially based on data from blue-shelled chickens. Considering the dominant genetic variation identified for shell color, this variation should be employed to produce blue eggs for commercial purposes using a planned mating system.

IMAGE DENOISING BASED ON MIXTURE DISTRIBUTIONS IN WAVELET DOMAIN

  • Bae, Byoung-Suk;Lee, Jong-In;Kang, Moon-Gi
    • 한국방송∙미디어공학회:학술대회논문집
    • /
    • 한국방송공학회 2009년도 IWAIT
    • /
    • pp.246-249
    • /
    • 2009
  • Due to the additive white Gaussian noise (AWGN), images are often corrupted. In recent days, Bayesian estimation techniques to recover noisy images in the wavelet domain have been studied. The probability density function (PDF) of an image in wavelet domain can be described using highly-sharp head and long-tailed shapes. If a priori probability density function having the above properties would be applied well adaptively, better results could be obtained. There were some frequently proposed PDFs such as Gaussian, Laplace distributions, and so on. These functions model the wavelet coefficients satisfactorily and have its own of characteristics. In this paper, mixture distributions of Gaussian and Laplace distribution are proposed, which attempt to corporate these distributions' merits. Such mixture model will be used to remove the noise in images by adopting Maximum a Posteriori (MAP) estimation method. With respect to visual quality, numerical performance and computational complexity, the proposed technique gained better results.

  • PDF