• Title/Summary/Keyword: GBM model

Search Result 110, Processing Time 0.019 seconds

Form-finding of lifting self-forming GFRP elastic gridshells based on machine learning interpretability methods

  • Soheila, Kookalani;Sandy, Nyunn;Sheng, Xiang
    • Structural Engineering and Mechanics
    • /
    • v.84 no.5
    • /
    • pp.605-618
    • /
    • 2022
  • Glass fiber reinforced polymer (GFRP) elastic gridshells consist of long continuous GFRP tubes that form elastic deformations. In this paper, a method for the form-finding of gridshell structures is presented based on the interpretable machine learning (ML) approaches. A comparative study is conducted on several ML algorithms, including support vector regression (SVR), K-nearest neighbors (KNN), decision tree (DT), random forest (RF), AdaBoost, XGBoost, category boosting (CatBoost), and light gradient boosting machine (LightGBM). A numerical example is presented using a standard double-hump gridshell considering two characteristics of deformation as objective functions. The combination of the grid search approach and k-fold cross-validation (CV) is implemented for fine-tuning the parameters of ML models. The results of the comparative study indicate that the LightGBM model presents the highest prediction accuracy. Finally, interpretable ML approaches, including Shapely additive explanations (SHAP), partial dependence plot (PDP), and accumulated local effects (ALE), are applied to explain the predictions of the ML model since it is essential to understand the effect of various values of input parameters on objective functions. As a result of interpretability approaches, an optimum gridshell structure is obtained and new opportunities are verified for form-finding investigation of GFRP elastic gridshells during lifting construction.

Potential of multispectral imaging for maturity classification and recognition of oriental melon

  • Seongmin Lee;Kyoung-Chul Kim;Kangjin Lee;Jinhwan Ryu;Youngki Hong;Byeong-Hyo Cho
    • Korean Journal of Agricultural Science
    • /
    • v.50 no.3
    • /
    • pp.485-496
    • /
    • 2023
  • In this study, we aimed to apply multispectral imaging (713 - 920 nm, 10 bands) for maturity classification and recognition of oriental melons grown in hydroponic greenhouses. A total of 20 oriental melons were selected, and time series multispectral imaging of oriental melons was 7 - 9 times for each sample from April 21, 2023, to May 12, 2023. We used several approaches, such as Savitzky-Golay (SG), standard normal variate (SNV), and Combination of SG and SNV (SG + SNV), for pre-processing the multispectral data. As a result, 713 - 759 nm bands were preprocessed with SG for the maturity classification of oriental melons. Additionally, a Light Gradient Boosting Machine (LightGBM) was used to train the recognition model for oriental melon. R2 of recognition model were 0.92, 0.91 for the training and validation sets, respectively, and the F-scores were 96.6 and 79.4% for the training and testing sets, respectively. Therefore, multispectral imaging in the range of 713 - 920 nm can be used to classify oriental melons maturity and recognize their fruits.

Evaporative demand drought index forecasting in Busan-Ulsan-Gyeongnam region using machine learning methods (기계학습기법을 이용한 부산-울산-경남 지역의 증발수요 가뭄지수 예측)

  • Lee, Okjeong;Won, Jeongeun;Seo, Jiyu;Kim, Sangdan
    • Journal of Korea Water Resources Association
    • /
    • v.54 no.8
    • /
    • pp.617-628
    • /
    • 2021
  • Drought is a major natural disaster that causes serious social and economic losses. Local drought forecasts can provide important information for drought preparedness. In this study, we propose a new machine learning model that predicts drought by using historical drought indices and meteorological data from 10 sites from 1981 to 2020 in the southeastern part of the Korean Peninsula, Busan-Ulsan-Gyeongnam. Using Bayesian optimization techniques, a hyper-parameter-tuned Random Forest, XGBoost, and Light GBM model were constructed to predict the evaporative demand drought index on a 6-month time scale after 1-month. The model performance was compared by constructing a single site model and a regional model, respectively. In addition, the possibility of improving the model performance was examined by constructing a fine-tuned model using data from a individual site based on the regional model.

Research on the Financial Data Fraud Detection of Chinese Listed Enterprises by Integrating Audit Opinions

  • Leiruo Zhou;Yunlong Duan;Wei Wei
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.17 no.12
    • /
    • pp.3218-3241
    • /
    • 2023
  • Financial fraud undermines the sustainable development of financial markets. Financial statements can be regarded as the key source of information to obtain the operating conditions of listed companies. Current research focuses more on mining financial digital data instead of looking into text data. However, text data can reveal emotional information, which is an important basis for detecting financial fraud. The audit opinion of the financial statement is especially the fair opinion of a certified public accountant on the quality of enterprise financial reports. Therefore, this research was carried out by using the data features of 4,153 listed companies' financial annual reports and audits of text opinions in the past six years, and the paper puts forward a financial fraud detection model integrating audit opinions. First, the financial data index database and audit opinion text database were built. Second, digitized audit opinions with deep learning Bert model was employed. Finally, both the extracted audit numerical characteristics and the financial numerical indicators were used as the training data of the LightGBM model. What is worth paying attention to is that the imbalanced distribution of sample labels is also one of the focuses of financial fraud research. To solve this problem, data enhancement and Focal Loss feature learning functions were used in data processing and model training respectively. The experimental results show that compared with the conventional financial fraud detection model, the performance of the proposed model is improved greatly, with Area Under the Curve (AUC) and Accuracy reaching 81.42% and 78.15%, respectively.

Development of Glioblastoma In Vivo Model for the Research of Brain Cancer Diagnosis and Therapy (뇌암 진단 및 치료 연구를 위한 교모세포종 동물모델 개발)

  • Kang, Seonghee;Kang, Bosun
    • Journal of the Korean Society of Radiology
    • /
    • v.8 no.7
    • /
    • pp.389-395
    • /
    • 2014
  • The research was carried out to develop a animal model of malignant brain tumor for the researches in glioblastoma multiform (GBM) diagnosis and therapy. C6 cells were transplanted into the right striatum of SD rat using stereotactic instrument for the development. The developed animal model was verified by MRI and H&E stain assay of anatomicohistological examination. The MRI observations showed that the tumor developed at the injection site at the 7 days after glioblastoma inoculation. At 14 days post inoculation, the tumor grew to a large volume occupying almost a half of the right cerebral hemisphere. It was confirmed that the expression of excessive mitosis and pleomorphism in anatomicohistological examination. The developed animal model must be necessary and useful tool for the in vivo level research in the development of the new modality for the diagnosis and therapy of brain cancer.

Decision based uncertainty model to predict rockburst in underground engineering structures using gradient boosting algorithms

  • Kidega, Richard;Ondiaka, Mary Nelima;Maina, Duncan;Jonah, Kiptanui Arap Too;Kamran, Muhammad
    • Geomechanics and Engineering
    • /
    • v.30 no.3
    • /
    • pp.259-272
    • /
    • 2022
  • Rockburst is a dynamic, multivariate, and non-linear phenomenon that occurs in underground mining and civil engineering structures. Predicting rockburst is challenging since conventional models are not standardized. Hence, machine learning techniques would improve the prediction accuracies. This study describes decision based uncertainty models to predict rockburst in underground engineering structures using gradient boosting algorithms (GBM). The model input variables were uniaxial compressive strength (UCS), uniaxial tensile strength (UTS), maximum tangential stress (MTS), excavation depth (D), stress ratio (SR), and brittleness coefficient (BC). Several models were trained using different combinations of the input variables and a 3-fold cross-validation resampling procedure. The hyperparameters comprising learning rate, number of boosting iterations, tree depth, and number of minimum observations were tuned to attain the optimum models. The performance of the models was tested using classification accuracy, Cohen's kappa coefficient (k), sensitivity and specificity. The best-performing model showed a classification accuracy, k, sensitivity and specificity values of 98%, 93%, 1.00 and 0.957 respectively by optimizing model ROC metrics. The most and least influential input variables were MTS and BC, respectively. The partial dependence plots revealed the relationship between the changes in the input variables and model predictions. The findings reveal that GBM can be used to anticipate rockburst and guide decisions about support requirements before mining development.

Store Sales Prediction Using Gradient Boosting Model (그래디언트 부스팅 모델을 활용한 상점 매출 예측)

  • Choi, Jaeyoung;Yang, Heeyoon;Oh, Hayoung
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.25 no.2
    • /
    • pp.171-177
    • /
    • 2021
  • Through the rapid developments in machine learning, there have been diverse utilization approaches not only in industrial fields but also in daily life. Implementations of machine learning on financial data, also have been of interest. Herein, we employ machine learning algorithms to store sales data and present future applications for fintech enterprises. We utilize diverse missing data processing methods to handle missing data and apply gradient boosting machine learning algorithms; XGBoost, LightGBM, CatBoost to predict the future revenue of individual stores. As a result, we found that using median imputation onto missing data with the appliance of the xgboost algorithm has the best accuracy. By employing the proposed method, fintech enterprises and customers can attain benefits. Stores can benefit by receiving financial assistance beforehand from fintech companies, while these corporations can benefit by offering financial support to these stores with low risk.

A Study on Fraud Detection in the C2C Used Trade Market Using Doc2vec

  • Lim, Do Hyun;Ahn, Hyunchul
    • Journal of the Korea Society of Computer and Information
    • /
    • v.27 no.3
    • /
    • pp.173-182
    • /
    • 2022
  • In this paper, we propose a machine learning model that can prevent fraudulent transactions in advance and interpret them using the XAI approach. For the experiment, we collected a real data set of 12,258 mobile phone sales posts from Joonggonara, a major domestic online C2C resale trading platform. Characteristics of the text corresponding to the post body were extracted using Doc2vec, dimensionality was reduced through PCA, and various derived variables were created based on previous research. To mitigate the data imbalance problem in the preprocessing stage, a complex sampling method that combines oversampling and undersampling was applied. Then, various machine learning models were built to detect fraudulent postings. As a result of the analysis, LightGBM showed the best performance compared to other machine learning models. And as a result of SHAP, if the price is unreasonably low compared to the market price and if there is no indication of the transaction area, there was a high probability that it was a fraudulent post. Also, high price, no safe transaction, the more the courier transaction, and the higher the ratio of 0 in the price also led to fraud.

Prediction of Vertical Sea Water Temperature Profile in the East Sea Based on Machine Learning and XBT Data

  • Kim, Young-Joo;Lee, Soo-Jin;Kim, Young-Won
    • Journal of the Korea Society of Computer and Information
    • /
    • v.27 no.11
    • /
    • pp.47-55
    • /
    • 2022
  • Recently, researches on the prediction of sea water temperature using artificial intelligence models has been actively conducted in Korea. However, most researches in the sea around the Korean peninsula mainly focus on predicting sea surface temperatures. Unlike previous researches, this research predicted the vertical sea water temperature profile of the East Sea, which is very important in submarine operations and anti-submarine warfare, using XBT(eXpendable Bathythermograph) data and machine learning models(RandomForest, XGBoost, LightGBM). The model was trained using XBT data measured from sea surface to depth of 200m in a specific area of the East Sea, and the prediction accuracy was evaluated through MAE(Mean Absolute Error) and vertical sea water temperature profile graphs.

A Study on Predicting Student Dropout in College: The Importance of Early Academic Performance (전문대학 학생의 학업중단 예측에 관한 연구: 초기 학업 성적의 중요성)

  • Sangjo Oh;JiHwan Sim
    • Journal of Industrial Convergence
    • /
    • v.22 no.2
    • /
    • pp.23-32
    • /
    • 2024
  • This study utilized minimum number of demographic variables and first-semester GPA of students to predict the final academic status of students at a vocational college in Seoul. The results from XGBoost and LightGBM models revealed that these variables significantly impacted the prediction of students' dismissal. This suggests that early academic performance could be an important indicator of potential academic dropout. Additionally, the possibility that academic years required to award an associate degree at the vocational college could influence the final academic status was confirmed, indicating that the duration of study is a crucial factor in students' decisions to discontinue their studies. The study attempted to model without relying on psychological, social, or economic factors, focusing solely on academic achievement. This is expected to aid in the development of an early warning system for preventing academic dropout in the future.