• Title/Summary/Keyword: Gradient boosting

Search Result 225, Processing Time 0.027 seconds

Understanding Customer Purchasing Behavior in E-Commerce using Explainable Artificial Intelligence Techniques (XAI 기법을 이용한 전자상거래의 고객 구매 행동 이해)

  • Lee, Jaejun;Jeong, Ii Tae;Lim, Do Hyun;Kwahk, Kee-Young;Ahn, Hyunchul
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2021.07a
    • /
    • pp.387-390
    • /
    • 2021
  • 최근 전자 상거래 시장이 급격한 성장을 이루면서 고객들의 급변하는 니즈를 파악하는 것이 기업들의 수익에 직결되는 요소로 인식되고 있다. 이에 기업들은 고객들의 니즈를 신속하고 정확하게 파악하기 위해, 기축적된 고객 관련 각종 데이터를 활용하려는 시도를 강화하고 있다. 기존 시도들은 주로 구매 행동 예측에 중점을 두었으나 고객 행동의 전후 과정을 해석하는데 있어 어려움이 존재했다. 본 연구에서는 고객이 구매한 상품을 확정 또는 환불하는 행동을 취할 때 해당 행동이 발생하는데 있어 어떤 요소들이 작용하였는지를 파악하고, 어떤 고객이 환불할 지를 예측하는 예측 모형을 새롭게 제시한다. 예측 모형 구현에는 트리 기반 앙상블 방법을 사용해 예측력을 높인 XGBoost 기법을 적용하였으며, 고객 의도에 영향을 미치는 요소들을 파악하기 위하여 대표적인 설명가능한 인공지능(XAI) 기법 중 하나인 SHAP 기법을 적용하였다. 이를 통해 특정 고객 행동에 대한 각 요인들의 전반적인 영향 뿐만 아니라, 각 개별 고객에 대해서도 어떤 요소가 환불결정에 영향을 미쳤는지 파악할 수 있었다. 이를 통해 기업은 고객 개개인의 의사 결정에 영향을 미치는 요소를 파악하여 개인화 마케팅에 사용할 수 있을 것으로 기대된다.

  • PDF

Analysis of algal spatial distribution characteristics using hyperspectral images and machine learning in upstream reach of Baekje weir (초분광영상과 머신러닝을 이용한 백제보 상류구간 조류 공간분포 특성분석)

  • Jang, Wonjin;Kim, Jinuk;Chung, Jeehun;Park, Yongeun;Kim, Seongjoon
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2021.06a
    • /
    • pp.89-89
    • /
    • 2021
  • 부영양화된 호수나 유속이 느린 하천에서 발생하는 녹조의 과도한 발생은 하천 생태계 훼손, 동식물의 건강, 담수의 오염 등 환경 사회 경제적으로 큰 피해를 준다. 현재 수질 측정망은 정해진 지점에서 Chlorophyll-a(Chl-a), Phycocyanin(PC)을 대표농도로 산정하고 조류경보에 활용하고 있으나, 일주일에 한번씩 샘플링을 통해 Chl-a 및 PC를 측정하여 시공간적인 신뢰성의 문제가 제기될 수 있다. 본 연구에서는 기존 점단위 조류 모니터링의 한계점을 개선하기 위해 초분광영상 자료를 머신러닝 기법에 적용하여 Chl-a 및 PC 산정 알고리즘을 개발하였다. 이를 위해 Chl-a와 PC의 최대 흡수, 반사 파장대, 주요 물 흡수 파장대 자료를 조합하여 9개의 파장비를 구축하였으며, 기존 연구에서 활용한 머신러닝 기법인 Partial Least Square, Random Forest, Gradient Boosting, Support Vector Machine, K-Nearest Neighbor, Artificial Neural Network를 검토하여 최적 모델을 선정하였다. 학습된 머신러닝의 성능을 R2, NSE, RMSE 목적함수를 이용해 평가하였으며, 그 결과 ANN이 각각 PC 0.801, 0.755, 11.774 mg/m3, Chl-a 0.733, 0.622, 8.736 mg/m3로 가장 우수한 성능을 보였다. 최적화 된 ANN 모델을 백제보 상류 2016-2017년 항공 초분광영상에 적용하여 시공간에 따른 조류 분포변화를 평가하고자 한다.

  • PDF

The Analysis of the Activity Patterns of Dog with Wearable Sensors Using Machine Learning

  • Hussain, Ali;Ali, Sikandar;Kim, Hee-Cheol
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2021.05a
    • /
    • pp.141-143
    • /
    • 2021
  • The Activity patterns of animal species are difficult to access and the behavior of freely moving individuals can not be assessed by direct observation. As it has become large challenge to understand the activity pattern of animals such as dogs, and cats etc. One approach for monitoring these behaviors is the continuous collection of data by human observers. Therefore, in this study we assess the activity patterns of dog using the wearable sensors data such as accelerometer and gyroscope. A wearable, sensor -based system is suitable for such ends, and it will be able to monitor the dogs in real-time. The basic purpose of this study was to develop a system that can detect the activities based on the accelerometer and gyroscope signals. Therefore, we purpose a method which is based on the data collected from 10 dogs, including different nine breeds of different sizes and ages, and both genders. We applied six different state-of-the-art classifiers such as Random forests (RF), Support vector machine (SVM), Gradient boosting machine (GBM), XGBoost, k-nearest neighbors (KNN), and Decision tree classifier, respectively. The Random Forest showed a good classification result. We achieved an accuracy 86.73% while the detecting the activity.

  • PDF

Prediction of medication-related osteonecrosis of the jaw (MRONJ) using automated machine learning in patients with osteoporosis associated with dental extraction and implantation: a retrospective study

  • Da Woon Kwack;Sung Min Park
    • Journal of the Korean Association of Oral and Maxillofacial Surgeons
    • /
    • v.49 no.3
    • /
    • pp.135-141
    • /
    • 2023
  • Objectives: This study aimed to develop and validate machine learning (ML) models using H2O-AutoML, an automated ML program, for predicting medication-related osteonecrosis of the jaw (MRONJ) in patients with osteoporosis undergoing tooth extraction or implantation. Patients and Methods: We conducted a retrospective chart review of 340 patients who visited Dankook University Dental Hospital between January 2019 and June 2022 who met the following inclusion criteria: female, age ≥55 years, osteoporosis treated with antiresorptive therapy, and recent dental extraction or implantation. We considered medication administration and duration, demographics, and systemic factors (age and medical history). Local factors, such as surgical method, number of operated teeth, and operation area, were also included. Six algorithms were used to generate the MRONJ prediction model. Results: Gradient boosting demonstrated the best diagnostic accuracy, with an area under the receiver operating characteristic curve (AUC) of 0.8283. Validation with the test dataset yielded a stable AUC of 0.7526. Variable importance analysis identified duration of medication as the most important variable, followed by age, number of teeth operated, and operation site. Conclusion: ML models can help predict MRONJ occurrence in patients with osteoporosis undergoing tooth extraction or implantation based on questionnaire data acquired at the first visit.

Estimation of lightweight aggregate concrete characteristics using a novel stacking ensemble approach

  • Kaloop, Mosbeh R.;Bardhan, Abidhan;Hu, Jong Wan;Abd-Elrahman, Mohamed
    • Advances in nano research
    • /
    • v.13 no.5
    • /
    • pp.499-512
    • /
    • 2022
  • This study investigates the efficiency of ensemble machine learning for predicting the lightweight-aggregate concrete (LWC) characteristics. A stacking ensemble (STEN) approach was proposed to estimate the dry density (DD) and 28 days compressive strength (Fc-28) of LWC using two meta-models called random forest regressor (RFR) and extra tree regressor (ETR), and two novel ensemble models called STEN-RFR and STEN-ETR, were constructed. Four standalone machine learning models including artificial neural network, gradient boosting regression, K neighbor regression, and support vector regression were used to compare the performance of the proposed models. For this purpose, a sum of 140 LWC mixtures with 21 influencing parameters for producing LWC with a density less than 1000 kg/m3, were used. Based on the experimental results with multiple performance criteria, it can be concluded that the proposed STEN-ETR model can be used to estimate the DD and Fc-28 of LWC. Moreover, the STEN-ETR approach was found to be a significant technique in prediction DD and Fc-28 of LWC with minimal prediction error. In the validation phase, the accuracy of the proposed STEN-ETR model in predicting DD and Fc-28 was found to be 96.79% and 81.50%, respectively. In addition, the significance of cement, water-cement ratio, silica fume, and aggregate with expanded glass variables is efficient in modeling DD and Fc-28 of LWC.

Development of a Collaboration Recommendation Model between Global Consulting Firms using Link Prediction

  • Yu, Young-su;Koo, Bon-sang
    • International conference on construction engineering and project management
    • /
    • 2020.12a
    • /
    • pp.381-386
    • /
    • 2020
  • Global construction and engineering consulting (E&C) firms are actively seeking entry into overseas markets based on loan projects from multilateral development banks to provide a basis for entry into overseas markets and sustainable growth. Bids on these projects are competitive between global top firms in terms of the technical level and price due to the limited number of projects; thus, developing a successful partnership to complement competence has become an essential element to win bids. In this regard, many studies have analyzed enterprises through characteristic analyses or the derivation of influential factors from the past social networks based on social network analysis (SNA). However, few studies have been conducted to reflect the process of changes to analyze collaborative relationships. Thus, this study aims to identify dynamic changes in past social networks and develop a model that can predict changes in the relationships between E&C firms based on similarities or differences between firms, presenting a methodology to target firms for appropriate collaboration. The analysis results demonstrate that the sensitivity of the developed prediction model was 70.26%, which could accurately predict 163 out of 232 actual cooperative cases.

  • PDF

Predicting the compressive strength of SCC containing nano silica using surrogate machine learning algorithms

  • Neeraj Kumar Shukla;Aman Garg;Javed Bhutto;Mona Aggarwal;Mohamed Abbas;Hany S. Hussein;Rajesh Verma;T.M. Yunus Khan
    • Computers and Concrete
    • /
    • v.32 no.4
    • /
    • pp.373-381
    • /
    • 2023
  • Fly ash, granulated blast furnace slag, marble waste powder, etc. are just some of the by-products of other sectors that the construction industry is looking to include into the many types of concrete they produce. This research seeks to use surrogate machine learning methods to forecast the compressive strength of self-compacting concrete. The surrogate models were developed using Gradient Boosting Machine (GBM), Support Vector Machine (SVM), Random Forest (RF), and Gaussian Process Regression (GPR) techniques. Compressive strength is used as the output variable, with nano silica content, cement content, coarse aggregate content, fine aggregate content, superplasticizer, curing duration, and water-binder ratio as input variables. Of the four models, GBM had the highest accuracy in determining the compressive strength of SCC. The concrete's compressive strength is worst predicted by GPR. Compressive strength of SCC with nano silica is found to be most affected by curing time and least by fine aggregate.

Empirical evaluations for predicting the damage of FRC wall subjected to close-in explosions

  • Duc-Kien Thai;Thai-Hoan Pham;Duy-Liem Nguyen;Tran Minh Tu;Phan Van Tien
    • Steel and Composite Structures
    • /
    • v.49 no.1
    • /
    • pp.65-79
    • /
    • 2023
  • This paper presents a development of empirical evaluations, which can be used to evaluate the damage of fiber-reinforced concrete composites (FRC) wall subjected to close-in blast loads. For this development, a combined application of numerical simulation and machine learning approaches are employed. First, finite element modeling of FRC wall under blast loading is developed and verified using experimental data. Numerical analyses are then carried out to investigate the dynamic behavior of the FRC wall under blast loading. In addition, a data set of 384 samples on the damage of FRC wall due to blast loads is then produced in order to develop machine learning models. Second, three robust machine learning models of Random Forest (RF), Support Vector Machine (SVM), and Extreme Gradient Boosting (XGBoost) are employed to propose empirical evaluations for predicting the damage of FRC wall. The proposed empirical evaluations are very useful for practical evaluation and design of FRC wall subjected to blast loads.

Machine learning-based probabilistic predictions of shear resistance of welded studs in deck slab ribs transverse to beams

  • Vitaliy V. Degtyarev;Stephen J. Hicks
    • Steel and Composite Structures
    • /
    • v.49 no.1
    • /
    • pp.109-123
    • /
    • 2023
  • Headed studs welded to steel beams and embedded within the concrete of deck slabs are vital components of modern composite floor systems, where safety and economy depend on the accurate predictions of the stud shear resistance. The multitude of existing deck profiles and the complex behavior of studs in deck slab ribs makes developing accurate and reliable mechanical or empirical design models challenging. The paper addresses this issue by presenting a machine learning (ML) model developed from the natural gradient boosting (NGBoost) algorithm capable of producing probabilistic predictions and a database of 464 push-out tests, which is considerably larger than the databases used for developing existing design models. The proposed model outperforms models based on other ML algorithms and existing descriptive equations, including those in EC4 and AISC 360, while offering probabilistic predictions unavailable from other models and producing higher shear resistances for many cases. The present study also showed that the stud shear resistance is insensitive to the concrete elastic modulus, stud welding type, location of slab reinforcement, and other parameters considered important by existing models. The NGBoost model was interpreted by evaluating the feature importance and dependence determined with the SHapley Additive exPlanations (SHAP) method. The model was calibrated via reliability analyses in accordance with the Eurocodes to ensure that its predictions meet the required reliability level and facilitate its use in design. An interactive open-source web application was created and deployed to the cloud to allow for convenient and rapid stud shear resistance predictions with the developed model.

Developing a Predictive Model of Young Job Seekers' Preference for Hidden Champions Using Machine Learning and Analyzing the Relative Importance of Preference Factors (머신러닝을 활용한 청년 구직자의 강소기업 선호 예측모형 개발 및 요인별 상대적 중요도 분석)

  • Cho, Yoon Ju;Kim, Jin Soo;Bae, Hwan seok;Yang, Sung-Byung;Yoon, Sang-Hyeak
    • The Journal of Information Systems
    • /
    • v.32 no.4
    • /
    • pp.229-245
    • /
    • 2023
  • Purpose This study aims to understand the inclinations of young job seekers towards "hidden champions" - small but competitive companies that are emerging as potential solutions to the growing disparity between youth-targeted job vacancies and job seekers. We utilize machine learning techniques to discern the appeal of these hidden champions. Design/methodology/approach We examined the characteristics of small and medium-sized enterprises using data sourced from the Ministry of Employment and Labor and Youth Worknet. By comparing the efficacy of five machine learning classification models (i.e., Logistic Regression, Random Forest Classifier, Gradient Boosting Classifier, LGBM Classifier, and XGB Classifier), we discovered that the predictive model utilizing the LGBM Classifier yielded the most consistent performance. Findings Our analysis of the relative significance of preference determinants revealed that industry type, geographical location, and employee count are pivotal factors influencing preference. Drawing from these insights, we propose targeted strategic interventions for policymakers, hidden champions, and young job seekers.