• Title/Abstract/Keywords: Cross-Validation Approach

Search results: 130 items; processing time: 0.027 seconds

Using CNN-VGG 16 to detect the tennis motion tracking by information entropy and unascertained measurement theory

  • Zhong, Yongfeng;Liang, Xiaojun
    • Advances in nano research
    • /
    • Vol. 12, No. 2
    • /
    • pp.223-239
    • /
    • 2022
  • Object detection aims to find objects with particular properties or representations and to predict details such as their position, size, and angle of rotation in the current picture, and it has long been an important subject in computer vision. Although vision-based object tracking strategies for analyzing competitive sports videos have been developed, it remains difficult to accurately identify and locate a small, fast-moving ball. In this study, a deep learning (DL) network was developed to address these obstacles in the study of tennis motion tracking from a complex perspective, with the aim of understanding the performance of athletes. The study used CNN-VGG 16 to track the tennis ball in broadcast videos in which the ball image is distorted, thin, and often invisible, not only identifying the ball in a single frame but also learning patterns from consecutive frames. VGG 16 takes images of size 640 × 360 to locate the ball and obtains high accuracy on public videos. In testing, VGG 16 achieves accuracies of 99.6%, 96.63%, and 99.5%. To avoid overfitting, 9 additional videos and a subset of the previous dataset were partly labelled for 10-fold cross-validation. The results show that CNN-VGG 16 outperforms the standard approach by a wide margin and provides excellent ball tracking performance.

Form-finding of lifting self-forming GFRP elastic gridshells based on machine learning interpretability methods

  • Soheila, Kookalani;Sandy, Nyunn;Sheng, Xiang
    • Structural Engineering and Mechanics
    • /
    • Vol. 84, No. 5
    • /
    • pp.605-618
    • /
    • 2022
  • Glass fiber reinforced polymer (GFRP) elastic gridshells consist of long continuous GFRP tubes that undergo elastic deformation. In this paper, a method for the form-finding of gridshell structures is presented based on interpretable machine learning (ML) approaches. A comparative study is conducted on several ML algorithms, including support vector regression (SVR), K-nearest neighbors (KNN), decision tree (DT), random forest (RF), AdaBoost, XGBoost, category boosting (CatBoost), and light gradient boosting machine (LightGBM). A numerical example is presented using a standard double-hump gridshell, with two deformation characteristics as objective functions. Grid search combined with k-fold cross-validation (CV) is used to fine-tune the parameters of the ML models. The comparative study indicates that the LightGBM model gives the highest prediction accuracy. Finally, interpretable ML approaches, including Shapley additive explanations (SHAP), partial dependence plots (PDP), and accumulated local effects (ALE), are applied to explain the predictions of the ML model, since it is essential to understand how different values of the input parameters affect the objective functions. Using the interpretability approaches, an optimum gridshell structure is obtained and new opportunities are verified for form-finding investigation of GFRP elastic gridshells during lifting construction.
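
The combination of grid search and k-fold CV mentioned in the abstract above can be sketched as follows. This is a minimal illustration in pure Python, not the paper's implementation: the toy one-dimensional k-nearest-neighbors regressor, the synthetic data, the candidate grid, and all function names are assumptions made for the example.

```python
def knn_predict(train_x, train_y, x, k):
    """Predict y at x as the mean target of the k nearest training points."""
    nearest = sorted(range(len(train_x)), key=lambda i: abs(train_x[i] - x))[:k]
    return sum(train_y[i] for i in nearest) / k


def cv_mse(xs, ys, k_neighbors, n_folds):
    """Mean squared error of the k-NN regressor under n_folds-fold CV."""
    n = len(xs)
    total = 0.0
    for fold in range(n_folds):
        # Interleaved folds: sample i belongs to fold i % n_folds.
        test_idx = [i for i in range(n) if i % n_folds == fold]
        train_idx = [i for i in range(n) if i % n_folds != fold]
        train_x = [xs[i] for i in train_idx]
        train_y = [ys[i] for i in train_idx]
        for i in test_idx:
            pred = knn_predict(train_x, train_y, xs[i], k_neighbors)
            total += (ys[i] - pred) ** 2
    return total / n


def grid_search(xs, ys, candidates, n_folds=5):
    """Return the candidate with the lowest CV error, plus all scores."""
    scores = {k: cv_mse(xs, ys, k, n_folds) for k in candidates}
    best = min(scores, key=scores.get)
    return best, scores


# Synthetic data (hypothetical): y = x^2 sampled on a regular grid.
xs = [i / 10 for i in range(30)]
ys = [x * x for x in xs]
best_k, scores = grid_search(xs, ys, candidates=[1, 3, 5, 7])
print(best_k, scores)
```

Every candidate is scored only on held-out folds, so the selected hyperparameter is not chosen on data the model was fitted to.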

Gaussian process regression model to predict factor of safety of slope stability

  • Arsalan, Mahmoodzadeh;Hamid Reza, Nejati;Nafiseh, Rezaie;Adil Hussein, Mohammed;Hawkar Hashim, Ibrahim;Mokhtar, Mohammadi;Shima, Rashidi
    • Geomechanics and Engineering
    • /
    • Vol. 31, No. 5
    • /
    • pp.453-460
    • /
    • 2022
  • Geotechnical engineers need to study and predict the stability of slopes, since the collapse of a slope may result in catastrophic events. In this study, the Gaussian process regression (GPR) approach was used to predict the factor of safety (FOS) of slopes. The model uses a total of 327 slope cases from Iran, each with a unique combination of geometric and shear strength parameters, which were analyzed with PLAXIS software to determine their FOS. K-fold (K = 5) cross-validation (CV) was used to assess the accuracy of the model's predictions. The GPR model showed excellent ability to predict the FOS of slope stability, with an R² value of 0.8355, an RMSE of 0.1372, and a MAPE of 6.6389%. According to the results of the sensitivity analysis, the characteristics (friction angle) and (unit weight) are, in descending order, the most effective, the next most effective, and the least effective parameters for determining slope stability.
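
As a rough illustration of the evaluation described above, the sketch below shows how 327 cases split into K = 5 folds and how R², RMSE, and MAPE are computed from paired observations and predictions. The function names are hypothetical and the contiguous, unshuffled split is an assumption; the abstract does not say how the folds were formed.

```python
import math


def k_fold_indices(n_samples, k):
    """Split indices 0..n_samples-1 into k contiguous folds of near-equal size."""
    base, extra = divmod(n_samples, k)
    folds, start = [], 0
    for i in range(k):
        size = base + (1 if i < extra else 0)  # first `extra` folds get one more
        folds.append(list(range(start, start + size)))
        start += size
    return folds


def regression_metrics(y_true, y_pred):
    """Return (R^2, RMSE, MAPE in percent) for paired observations."""
    n = len(y_true)
    mean_y = sum(y_true) / n
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    r2 = 1.0 - ss_res / ss_tot
    rmse = math.sqrt(ss_res / n)
    mape = 100.0 * sum(abs((t - p) / t) for t, p in zip(y_true, y_pred)) / n
    return r2, rmse, mape


folds = k_fold_indices(327, 5)
print([len(f) for f in folds])  # 327 = 5 * 65 + 2, so [66, 66, 65, 65, 65]
```

Each fold serves once as the test set while the other four are used for training; in practice the indices would usually be shuffled before splitting.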

Enhancing prediction accuracy of concrete compressive strength using stacking ensemble machine learning

  • Yunpeng Zhao;Dimitrios Goulias;Setare Saremi
    • Computers and Concrete
    • /
    • Vol. 32, No. 3
    • /
    • pp.233-246
    • /
    • 2023
  • Accurate prediction of concrete compressive strength can minimize the need for extensive, time-consuming, and costly mixture optimization testing and analysis. This study attempts to enhance the prediction accuracy of compressive strength using stacking ensemble machine learning (ML) with feature engineering techniques. Seven alternative ML models of increasing complexity were implemented and compared: linear regression, SVM, decision tree, multilayer perceptron, random forest, XGBoost, and AdaBoost. To further improve the prediction accuracy, an ML pipeline was proposed in which feature engineering was implemented and a two-layer stacked model was developed. The k-fold cross-validation approach was employed to optimize model parameters and train the stacked model. The stacked model showed superior performance in predicting concrete compressive strength, with a coefficient of determination (R²) of 0.985. Feature (i.e., variable) importance was determined to demonstrate how useful the synthetic features are in prediction and to provide better interpretability of the data and the model. The methodology in this study promotes a more thorough assessment of alternative ML algorithms rather than a focus on any single ML model type for concrete compressive strength prediction.

Diagnosis Atherosclerosis Model Using Radiomics Approach in Carotid Vessel MRI

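  • Kim, Jonghun;Park, Hyunjin
    • Proceedings of the Korea Institute of Information and Communication Engineering Conference
    • /
    • Korea Institute of Information and Communication Engineering 2022 Fall Conference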
    • /
    • pp.289-290
    • /
    • 2022
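  • Atherosclerosis is a disease in which the wall of the carotid artery thickens, so monitoring vessel wall thickness is important for diagnosis. In this study, we extract 324 radiomics features from carotid MRI images and propose a model that diagnoses atherosclerosis using machine learning. Four classification models (logistic regression, support vector machine, random forest, and XGBoost) were trained on the radiomics features. In 5-fold cross-validation, XGBoost, the best-performing model, achieved an accuracy of 0.9023, a sensitivity of 0.9517, a specificity of 0.8035, and an AUC of 0.8776.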
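
For readers unfamiliar with the metrics reported above, the following minimal sketch computes accuracy, sensitivity, and specificity from binary labels. The function name and the toy labels are assumptions for illustration and are unrelated to the study's data; AUC is omitted because it requires predicted scores rather than hard labels.

```python
def binary_metrics(y_true, y_pred):
    """Return (accuracy, sensitivity, specificity) for 0/1 labels, 1 = positive."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives
    accuracy = (tp + tn) / len(pairs)
    sensitivity = tp / (tp + fn)  # fraction of actual positives detected
    specificity = tn / (tn + fp)  # fraction of actual negatives rejected
    return accuracy, sensitivity, specificity


# Toy labels (hypothetical): 3 positive cases and 2 negative cases.
acc, sens, spec = binary_metrics([1, 1, 1, 0, 0], [1, 1, 0, 0, 1])
print(acc, sens, spec)
```

A sensitivity well above the specificity, as in the reported results, means the model misses few positive cases at the cost of more false positives.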

Online analysis of iron ore slurry using PGNAA technology with artificial neural network

  • Haolong Huang;Pingkun Cai;Xuwen Liang;Wenbao Jia
    • Nuclear Engineering and Technology
    • /
    • Vol. 56, No. 7
    • /
    • pp.2835-2841
    • /
    • 2024
  • Real-time analysis of metallic mineral grade and slurry concentration is significant for improving flotation efficiency and product quality. This study proposes an online detection method for ore slurry that combines Prompt Gamma Neutron Activation Analysis (PGNAA) technology with an artificial neural network (ANN) and can provide mineral information rapidly and accurately. First, a PGNAA analyzer based on a D-T neutron generator and a BGO detector was used to obtain a gamma-ray spectrum dataset of ore slurry samples, which was used to construct and optimize the ANN model for adaptive analysis. The evaluation metrics calculated by leave-one-out cross-validation indicated that, compared with the weighted library least squares (WLLS) approach, the ANN obtained more precise and stable results, with mean absolute percentage errors of 4.66% and 2.80% for Fe grade and slurry concentration, respectively, and a highest average standard deviation of only 0.0119. Meanwhile, the analytical errors of the samples most affected by matrix effects were reduced to 0.61 times and 0.56 times those of the WLLS method, respectively.
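
Leave-one-out cross-validation, as used above, holds out each sample once and trains on all the others. The sketch below illustrates the procedure with a one-variable least-squares line standing in for the ANN; the stand-in model, the toy data, and the function names are assumptions and do not reflect the study's setup.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def loocv_mape(xs, ys):
    """Mean absolute percentage error (in percent) under leave-one-out CV."""
    errors = []
    for i in range(len(xs)):
        # Train on every sample except i, then predict the held-out sample.
        slope, intercept = fit_line(xs[:i] + xs[i + 1:], ys[:i] + ys[i + 1:])
        pred = slope * xs[i] + intercept
        errors.append(abs((ys[i] - pred) / ys[i]))
    return 100.0 * sum(errors) / len(errors)


# Toy data (hypothetical): roughly linear with small deviations.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.1, 3.9, 6.2, 7.8, 10.1]
print(loocv_mape(xs, ys))
```

With n samples the model is refitted n times, which is affordable for small datasets and uses nearly all the data for training in every round.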

An Ensemble Approach to Detect Fake News Spreaders on Twitter

  • Sarwar, Muhammad Nabeel;UlAmin, Riaz;Jabeen, Sidra
    • International Journal of Computer Science & Network Security
    • /
    • Vol. 22, No. 5
    • /
    • pp.294-302
    • /
    • 2022
  • Detecting fake news is a complex and challenging task. The generation of fake news is very hard to stop; only steps to control its circulation may help minimize its impact. Humans tend to believe misleading false information. Researchers started with social media sites, categorizing content as real or fake news. False information can mislead an individual or an organization and may cause major failures and financial loss. Automatic detection of false information circulating on social media is an emerging area of research that has been gaining attention from both industry and academia since the 2016 US presidential election. Fake news has negative and severe effects on individuals and organizations, extending its hostile effects to society, so predicting fake news in a timely manner is important. This research focuses on detecting fake news spreaders. In this context, 6 models in total were developed, trained, and tested on the PAN 2020 dataset. Four N-gram-based approaches and the user statistics-based models were trained with different hyperparameter values. Extensive grid search with cross-validation was applied to each machine learning model. For the N-gram-based models, out of numerous machine learning models, this research focused on the algorithms yielding better results, as assessed by a close reading of state-of-the-art related work in the field. For better accuracy, the authors developed models using Random Forest, Logistic Regression, SVM, and XGBoost. All four machine learning algorithms were trained with hyperparameters selected by cross-validated grid search. The advantages of this research over previous work are the user statistics-based model and the ensemble learning model, which were designed to help classify Twitter users as fake news spreaders or not with the highest reliability. The user statistics model used 17 features, on the basis of which it categorized a Twitter user as malicious. A new dataset based on the predictions of the machine learning models was constructed. Three techniques (simple mean, logistic regression, and random forest) were then applied in combination with the ensemble model. Logistic regression combined in the ensemble model gave the best training and testing results, achieving an accuracy of 72%.

Predicting restraining effects in CFS channels: A machine learning approach

  • Seyed Mohammad Mojtabaei;Rasoul Khandan;Iman Hajirasouliha
    • Steel and Composite Structures
    • /
    • Vol. 51, No. 4
    • /
    • pp.441-456
    • /
    • 2024
  • This paper aims to develop Machine Learning (ML) algorithms to predict the buckling resistance of cold-formed steel (CFS) channels with restrained flanges, widely used in typical CFS sheathed wall panels, and to provide practical design tools for engineers. The effects of cross-sectional restraints were first evaluated on the elastic buckling behaviour of CFS channels subjected to pure axial compressive load or bending moment. Feedforward multi-layer Artificial Neural Networks (ANNs) were then trained on different datasets comprising CFS channels with various dimensions and properties, plate thicknesses, and restraining conditions on one or two flanges, while the elastic distortional buckling resistance of the elements was determined according to the Finite Strip Method (FSM). To develop less biased networks and ensure that every observation from the original dataset has the chance of appearing in the training and test sets, a K-fold cross-validation technique was implemented. In addition, the hyperparameters of the ANNs were tuned using a grid search technique to obtain optimum performance. The results demonstrated that the trained ANNs were able to predict the elastic distortional buckling resistance of CFS flange-restrained elements with an average accuracy of 99% in terms of coefficient of determination. The developed models were then used to propose a simple ANN-based design formula for the prediction of the elastic distortional buckling stress of CFS flange-restrained elements. Finally, the proposed formula was further evaluated on a separate set of unseen data to ensure its accuracy for practical applications.

A Correction of East Asian Summer Precipitation Simulated by PNU/CME CGCM Using Multiple Linear Regression

  • Hwang, Yoon-Jeong;Ahn, Joong-Bae
    • Journal of the Korean Earth Science Society
    • /
    • Vol. 28, No. 2
    • /
    • pp.214-226
    • /
    • 2007
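  • Because precipitation arises from the influence of many atmospheric variables, it is strongly nonlinear. Correcting precipitation predicted by a dynamical model should therefore be possible with a nonlinear model such as an artificial neural network, but neural networks are limited by issues such as the choice of initial weights, local minima, and the determination of the number of neurons. In this study, we therefore used the most commonly used multiple linear regression model to correct the precipitation simulated by the CGCM and examined its predictability. To this end, we first performed ensemble integrations with the PNU/CME Coupled General Circulation Model (CGCM) (Park and Ahn, 2004) for the five months from April to August of each year from 1979 to 2005. Multiple Linear Regression (MLR) was used to correct the precipitation simulated by the PNU/CME CGCM over the Northeast Asia region including the Korean Peninsula $(110^{\circ}E-145^{\circ}E,\;25^{\circ}N-55^{\circ}N)$ for the summer months of June (lead 2), July (lead 3), and August (lead 4) and for the summer mean, JJA (June to August). The MLR model was built from the linear relationships between observed precipitation and predictors taken from the PNU/CME CGCM output, such as precipitation, 500 hPa vertical velocity, 200 hPa divergence, and surface air temperature. Cross-validation was then performed, and the PNU/CME CGCM results were compared with the cross-validated results. Examination of the correlation coefficient, hit rate, false alarm rate, and Heidke skill score showed that the precipitation predictability of the results corrected with the MLR model was superior to that of the uncorrected model output.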
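
The categorical scores named above can be computed from a contingency table. The sketch below assumes a two-category (event / no event) table; the abstract does not state how many categories the study used, and the function name and counts are hypothetical.

```python
def categorical_scores(hits, false_alarms, misses, correct_negatives):
    """Return (hit rate, false alarm rate, Heidke skill score) for a 2x2 table."""
    a, b, c, d = hits, false_alarms, misses, correct_negatives
    hit_rate = a / (a + c)          # fraction of observed events that were forecast
    false_alarm_rate = b / (b + d)  # fraction of observed non-events forecast as events
    # Heidke skill score: accuracy relative to that expected by chance.
    hss = 2.0 * (a * d - b * c) / ((a + c) * (c + d) + (a + b) * (b + d))
    return hit_rate, false_alarm_rate, hss


# Hypothetical counts for illustration only.
h, f, hss = categorical_scores(hits=10, false_alarms=0, misses=0, correct_negatives=10)
print(h, f, hss)  # a perfect forecast gives 1.0 0.0 1.0
```

A Heidke skill score of 1 indicates a perfect forecast, 0 indicates no skill beyond chance, and negative values indicate performance worse than chance.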

Implementation on the evolutionary machine learning approaches for streamflow forecasting: case study in the Seybous River, Algeria

  • Zakhrouf, Mousaab;Bouchelkia, Hamid;Stamboul, Madani;Kim, Sungwon;Singh, Vijay
    • Journal of Korea Water Resources Association
    • /
    • Vol. 53, No. 6
    • /
    • pp.395-408
    • /
    • 2020
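  • This paper develops and applies three different machine learning approaches (artificial neural networks, adaptive neuro-fuzzy systems, and wavelet-based neural networks), combined with evolutionary optimization and k-fold cross-validation, for multi-day-ahead streamflow forecasting in a river basin located in Algeria, North Africa. Based on four statistical indices, namely root mean squared error (RMSE), Nash-Sutcliffe efficiency (NSE), correlation coefficient (R), and peak flow criteria (PFC), the artificial neural network and the adaptive neuro-fuzzy system showed similar performance in training and testing. For one-day-ahead testing, the wavelet-based neural network gave RMSE = 8.590 m³/s and PFC = 0.252, better than the artificial neural network (RMSE = 19.120 m³/s, PFC = 0.446) and the adaptive neuro-fuzzy system (RMSE = 18.520 m³/s, PFC = 0.444), and its NSE and R values were also superior. The wavelet-based neural network can therefore be used as an efficient tool for multi-day-ahead forecasting in the Seybous River, Algeria.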
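
Two of the indices used above, RMSE and NSE, follow directly from their standard definitions. The sketch below is illustrative only: the function name and the toy flow values are assumptions, and PFC is omitted because the abstract does not define it.

```python
import math


def rmse_and_nse(observed, simulated):
    """Return (RMSE, Nash-Sutcliffe efficiency) for paired flow series."""
    n = len(observed)
    mean_obs = sum(observed) / n
    ss_res = sum((o - s) ** 2 for o, s in zip(observed, simulated))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    rmse = math.sqrt(ss_res / n)
    nse = 1.0 - ss_res / ss_tot  # 1 is perfect; 0 matches predicting the mean
    return rmse, nse


# Toy flows (hypothetical), in m³/s.
obs = [1.0, 2.0, 3.0]
print(rmse_and_nse(obs, [1.0, 2.0, 3.0]))  # perfect simulation: (0.0, 1.0)
print(rmse_and_nse(obs, [2.0, 2.0, 2.0]))  # mean-only simulation: NSE = 0.0
```

RMSE is in the units of the flow itself, while NSE is dimensionless, which is why studies often report both.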