• Title/Summary/Keyword: Random forests

Search Result 111, Processing Time 0.026 seconds

GeoAI-Based Forest Fire Susceptibility Assessment with Integration of Forest and Soil Digital Map Data

  • Kounghoon Nam;Jong-Tae Kim;Chang-Ju Lee;Gyo-Cheol Jeong
    • The Journal of Engineering Geology
    • /
    • v.34 no.1
    • /
    • pp.107-115
    • /
    • 2024
  • This study assesses forest fire susceptibility in Gangwon-do, South Korea, which hosts the largest forested area in the nation and constitutes ~21% of the country's forested land. With 81% of its terrain forested, Gangwon-do is particularly susceptible to wildfires, as evidenced by the fact that seven out of the ten most extensive wildfires in Korea have occurred in this region, with significant ecological and economic implications. Here, we analyze 480 historical wildfire occurrences in Gangwon-do between 2003 and 2019 using 17 predictor variables of wildfire occurrence. We utilized three machine learning algorithms—random forest, logistic regression, and support vector machine—to construct wildfire susceptibility prediction models and identify the best-performing model for Gangwon-do. Forest and soil map data were integrated as important indicators of wildfire susceptibility and enhanced the precision of the three models in identifying areas at high risk of wildfires. Of the three models examined, the random forest model showed the best predictive performance, with an area-under-the-curve value of 0.936. The findings of this study, especially the maps generated by the models, are expected to offer important guidance to local governments in formulating effective management and conservation strategies. These strategies aim to ensure the sustainable preservation of forest resources and to enhance the well-being of communities situated in areas adjacent to forests. Furthermore, the outcomes of this study are anticipated to contribute to the safeguarding of forest resources and biodiversity and to the development of comprehensive plans for forest resource protection, biodiversity conservation, and environmental management.

Predicting Photovoltaic Power Generation with Random Forests (랜덤 포레스트를 이용한 태양광 발전량 예측)

  • Lee, Woonghee;Kim, Younghoon
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2016.10a
    • /
    • pp.397-400
    • /
    • 2016
  • 태양광 발전 방식은 기존 고갈 가능성이 있는 에내지를 대체하기 위해 많은 개발이 이루어져왔다. 태양광 발전 모듈의 인버터에는 발전량에 영향을 주는 다양한 속성들이 계측되어 저장된다. 본 연구에서는 이런 데이터에, 발전량에 영향을 주는 외부 요인인 기상 데이터를 추가하고, 랜덤 포레스트를 써서 과거 몇일까지의 데이터를 고려했을 때 가장 예측 성능이 높은지 실험을 통해 검증하였다. 2일 전부터 최대 365일 전까지의 데이터를 고려한 결과 5일 정도의 과거 데이터를 고려했을 때 예측 성능이 가장 높고, 고려하는 기간이 길어질수록 예측 성능이 떨어지는 경향을 보였다.

Predicting the Performance of Forecasting Strategies for Naval Spare Parts Demand: A Machine Learning Approach

  • Moon, Seongmin
    • Management Science and Financial Engineering
    • /
    • v.19 no.1
    • /
    • pp.1-10
    • /
    • 2013
  • Hierarchical forecasting strategy does not always outperform direct forecasting strategy. The performance generally depends on demand features. This research guides the use of the alternative forecasting strategies according to demand features. This paper developed and evaluated various classification models such as logistic regression (LR), artificial neural networks (ANN), decision trees (DT), boosted trees (BT), and random forests (RF) for predicting the relative performance of the alternative forecasting strategies for the South Korean navy's spare parts demand which has non-normal characteristics. ANN minimized classification errors and inventory costs, whereas LR minimized the Brier scores and the sum of forecasting errors.

Classification ofWarm Temperate Vegetations and GIS-based Forest Management System

  • Cho, Sung-Min
    • International journal of advanced smart convergence
    • /
    • v.10 no.1
    • /
    • pp.216-224
    • /
    • 2021
  • Aim of this research was to classify forest types at Wando in Jeonnam Province and develop warm temperate forest management system with application of Remote Sensing and GIS. Another emphasis was given to the analysis of satellite images to compare forest type changes over 10 year periods from 2009 to 2019. We have accomplished this study by using ArcGIS Pro and ENVI. For this research, Landsat satellite images were obtained by means of terrestrial, airborne and satellite imagery. Based on the field survey data, all land uses and forest types were divided into 5 forest classes; Evergreen broad-leaved forest, Evergreen Coniferous forest, Deciduous broad-leaved forest, Mixed fores, and others. Supervised classification was carried out with a random forest classifier based on manually collected training polygons in ROI. Accuracy assessment of the different forest types and land-cover classifications was calculated based on the reference polygons. Comparison of forest changes over 10 year periods resulted in different vegetation biomass volumes, producing the loss of deciduous forests in 2019 probably due to the expansion of residential areas and rapid deforestation.

Applications of Machine Learning for Online Learning Systems towards Children with Speech Disorders

  • Jadi, Amr;Alzahrani, Ali
    • International Journal of Computer Science & Network Security
    • /
    • v.22 no.8
    • /
    • pp.55-60
    • /
    • 2022
  • Specific Language Impairment is one of the serious disorders that interferes with spontaneous communication skills in children. Children suffering from this disorder may have reading, speaking, or listening impairments, and such type of disorders are also termed Autism Speech Disorder (ASD) in medical terminology. The aim of the article is to define specific language impairment in children and the problems it can cause. The different methods adopted by speech pathologists to diagnose language impairment. Finally implementing machine learning models to automate the process and help speech pathologists and pediatricians/ in diagnosing the specific language impairment.

Emerging Machine Learning in Wearable Healthcare Sensors

  • Gandha Satria Adi;Inkyu Park
    • Journal of Sensor Science and Technology
    • /
    • v.32 no.6
    • /
    • pp.378-385
    • /
    • 2023
  • Human biosignals provide essential information for diagnosing diseases such as dementia and Parkinson's disease. Owing to the shortcomings of current clinical assessments, noninvasive solutions are required. Machine learning (ML) on wearable sensor data is a promising method for the real-time monitoring and early detection of abnormalities. ML facilitates disease identification, severity measurement, and remote rehabilitation by providing continuous feedback. In the context of wearable sensor technology, ML involves training on observed data for tasks such as classification and regression with applications in clinical metrics. Although supervised ML presents challenges in clinical settings, unsupervised learning, which focuses on tasks such as cluster identification and anomaly detection, has emerged as a useful alternative. This review examines and discusses a variety of ML algorithms such as Support Vector Machines (SVM), Random Forests (RF), Decision Trees (DT), Neural Networks (NN), and Deep Learning for the analysis of complex clinical data.

Optimized machine learning algorithms for predicting the punching shear capacity of RC flat slabs

  • Huajun Yan;Nan Xie;Dandan Shen
    • Advances in concrete construction
    • /
    • v.17 no.1
    • /
    • pp.27-36
    • /
    • 2024
  • Reinforced concrete (RC) flat slabs should be designed based on punching shear strength. As part of this study, machine learning (ML) algorithms were developed to accurately predict the punching shear strength of RC flat slabs without shear reinforcement. It is based on Bayesian optimization (BO), combined with four standard algorithms (Support vector regression, Decision trees, Random forests, Extreme gradient boosting) on 446 datasets that contain six design parameters. Furthermore, an analysis of feature importance is carried out by Shapley additive explanation (SHAP), in order to quantify the effect of design parameters on punching shear strength. According to the results, the BO method produces high prediction accuracy by selecting the optimal hyperparameters for each model. With R2 = 0.985, MAE = 0.0155 MN, RMSE = 0.0244 MN, the BO-XGBoost model performed better than the original XGBoost prediction, which had R2 = 0.917, MAE = 0.064 MN, RMSE = 0.121 MN in total dataset. Additionally, recommendations are provided on how to select factors that will influence punching shear resistance of RC flat slabs without shear reinforcement.

Data Mining based Forest Fires Prediction Models using Meteorological Data (기상 데이터를 이용한 데이터 마이닝 기반의 산불 예측 모델)

  • Kim, Sam-Keun;Ahn, Jae-Geun
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.21 no.8
    • /
    • pp.521-529
    • /
    • 2020
  • Forest fires are one of the most important environmental risks that have adverse effects on many aspects of life, such as the economy, environment, and health. The early detection, quick prediction, and rapid response of forest fires can play an essential role in saving property and life from forest fire risks. For the rapid discovery of forest fires, there is a method using meteorological data obtained from local sensors installed in each area by the Meteorological Agency. Meteorological conditions (e.g., temperature, wind) influence forest fires. This study evaluated a Data Mining (DM) approach to predict the burned area of forest fires. Five DM models, e.g., Stochastic Gradient Descent (SGD), Support Vector Machines (SVM), Decision Tree (DT), Random Forests (RF), and Deep Neural Network (DNN), and four feature selection setups (using spatial, temporal, and weather attributes), were tested on recent real-world data collected from Gyeonggi-do area over the last five years. As a result of the experiment, a DNN model using only meteorological data showed the best performance. The proposed model was more effective in predicting the burned area of small forest fires, which are more frequent. This knowledge derived from the proposed prediction model is particularly useful for improving firefighting resource management.

Developing a regional fog prediction model using tree-based machine-learning techniques and automated visibility observations (시정계 자료와 기계학습 기법을 이용한 지역 안개예측 모형 개발)

  • Kim, Daeha
    • Journal of Korea Water Resources Association
    • /
    • v.54 no.12
    • /
    • pp.1255-1263
    • /
    • 2021
  • While it could become an alternative water resource, fog could undermine traffic safety and operational performance of infrastructures. To reduce such adverse impacts, it is necessary to have spatially continuous fog risk information. In this work, tree-based machine-learning models were developed in order to quantify fog risks with routine meteorological observations alone. The Extreme Gradient Boosting (XGB), Light Gradient Boosting (LGB), and Random Forests (RF) were chosen for the regional fog models using operational weather and visibility observations within the Jeollabuk-do province. Results showed that RF seemed to show the most robust performance to categorize between fog and non-fog situations during the training and evaluation period of 2017-2019. While the LGB performed better than in predicting fog occurrences than the others, its false alarm ratio was the highest (0.695) among the three models. The predictability of the three models considerably declined when applying them for an independent period of 2020, potentially due to the distinctively enhanced air quality in the year under the global lockdown. Nonetheless, even in 2020, the three models were all able to produce fog risk information consistent with the spatial variation of observed fog occurrences. This work suggests that the tree-based machine learning models could be used as tools to find locations with relatively high fog risks.

Analysis of facial expression recognition (표정 분류 연구)

  • Son, Nayeong;Cho, Hyunsun;Lee, Sohyun;Song, Jongwoo
    • The Korean Journal of Applied Statistics
    • /
    • v.31 no.5
    • /
    • pp.539-554
    • /
    • 2018
  • Effective interaction between user and device is considered an important ability of IoT devices. For some applications, it is necessary to recognize human facial expressions in real time and make accurate judgments in order to respond to situations correctly. Therefore, many researches on facial image analysis have been preceded in order to construct a more accurate and faster recognition system. In this study, we constructed an automatic recognition system for facial expressions through two steps - a facial recognition step and a classification step. We compared various models with different sets of data with pixel information, landmark coordinates, Euclidean distances among landmark points, and arctangent angles. We found a fast and efficient prediction model with only 30 principal components of face landmark information. We applied several prediction models, that included linear discriminant analysis (LDA), random forests, support vector machine (SVM), and bagging; consequently, an SVM model gives the best result. The LDA model gives the second best prediction accuracy but it can fit and predict data faster than SVM and other methods. Finally, we compared our method to Microsoft Azure Emotion API and Convolution Neural Network (CNN). Our method gives a very competitive result.