Search | Korea Science

Prediction of Global Industrial Water Demand using Machine Learning

Panda, Manas Ranjan;Kim, Yeonjoo
- Proceedings of the Korea Water Resources Association Conference
- /
- 2022.05a
- /
- pp.156-156
- /
- 2022
Spatially explicit, reliable data on industrial water demand are important for both policy makers and researchers carrying out region-specific analyses of water resources management. However, such data remain scarce, particularly in underdeveloped and developing countries. Current research makes limited use of the spatially available socio-economic, climate, and geographical data from different sources to predict industrial water demand at finer resolution. This study proposes a random forest regression (RFR) model to predict industrial water demand at 0.50° × 0.50° spatial resolution by combining features extracted from multiple data sources. The datasets used include National Polar-orbiting Partnership (NPP)/Visible Infrared Imaging Radiometer Suite (VIIRS) night-time light (NTL), the Global Power Plant database, AQUASTAT country-wise industrial water use data, elevation, gross domestic product (GDP), road density, cropland, population, precipitation, temperature, and aridity. Compared with traditional regression algorithms, RF offers high prediction accuracy, requires no assumption of a prior probability distribution, and can analyze variable importance. The final RF model was fitted with the parameter settings ntree = 300 and mtry = 2, achieving a coefficient of determination of 0.547. Among the independent variables used to train the RF model, night-time light, elevation, GDP, and population play the major role in predicting industrial water demand.

Research on Covert Communication Technology Based on Matrix Decomposition of Digital Currency Transaction Amount

Lejun Zhang;Bo Zhang;Ran Guo;Zhujun Wang;Guopeng Wang;Jing Qiu;Shen Su;Yuan Liu;Guangxia Xu;Zhihong Tian;Sergey Gataullin
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.18 no.4
- /
- pp.1020-1041
- /
- 2024
With the development of covert communication technologies, the number of covert communication technologies using blockchain as a carrier is increasing. However, using the transaction amount of digital currency as a carrier for covert communication has problems such as low embedding rate, large consumption of transaction amount, and easy detection. In this paper, firstly, by experimentally analyzing the distribution of bitcoin transaction amounts, we determine the most suitable range of amounts for matrix decomposition. Secondly, we design a novel matrix decomposition method that can successfully decompose a large amount matrix into two small amount matrices and utilize the elements in the small amount matrices for covert communication. Finally, we analyze the feasibility of the novel matrix decomposition method in this scheme in detail from four aspects, and verify it by experimental comparison, which proves that our scheme not only improves the embedding rate and reduces the consumption of transaction amount, but also has a certain degree of resistance to detection.
https://doi.org/10.3837/tiis.2024.04.011

Predicting concrete's compressive strength through three hybrid swarm intelligent methods

Zhang Chengquan;Hamidreza Aghajanirefah;Kseniya I. Zykova;Hossein Moayedi;Binh Nguyen Le
- Computers and Concrete
- /
- v.32 no.2
- /
- pp.149-163
- /
- 2023
One of the main design parameters traditionally used in geotechnical engineering projects is the uniaxial compressive strength. The present paper employed three artificial intelligence methods, i.e., stochastic fractal search (SFS), multi-verse optimization (MVO), and the vortex search algorithm (VSA), to determine the compressive strength of concrete (CSC). To this end, 1030 concrete specimens were subjected to compressive strength tests. Based on the laboratory results, fly ash, cement, water, slag, coarse aggregates, fine aggregates, and SP were tested as input parameters of the model to determine the optimum input configuration for estimating the compressive strength. Performance was evaluated using three criteria: the root mean square error (RMSE), the mean absolute error (MAE), and the determination coefficient (R²). The error criteria and determination coefficients obtained from the three techniques indicate that the SFS-MLP technique outperformed the MVO-MLP and VSA-MLP methods. The developed artificial neural network models exhibit larger errors and lower correlation coefficients in comparison with other models. Nonetheless, the stochastic fractal search algorithm considerably improved the precision and accuracy of the evaluations conducted through the artificial neural network and enhanced its performance. According to the results, the SFS-MLP technique performed best in estimating the compressive strength of concrete (R²=0.99932 and 0.99942, and RMSE=0.32611 and 0.24922). The novelty of our study is the use of a large dataset of 1030 entries and the optimization of the learning scheme of the neural prediction model with a 20:80 testing-to-training data split.
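The three evaluation criteria named in this abstract can be sketched as follows. This is a minimal illustration, assuming the usual textbook definitions; the abstract does not say whether R² was computed as 1 - SS_res/SS_tot (used here) or as a squared correlation coefficient, and the strength values are hypothetical, not taken from the paper.

```python
import math

def rmse(observed, predicted):
    """Root mean square error."""
    n = len(observed)
    return math.sqrt(sum((o - p) ** 2 for o, p in zip(observed, predicted)) / n)

def mae(observed, predicted):
    """Mean absolute error."""
    n = len(observed)
    return sum(abs(o - p) for o, p in zip(observed, predicted)) / n

def r_squared(observed, predicted):
    """Determination coefficient as 1 - SS_res / SS_tot (assumed definition)."""
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot

# Hypothetical compressive strengths in MPa (illustrative only).
measured = [30.0, 40.0, 50.0, 60.0]
estimated = [32.0, 38.0, 51.0, 59.0]

print(rmse(measured, estimated))       # square root of 2.5
print(mae(measured, estimated))        # 1.5
print(r_squared(measured, estimated))  # 0.98
```

RMSE penalizes large individual errors more heavily than MAE, which is one reason the two are often reported together.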
https://doi.org/10.12989/cac.2023.32.2.149

Visit Push Motivation for a Trading Area and Flow, Satisfaction, and Revisit Intention (상권방문 추진동기와 몰입, 만족, 재방문 의도)

Lee, Soo-Duck;Lee, Yong-Ki
- Journal of Distribution Science
- /
- v.16 no.9
- /
- pp.65-77
- /
- 2018
Purpose - A trading area is closely related to consumer life. It is a cultural and social space where people consume culture and build human relationships, as well as an economic space where consumers live their daily lives. Trading area research should therefore be conducted objectively and empirically because it deals with the activities of consumers' lives. The purpose of this study is to identify the intrinsic psychological motivation (push motivation) that arises when consumers visit a trading area and to demonstrate how push motivation for a trading area influences consumers' flow, satisfaction, and revisit intention. Research design, data, and methodology - The push motivation scale was developed through the following procedure: (1) generating an initial pool of items based on previous studies, (2) expert judgement to evaluate content and face validity, and (3) assessing convergent and discriminant validity using confirmatory factor analysis. Online surveys were conducted with frequent or familiar visitors to the trading areas around the Gangnam, Kunkuk University and Hongik University Stations. Of the 1,343 questionnaires collected, 1,157 were analyzed using the SPSS 22.0 and SmartPLS 3.0 statistical packages after excluding 186 responses judged to be insincere. Results - Push motivation was classified into five sub-dimensions (excitement/stimulus, rest/relaxation, exit/refreshing, knowledge/learning, and human relationship promotion), forming a multidimensional and complex construct composed of individual and social dimensions. The excitement/stimulus and human relationship promotion dimensions have positive effects on satisfaction, while all dimensions of push motivation have positive effects on flow. Flow in turn has a positive effect on satisfaction and revisit intention.
Meanwhile, the mediation test using bootstrapping shows that flow fully mediates the relationships between rest/relaxation, exit/refreshing, and knowledge/learning and satisfaction, but only partially mediates those between excitement/stimulus and human relationship promotion and satisfaction. Finally, satisfaction plays a partial mediating role between flow and revisit intention. Conclusions - This study shows that push motivation is multidimensional and composite, depending on the consumer's situation. In addition, human relationship promotion (a social motivation) has a much greater effect on flow and satisfaction than the individual-dimension push motivations. The study also shows that satisfaction increases when consumers experience flow during their visit, and that revisit intention grows as satisfaction increases. As implications, marketers should first try to understand consumers' visit motivation and then develop factors that increase their flow, satisfaction, and revisit intention. Marketers also need to approach trading areas more objectively and empirically, based on consumer psychology and behavior, in order to establish a proper and efficient strategy for trading area development.
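The bootstrapped mediation test mentioned in this abstract can be illustrated with a small sketch. The study itself used SmartPLS (PLS-SEM); the code below swaps in plain least-squares regression as a stand-in, and the variable names, effect sizes, and synthetic data are all hypothetical. It estimates the indirect effect a × b (motivation to flow, then flow to satisfaction controlling for motivation) and a percentile bootstrap interval; an interval that excludes zero indicates mediation.

```python
import random

def slope(x, y):
    """Ordinary least squares slope of y on x (with intercept)."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    return sxy / sxx

def residuals(x, y):
    """Part of y that a regression on x does not explain."""
    b = slope(x, y)
    mx, my = sum(x) / len(x), sum(y) / len(y)
    return [(yi - my) - b * (xi - mx) for xi, yi in zip(x, y)]

def indirect_effect(x, m, y):
    """a * b, where a is the X -> M path and b is M -> Y controlling for X."""
    a = slope(x, m)
    # Frisch-Waugh: the partial coefficient of M equals the slope between
    # the residuals of Y and of M after each is regressed on X.
    b = slope(residuals(x, m), residuals(x, y))
    return a * b

def bootstrap_ci(x, m, y, n_boot=1000, alpha=0.05, seed=0):
    """Percentile bootstrap confidence interval for the indirect effect."""
    rng = random.Random(seed)
    n = len(x)
    estimates = []
    for _ in range(n_boot):
        idx = [rng.randrange(n) for _ in range(n)]  # resample respondents
        estimates.append(indirect_effect([x[i] for i in idx],
                                         [m[i] for i in idx],
                                         [y[i] for i in idx]))
    estimates.sort()
    return (estimates[int(n_boot * alpha / 2)],
            estimates[int(n_boot * (1 - alpha / 2)) - 1])

# Synthetic data with mediation only (no direct path), purely illustrative.
rng = random.Random(42)
motivation = [rng.gauss(0, 1) for _ in range(200)]
flow = [0.8 * v + rng.gauss(0, 1) for v in motivation]
satisfaction = [0.7 * f + rng.gauss(0, 1) for f in flow]

ci_low, ci_high = bootstrap_ci(motivation, flow, satisfaction)
print(ci_low, ci_high)
```

Whether mediation is full or partial then depends on the direct path: if the direct effect of motivation on satisfaction is not significant once flow is included, the mediation is called full, otherwise partial.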
https://doi.org/10.15722/jds.16.9.201809.65

A Study on Efficient AI Model Drift Detection Methods for MLOps (MLOps를 위한 효율적인 AI 모델 드리프트 탐지방안 연구)

Ye-eun Lee;Tae-jin Lee
- Journal of Internet Computing and Services
- /
- v.24 no.5
- /
- pp.17-27
- /
- 2023
Today, as AI (Artificial Intelligence) technology develops and becomes more practical, it is widely used in various real-life application fields. An AI model is trained on the statistical properties of its training data and then deployed to a system, but unexpected changes in rapidly changing data degrade the model's performance. In particular, as finding drift signals in deployed models becomes important for responding to the new and unknown attacks that constantly emerge in the security field, the need for lifecycle management of the entire model is growing. In general, drift can be detected through changes in the model's accuracy and error rate (loss), but this approach is limited in practice because it requires actual labels for the model's predictions, and the point where drift actually occurs remains uncertain. This is because the model's error rate is strongly influenced by external environmental factors, model selection and parameter settings, and new input data, so there are limits to determining precisely when actual drift in the data occurs from that value alone. Therefore, this paper proposes a method to detect when actual drift occurs through an anomaly analysis technique based on XAI (eXplainable Artificial Intelligence). In tests on a classification model that detects DGA (Domain Generation Algorithm), anomaly scores were extracted from the SHAP (Shapley Additive exPlanations) values of post-deployment data, confirming that efficient drift point detection was possible.
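The idea of locating a drift point from a stream of anomaly scores, without labels, can be sketched as below. This is not the authors' method: the abstract does not describe how anomaly scores are derived from SHAP values or how the drift point is chosen, so the sketch simply assumes one anomaly score per batch is already available and applies a hypothetical rule (sliding-window mean above baseline mean plus k standard deviations). The function name, window size, and k are illustrative.

```python
from statistics import mean, stdev

def first_drift_index(scores, baseline_size, window=5, k=3.0):
    """Return the index of the newest score in the first sliding window whose
    mean exceeds baseline mean + k * baseline standard deviation, or None."""
    baseline = scores[:baseline_size]
    threshold = mean(baseline) + k * stdev(baseline)
    for start in range(baseline_size, len(scores) - window + 1):
        if mean(scores[start:start + window]) > threshold:
            return start + window - 1  # position at which the alarm fires
    return None

# Hypothetical per-batch anomaly scores (illustrative only).
scores = [0.10, 0.12, 0.09, 0.11, 0.10, 0.13, 0.08, 0.11, 0.10, 0.12,  # baseline
          0.11, 0.10, 0.12, 0.09, 0.11,                                # stable
          0.40, 0.45, 0.50, 0.48, 0.52, 0.47]                          # shifted

drift_at = first_drift_index(scores, baseline_size=10)
print(drift_at)  # 15, the first shifted batch
```

Because the rule needs only the scores themselves, it avoids the dependence on ground-truth labels that the abstract identifies as a limitation of accuracy-based monitoring.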
https://doi.org/10.7472/jksii.2023.24.5.17

Retrieval of Hourly Aerosol Optical Depth Using Top-of-Atmosphere Reflectance from GOCI-II and Machine Learning over South Korea (GOCI-II 대기상한 반사도와 기계학습을 이용한 남한 지역 시간별 에어로졸 광학 두께 산출)

Seyoung Yang;Hyunyoung Choi;Jungho Im
- Korean Journal of Remote Sensing
- /
- v.39 no.5_3
- /
- pp.933-948
- /
- 2023
Atmospheric aerosols not only have adverse effects on human health but also exert direct and indirect impacts on the climate system. Consequently, it is imperative to comprehend the characteristics and spatiotemporal distribution of aerosols. Numerous research endeavors have been undertaken to monitor aerosols, predominantly through the retrieval of aerosol optical depth (AOD) via satellite-based observations. Nonetheless, this approach primarily relies on a look-up table-based inversion algorithm, characterized by computationally intensive operations and associated uncertainties. In this study, a novel high-resolution AOD direct retrieval algorithm, leveraging machine learning, was developed using top-of-atmosphere reflectance data derived from the Geostationary Ocean Color Imager-II (GOCI-II), in conjunction with their differences from the past 30-day minimum reflectance, and meteorological variables from numerical models. The Light Gradient Boosting Machine (LGBM) technique was harnessed, and the resultant estimates underwent rigorous validation encompassing random, temporal, and spatial N-fold cross-validation (CV) using ground-based observation data from Aerosol Robotic Network (AERONET) AOD. The three CV results consistently demonstrated robust performance, yielding R²=0.70-0.80, RMSE=0.08-0.09, and within the expected error (EE) of 75.2-85.1%. The Shapley Additive exPlanations(SHAP) analysis confirmed the substantial influence of reflectance-related variables on AOD estimation. A comprehensive examination of the spatiotemporal distribution of AOD in Seoul and Ulsan revealed that the developed LGBM model yielded results that are in close concordance with AERONET AOD over time, thereby confirming its suitability for AOD retrieval at high spatiotemporal resolution (i.e., hourly, 250 m). 
Furthermore, upon comparing data coverage, it was ascertained that the LGBM model enhanced data retrieval frequency by approximately 8.8% in comparison to the GOCI-II L2 AOD products, ameliorating issues associated with excessive masking over very illuminated surfaces that are often encountered in physics-based AOD retrieval processes.
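The "within expected error (EE)" percentage reported in this abstract can be illustrated with a short sketch. The abstract does not state the EE envelope; the code assumes the form ±(0.05 + 0.15 × reference AOD) that is commonly used for satellite AOD validation over land, and the AOD values are hypothetical.

```python
def within_expected_error(retrieved, reference, abs_term=0.05, rel_term=0.15):
    """True if the retrieval falls inside the assumed EE envelope."""
    envelope = abs_term + rel_term * reference
    return abs(retrieved - reference) <= envelope

def ee_percentage(retrieved_values, reference_values):
    """Percentage of retrievals that fall within the EE envelope."""
    hits = sum(within_expected_error(r, ref)
               for r, ref in zip(retrieved_values, reference_values))
    return 100.0 * hits / len(reference_values)

# Hypothetical ground-based reference AOD and model retrievals (illustrative only).
reference_aod = [0.10, 0.20, 0.40, 0.80]
retrieved_aod = [0.12, 0.30, 0.45, 0.60]

print(ee_percentage(retrieved_aod, reference_aod))  # 50.0
```

The envelope widens with the reference AOD, so the same absolute error can count as a hit under heavy aerosol loading and a miss under clean conditions.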
https://doi.org/10.7780/kjrs.2023.39.5.3.5

Who Gets Government SME R&D Subsidy? Application of Gradient Boosting Model (Gradient Boosting 모형을 이용한 중소기업 R&D 지원금 결정요인 분석)

Kang, Sung Won;Kang, HeeChan
- The Journal of Society for e-Business Studies
- /
- v.25 no.4
- /
- pp.77-109
- /
- 2020
In this paper, we build a Gradient Boosting model to predict government SME R&D subsidy, select features of high importance, and measure the impact of each feature on the predicted subsidy using PDP and SHAP values. Unlike previous empirical research, we focus on the effect of the R&D subsidy distribution pattern on the incentives of the firms participating in subsidy competition. We used the firm data constructed by KISTEP, which link government R&D subsidy records with financial statements provided by NICE, and applied a Gradient Boosting model to predict R&D subsidy. We found that firms with higher R&D performance and larger R&D investment tend to have higher R&D subsidies, but firms with higher operating profit or total asset turnover rate tend to have lower R&D subsidies. Our results suggest that the current government R&D subsidy distribution pattern provides an incentive to improve R&D project performance, but not business performance.
https://doi.org/10.7838/jsebs.2020.25.4.077

Comparative Assessment of Linear Regression and Machine Learning for Analyzing the Spatial Distribution of Ground-level NO₂ Concentrations: A Case Study for Seoul, Korea (서울 지역 지상 NO₂ 농도 공간 분포 분석을 위한 회귀 모델 및 기계학습 기법 비교)

Kang, Eunjin;Yoo, Cheolhee;Shin, Yeji;Cho, Dongjin;Im, Jungho
- Korean Journal of Remote Sensing
- /
- v.37 no.6_1
- /
- pp.1739-1756
- /
- 2021
Atmospheric nitrogen dioxide (NO₂) is mainly caused by anthropogenic emissions. It contributes to the formation of secondary pollutants and ozone through chemical reactions, and adversely affects human health. Although ground stations monitor NO₂ concentrations in real time in Korea, it is difficult to analyze the spatial distribution of NO₂ concentrations from them, especially over areas with no stations. Therefore, this study conducted a comparative experiment of spatial interpolation of NO₂ concentrations based on two linear-regression methods (i.e., multi linear regression (MLR) and regression kriging (RK)) and two machine learning approaches (i.e., random forest (RF) and support vector regression (SVR)) for the year 2020. The four approaches were compared using leave-one-out cross-validation (LOOCV). The daily LOOCV results showed that MLR, RK, and SVR produced an average daily index of agreement (IOA) of 0.57, which was higher than that of RF (0.50). The average daily normalized root mean square error of RK was 0.9483%, which was slightly lower than those of the other models. MLR, RK, and SVR showed similar seasonal distribution patterns, and the dynamic range of the resultant NO₂ concentrations from these three models was similar, while that from RF was relatively small. The multivariate linear regression approaches are expected to be promising methods for spatial interpolation of ground-level NO₂ concentrations and other parameters in urban areas.
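Leave-one-out cross-validation of a spatial interpolator, scored with the index of agreement, can be sketched as follows. The IOA here follows Willmott's original definition, which is an assumption since the abstract does not give the formula. Inverse distance weighting is swapped in as a simple stand-in interpolator; it is not one of the four models compared in the study, and the station coordinates and concentrations are hypothetical.

```python
def index_of_agreement(observed, predicted):
    """Willmott's index of agreement d, bounded between 0 and 1."""
    mean_obs = sum(observed) / len(observed)
    squared_error = sum((p - o) ** 2 for o, p in zip(observed, predicted))
    potential_error = sum((abs(p - mean_obs) + abs(o - mean_obs)) ** 2
                          for o, p in zip(observed, predicted))
    return 1.0 - squared_error / potential_error

def idw(target, stations, power=2.0):
    """Inverse-distance-weighted estimate at `target` from (xy, value) pairs."""
    weight_sum = 0.0
    weighted_sum = 0.0
    for (x, y), value in stations:
        dist_sq = (x - target[0]) ** 2 + (y - target[1]) ** 2
        if dist_sq == 0.0:
            return value  # target coincides with a station
        weight = dist_sq ** (-power / 2.0)
        weight_sum += weight
        weighted_sum += weight * value
    return weighted_sum / weight_sum

def loocv(stations, interpolate):
    """Predict each station from all the others, holding one out at a time."""
    predictions = []
    for i, (xy, _) in enumerate(stations):
        others = stations[:i] + stations[i + 1:]
        predictions.append(interpolate(xy, others))
    return predictions

# Hypothetical station coordinates (km) and NO2 values (ppb), illustrative only.
stations = [((0.0, 0.0), 20.0), ((1.0, 0.0), 24.0), ((0.0, 1.0), 22.0),
            ((1.0, 1.0), 27.0), ((0.5, 0.5), 23.0)]
observed = [value for _, value in stations]
predicted = loocv(stations, idw)
ioa = index_of_agreement(observed, predicted)
print(predicted, ioa)
```

Holding out one station at a time mimics the real use case of estimating concentrations where no station exists, which is why LOOCV suits sparse monitoring networks.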
https://doi.org/10.7780/kjrs.2021.37.6.1.21

Improved Estimation of Hourly Surface Ozone Concentrations using Stacking Ensemble-based Spatial Interpolation (스태킹 앙상블 모델을 이용한 시간별 지상 오존 공간내삽 정확도 향상)

KIM, Ye-Jin;KANG, Eun-Jin;CHO, Dong-Jin;LEE, Si-Woo;IM, Jung-Ho
- Journal of the Korean Association of Geographic Information Studies
- /
- v.25 no.3
- /
- pp.74-99
- /
- 2022
Surface ozone is produced by photochemical reactions of nitrogen oxides (NOx) and volatile organic compounds (VOCs) emitted from vehicles and industrial sites, adversely affecting vegetation and the human body. In South Korea, ozone is monitored in real time at stations (i.e., point measurements), but it is difficult to monitor and analyze its continuous spatial distribution. In this study, hourly surface ozone concentrations were interpolated to a spatial resolution of 1.5 km using the stacking ensemble technique and evaluated with 5-fold cross-validation. The base models for the stacking ensemble were cokriging, multi-linear regression (MLR), random forest (RF), and support vector regression (SVR), while MLR was used as the meta model, taking all base model results as additional input variables. The results showed that the stacking ensemble model yielded better performance than the individual base models, with an average R of 0.76 and RMSE of 0.0065 ppm during the 2020 study period. Compared to those of the base models, the surface ozone concentration distribution generated by the stacking ensemble model had a wider range, with a spatial pattern similar to the terrain and urbanization variables. The proposed model is expected not only to produce the hourly spatial distribution of ozone but also to be highly applicable for calculating daily maximum 8-hour ozone concentrations.
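The daily maximum 8-hour ozone concentration mentioned at the end of this abstract can be computed from hourly values as sketched below. This is a minimal illustration: regulatory conventions differ on which 8-hour windows are counted for a given day, and the sketch simply uses every window that fits inside the values supplied. The hourly series is hypothetical and given in ppb for readability.

```python
def daily_max_8h_mean(hourly, window=8):
    """Largest running mean over `window` consecutive hourly values."""
    if len(hourly) < window:
        raise ValueError("need at least one full averaging window")
    running_means = [sum(hourly[i:i + window]) / window
                     for i in range(len(hourly) - window + 1)]
    return max(running_means)

# Hypothetical 24 hourly ozone values in ppb (illustrative only):
# low overnight, elevated for eight afternoon hours, low again in the evening.
hourly_ozone = [20] * 10 + [60] * 8 + [20] * 6

print(daily_max_8h_mean(hourly_ozone))  # 60.0
```

Applied per grid cell to the hourly interpolated fields, the same function would give a map of daily maximum 8-hour concentrations.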
https://doi.org/10.11108/kagis.2022.25.3.074

A Natural Scene Statistics Based Publication Classification Algorithm Using Support Vector Machine (서포트 벡터 머신을 이용한 자연 연상 통계 기반 저작물 식별 알고리즘)

Song, Hyewon;Kim, Doyoung;Lee, Sanghoon
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.42 no.5
- /
- pp.959-966
- /
- 2017
Currently, the market for digital content such as e-books, cartoons, and webtoons is growing, but copyright infringement is a serious issue because such content is distributed through illegal channels. However, technologies for copyright protection are not yet sufficiently developed. Therefore, in this paper, we propose an NSS-based publication classification method for copyright protection. Using a histogram calculated from NSS, we propose a classification method for digital content using SVM. The proposed algorithm will be useful for copyright protection because it makes illegally distributed digital content easier to distinguish.
https://doi.org/10.7840/kics.2017.42.5.959
