• Title/Summary/Keyword: input variables

Search Result 1,791, Processing Time 0.023 seconds

An Empirical Comparison and Verification Study on the Seaport Clustering Measurement Using Meta-Frontier DEA and Integer Programming Models (메타프론티어 DEA모형과 정수계획모형을 이용한 항만클러스터링 측정에 대한 실증적 비교 및 검증연구)

  • Park, Ro-Kyung
    • Journal of Korea Port Economic Association
    • /
    • v.33 no.2
    • /
    • pp.53-82
    • /
    • 2017
  • The purpose of this study is to show the clustering trend and compare empirical results, as well as to choose the clustering ports for 3 Korean ports (Busan, Incheon, and Gwangyang) by using meta-frontier DEA (Data Envelopment Analysis) and integer models on 38 Asian container ports over the period 2005-2014. The models consider 4 input variables (birth length, depth, total area, and number of cranes) and 1 output variable (container TEU). The main empirical results of the study are as follows. First, the meta-frontier DEA for Chinese seaports identifies as most efficient ports (in decreasing order) Shanghai, Hongkong, Ningbo, Qingdao, and Guangzhou, while efficient Korean seaports are Busan, Incheon, and Gwangyang. Second, the clustering results of the integer model show that the Busan port should cluster with Dubai, Hongkong, Shanghai, Guangzhou, Ningbo, Qingdao, Singapore, and Kaosiung, while Incheon and Gwangyang should cluster with Shahid Rajaee, Haifa, Khor Fakkan, Tanjung Perak, Osaka, Keelong, and Bangkok ports. Third, clustering through the integer model sharply increases the group efficiency of Incheon (401.84%) and Gwangyang (354.25%), but not that of the Busan port. Fourth, the efficiency ranking comparison between the two models before and after the clustering using the Wilcoxon signed-rank test is matched with the average level of group efficiency (57.88 %) and the technology gap ratio (80.93%). The policy implication of this study is that Korean port policy planners should employ meta-frontier DEA, as well as integer models when clustering is needed among Asian container ports for enhancing the efficiency. In addition Korean seaport managers and port authorities should introduce port development and management plans accounting for the reference and clustered seaports after careful analysis.

Hydrogeochemical Characterization of Groundwater in Jeju Island using Principal Component Analysis and Geostatistics (주성분분석과 지구통계법을 이용한 제주도 지하수의 수리지화학 특성 연구)

  • Ko Kyung-Seok;Kim Yongie;Koh Dong-Chan;Lee Kwang-Sik;Lee Seung-Gu;Kang Cheol-Hee;Seong Hyun-Jeong;Park Won-Bae
    • Economic and Environmental Geology
    • /
    • v.38 no.4 s.173
    • /
    • pp.435-450
    • /
    • 2005
  • The purpose of the study is to analyze the hydrogeochemical characteristics by multivariate statistical method, to interpret the hydrogeochemical processes for the new variables calculated from principal components analysis (PCA), and to infer the groundwater flow and circulation mechanism by applying the geostatistical methods for each element and principal component. Chloride and nitrate are the most influencing components for groundwater quality, and the contents of $NO_3$ increased by the input of agricultural activities show the largest variation. The results of PCA, a multivariate statistical method, show that the first three principal components explain $73.9\%$ of the total variance. PC1 indicates the increase of dissolved ions, PC2 is related with the dissolution of carbonate minerals and nitrate contamination, and PC3 shows the effect of cation exchange process and silicate mineral dissolution. From the results of experimental semivariogram, the components of groundwater are divided into two groups: one group includes electrical conductivity (EC), Cl, Na, and $NO_3$, and the other includes $HCO_3,\;SiO_2,$ Ca, and Sr. The results for spatial distribution of groundwater components showed that EC, Cl, and Na increased with approaching the coastal line and nitrate has close relationship with the presence of agricultural land. These components are also correlated with the topographic features reflecting the groundwater recharge effect. The kriging analysis by using principal components shows that PC 1 has the different spatial distribution of Cl, Na, and EC, possibly due to the influence of pH, Ca, Sr, and $HCO_3$ for PC1. It was considered that the linear anomaly zone of PC2 in western area was caused by the dissolution of carbonate mineral. Consequently, the application of multivariate and geostatistical methods for groundwater in the study area is very useful for determining the quantitative analysis of water quality data and the characteristics of spatial distribution.

Bankruptcy Prediction Modeling Using Qualitative Information Based on Big Data Analytics (빅데이터 기반의 정성 정보를 활용한 부도 예측 모형 구축)

  • Jo, Nam-ok;Shin, Kyung-shik
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.2
    • /
    • pp.33-56
    • /
    • 2016
  • Many researchers have focused on developing bankruptcy prediction models using modeling techniques, such as statistical methods including multiple discriminant analysis (MDA) and logit analysis or artificial intelligence techniques containing artificial neural networks (ANN), decision trees, and support vector machines (SVM), to secure enhanced performance. Most of the bankruptcy prediction models in academic studies have used financial ratios as main input variables. The bankruptcy of firms is associated with firm's financial states and the external economic situation. However, the inclusion of qualitative information, such as the economic atmosphere, has not been actively discussed despite the fact that exploiting only financial ratios has some drawbacks. Accounting information, such as financial ratios, is based on past data, and it is usually determined one year before bankruptcy. Thus, a time lag exists between the point of closing financial statements and the point of credit evaluation. In addition, financial ratios do not contain environmental factors, such as external economic situations. Therefore, using only financial ratios may be insufficient in constructing a bankruptcy prediction model, because they essentially reflect past corporate internal accounting information while neglecting recent information. Thus, qualitative information must be added to the conventional bankruptcy prediction model to supplement accounting information. Due to the lack of an analytic mechanism for obtaining and processing qualitative information from various information sources, previous studies have only used qualitative information. However, recently, big data analytics, such as text mining techniques, have been drawing much attention in academia and industry, with an increasing amount of unstructured text data available on the web. A few previous studies have sought to adopt big data analytics in business prediction modeling. Nevertheless, the use of qualitative information on the web for business prediction modeling is still deemed to be in the primary stage, restricted to limited applications, such as stock prediction and movie revenue prediction applications. Thus, it is necessary to apply big data analytics techniques, such as text mining, to various business prediction problems, including credit risk evaluation. Analytic methods are required for processing qualitative information represented in unstructured text form due to the complexity of managing and processing unstructured text data. This study proposes a bankruptcy prediction model for Korean small- and medium-sized construction firms using both quantitative information, such as financial ratios, and qualitative information acquired from economic news articles. The performance of the proposed method depends on how well information types are transformed from qualitative into quantitative information that is suitable for incorporating into the bankruptcy prediction model. We employ big data analytics techniques, especially text mining, as a mechanism for processing qualitative information. The sentiment index is provided at the industry level by extracting from a large amount of text data to quantify the external economic atmosphere represented in the media. The proposed method involves keyword-based sentiment analysis using a domain-specific sentiment lexicon to extract sentiment from economic news articles. The generated sentiment lexicon is designed to represent sentiment for the construction business by considering the relationship between the occurring term and the actual situation with respect to the economic condition of the industry rather than the inherent semantics of the term. The experimental results proved that incorporating qualitative information based on big data analytics into the traditional bankruptcy prediction model based on accounting information is effective for enhancing the predictive performance. The sentiment variable extracted from economic news articles had an impact on corporate bankruptcy. In particular, a negative sentiment variable improved the accuracy of corporate bankruptcy prediction because the corporate bankruptcy of construction firms is sensitive to poor economic conditions. The bankruptcy prediction model using qualitative information based on big data analytics contributes to the field, in that it reflects not only relatively recent information but also environmental factors, such as external economic conditions.

An Empirical Comparative Study of the Seaport Clustering Measurement Using Bootstrapped DEA and Game Cross-efficiency Models (부트스트랩 DEA모형과 게임교차효율성모형을 이용한 항만클러스터링 측정에 대한 실증적 비교연구)

  • Park, Ro-Kyung
    • Journal of Korea Port Economic Association
    • /
    • v.32 no.1
    • /
    • pp.29-58
    • /
    • 2016
  • The purpose of this paper is to show the clustering trend and the comparison of empirical results and is to choose the clustering ports for 3 Korean ports(Busan, Incheon and Gwangyang Ports) by using the bootstrapped DEA(Data Envelopment Analysis) and game Cross-efficiency models for 38 Asian ports during the period 2003-2013 with 4 input variables(birth length, depth, total area, and number of cranes) and 1 output variable(container TEU). The main empirical results of this paper are as follows. First, bootstrapped DEA efficiency of SW and LT is 0.7660, 0.7341 respectively. Clustering results of the bootstrapped DEA analysis show that 3 Korean ports [ Busan (6.46%), Incheon (3.92%), and Gwangyang (2.78%)] can increase the efficiency in the SW model, but the LT model shows clustering values of -1.86%, -0.124%, and 2.11% for Busan, Gwangyang, and Incheon respectively. Second, the game cross-efficiency model suggests that Korean ports should be clustered with Hong Kong, Shanghi, Guangzhou, Ningbo, Port Klang, Singapore, Kaosiung, Keelong, and Bangkok ports. This clustering enhances the efficiency of Gwangyang by 0.131%, and decreases that of Busan by-1.08%, and that of Incheon by -0.009%. Third, the efficiency ranking comparison between the two models using the Wilcoxon Signed-rank Test was matched with the average level of SW (72.83 %) and LT (68.91%). The policy implication of this paper is that Korean port policy planners should introduce the bootstrapped DEA, and game cross-efficiency models when clustering is needed among Asian ports for enhancing the efficiency of inputs and outputs. Also, the results of SWOT(Strength, Weakness, Opportunity, and Threat) analysis among the clustering ports should be considered.

Development of a deep neural network model to estimate solar radiation using temperature and precipitation (온도와 강수를 이용하여 일별 일사량을 추정하기 위한 심층 신경망 모델 개발)

  • Kang, DaeGyoon;Hyun, Shinwoo;Kim, Kwang Soo
    • Korean Journal of Agricultural and Forest Meteorology
    • /
    • v.21 no.2
    • /
    • pp.85-96
    • /
    • 2019
  • Solar radiation is an important variable for estimation of energy balance and water cycle in natural and agricultural ecosystems. A deep neural network (DNN) model has been developed in order to estimate the daily global solar radiation. Temperature and precipitation, which would have wider availability from weather stations than other variables such as sunshine duration, were used as inputs to the DNN model. Five-fold cross-validation was applied to train and test the DNN models. Meteorological data at 15 weather stations were collected for a long term period, e.g., > 30 years in Korea. The DNN model obtained from the cross-validation had relatively small value of RMSE ($3.75MJ\;m^{-2}\;d^{-1}$) for estimates of the daily solar radiation at the weather station in Suwon. The DNN model explained about 68% of variation in observed solar radiation at the Suwon weather station. It was found that the measurements of solar radiation in 1985 and 1998 were considerably low for a small period of time compared with sunshine duration. This suggested that assessment of the quality for the observation data for solar radiation would be needed in further studies. When data for those years were excluded from the data analysis, the DNN model had slightly greater degree of agreement statistics. For example, the values of $R^2$ and RMSE were 0.72 and $3.55MJ\;m^{-2}\;d^{-1}$, respectively. Our results indicate that a DNN would be useful for the development a solar radiation estimation model using temperature and precipitation, which are usually available for downscaled scenario data for future climate conditions. Thus, such a DNN model would be useful for the impact assessment of climate change on crop production where solar radiation is used as a required input variable to a crop model.

International and domestic research trends in longitudinal connectivity evaluations of aquatic ecosystems, and the applicability analysis of fish-based models (수생태계 종적 연결성 평가를 위한 국내외 연구 현황 및 어류기반 종적 연속성 평가모델 적용성 분석)

  • Kim, Ji Yoon;Kim, Jai-Gu;Bae, Dae-Yeul;Kim, Hye-Jin;Kim, Jeong-Eun;Lee, Ho-Seong;Lim, Jun-Young;An, Kwang-Guk
    • Korean Journal of Environmental Biology
    • /
    • v.38 no.4
    • /
    • pp.634-649
    • /
    • 2020
  • Recently, stream longitudinal connectivity has been a topic of investigation due to the frequent disconnections and the impact of aquatic ecosystems caused by the construction of small and medium-sized weirs and various artificial structures (fishways) directly influencing the stream ecosystem health. In this study, the international and domestic research trends of the longitudinal connectivity in aquatic ecosystems were evaluated and the applicability of fish-based longitudinal connectivity models used in developed countries was analyzed. For these purposes, we analyzed the current status of research on longitudinal connectivity and structural problems, fish monitoring methodology, monitoring approaches, longitudinal disconnectivity of fish movement, and biodiversity. In addition, we analyzed the current status and some technical limitations of physical habitat suitability evaluation, ecology-based water flow, eco-hydrological modeling for fish habitat connectivity, and the s/w program development for agent-based model. Numerous references, data, and various reports were examined to identify worldwide longitudinal stream connectivity evaluation models in European and non-European countries. The international approaches to longitudinal connectivity evaluations were categorized into five phases including 1) an approach integrating fish community and artificial structure surveys (two types input variables), 2) field monitoring approaches, 3) a stream geomorphological approach, 4) an artificial structure-based DB analytical approach, and 5) other approaches. the overall evaluation of survey methodologies and applicability for longitudinal stream connectivity suggested that the ICE model (Information sur la Continuite Ecologique) and the ICF model (Index de Connectivitat Fluvial), widely used in European countries, were appropriate for the application of longitudinal connectivity evaluations in Korean streams.

Predicting the Pre-Harvest Sprouting Rate in Rice Using Machine Learning (기계학습을 이용한 벼 수발아율 예측)

  • Ban, Ho-Young;Jeong, Jae-Hyeok;Hwang, Woon-Ha;Lee, Hyeon-Seok;Yang, Seo-Yeong;Choi, Myong-Goo;Lee, Chung-Keun;Lee, Ji-U;Lee, Chae Young;Yun, Yeo-Tae;Han, Chae Min;Shin, Seo Ho;Lee, Seong-Tae
    • Korean Journal of Agricultural and Forest Meteorology
    • /
    • v.22 no.4
    • /
    • pp.239-249
    • /
    • 2020
  • Rice flour varieties have been developed to replace wheat, and consumption of rice flour has been encouraged. damage related to pre-harvest sprouting was occurring due to a weather disaster during the ripening period. Thus, it is necessary to develop pre-harvest sprouting rate prediction system to minimize damage for pre-harvest sprouting. Rice cultivation experiments from 20 17 to 20 19 were conducted with three rice flour varieties at six regions in Gangwon-do, Chungcheongbuk-do, and Gyeongsangbuk-do. Survey components were the heading date and pre-harvest sprouting at the harvest date. The weather data were collected daily mean temperature, relative humidity, and rainfall using Automated Synoptic Observing System (ASOS) with the same region name. Gradient Boosting Machine (GBM) which is a machine learning model, was used to predict the pre-harvest sprouting rate, and the training input variables were mean temperature, relative humidity, and total rainfall. Also, the experiment for the period from days after the heading date (DAH) to the subsequent period (DA2H) was conducted to establish the period related to pre-harvest sprouting. The data were divided into training-set and vali-set for calibration of period related to pre-harvest sprouting, and test-set for validation. The result for training-set and vali-set showed the highest score for a period of 22 DAH and 24 DA2H. The result for test-set tended to overpredict pre-harvest sprouting rate on a section smaller than 3.0 %. However, the result showed a high prediction performance (R2=0.76). Therefore, it is expected that the pre-harvest sprouting rate could be able to easily predict with weather components for a specific period using machine learning.

A Review on Ocean Acidification and Factors Affecting It in Korean Waters (우리나라 주변 바다의 산성화 현황과 영향 요인 분석)

  • Kim, Tae-Wook;Kim, Dongseon;Park, Geun-Ha;Ko, Young Ho;Mo, Ahra
    • Journal of the Korean earth science society
    • /
    • v.43 no.1
    • /
    • pp.91-109
    • /
    • 2022
  • The ocean is a significant sink for atmospheric anthropogenic CO2, absorbing one-third of the total CO2 emitted by human activities. In return, oceans have experienced significant declines in seawater pH and the aragonite saturation state also called ocean acidification. This study evaluates the distribution of aragonite saturation state, an indicator to assess the potential threat from ocean acidification, by combining newly obtained data from the west coast of South Korea with previous datasets covering the Yellow Sea, East Sea, northern South China Sea, and southeast coast of South Korea. In general, offshore waters absorb atmospheric CO2; however, most of the collected water samples show aragonite oversaturation. On the southeast coast, the aragonite saturation state was significantly affected by river discharge and associated variables, such as freshwater input with nutrients, seasonal stratification, biological carbon fixation, and bacterial remineralization. In summer, hypoxia and mixing with relatively acidic freshwater made the Jinhae and Gwangyang Bays undersaturated with respect to aragonite, possibly threatening marine organisms with CaCO3 shells. However, widespread aragonite undersaturation was not observed on the west coast, which receives considerable river water discharge. In addition, occasional upwelling events may have worsened the ocean acidification in the southwestern part of the East Sea. These results highlight the importance of investigating site-specific ocean acidification processes in coastal waters. Along with the above-mentioned seasonal factors, the dissolution of atmospheric CO2 and the deposition of atmospheric acidic substances will continue to reduce the aragonite saturation state in Korean waters. To protect marine ecosystems and resources, an ocean acidification monitoring program should be established for Korean waters.

Rainfall image DB construction for rainfall intensity estimation from CCTV videos: focusing on experimental data in a climatic environment chamber (CCTV 영상 기반 강우강도 산정을 위한 실환경 실험 자료 중심 적정 강우 이미지 DB 구축 방법론 개발)

  • Byun, Jongyun;Jun, Changhyun;Kim, Hyeon-Joon;Lee, Jae Joon;Park, Hunil;Lee, Jinwook
    • Journal of Korea Water Resources Association
    • /
    • v.56 no.6
    • /
    • pp.403-417
    • /
    • 2023
  • In this research, a methodology was developed for constructing an appropriate rainfall image database for estimating rainfall intensity based on CCTV video. The database was constructed in the Large-Scale Climate Environment Chamber of the Korea Conformity Laboratories, which can control variables with high irregularity and variability in real environments. 1,728 scenarios were designed under five different experimental conditions. 36 scenarios and a total of 97,200 frames were selected. Rain streaks were extracted using the k-nearest neighbor algorithm by calculating the difference between each image and the background. To prevent overfitting, data with pixel values greater than set threshold, compared to the average pixel value for each image, were selected. The area with maximum pixel variability was determined by shifting with every 10 pixels and set as a representative area (180×180) for the original image. After re-transforming to 120×120 size as an input data for convolutional neural networks model, image augmentation was progressed under unified shooting conditions. 92% of the data showed within the 10% absolute range of PBIAS. It is clear that the final results in this study have the potential to enhance the accuracy and efficacy of existing real-world CCTV systems with transfer learning.

Analysis of Uncertainty in Ocean Color Products by Water Vapor Vertical Profile (수증기 연직 분포에 의한 GOCI-II 해색 산출물 오차 분석)

  • Kyeong-Sang Lee;Sujung Bae;Eunkyung Lee;Jae-Hyun Ahn
    • Korean Journal of Remote Sensing
    • /
    • v.39 no.6_2
    • /
    • pp.1591-1604
    • /
    • 2023
  • In ocean color remote sensing, atmospheric correction is a vital process for ensuring the accuracy and reliability of ocean color products. Furthermore, in recent years, the remote sensing community has intensified its requirements for understanding errors in satellite data. Accordingly, research is currently addressing errors in remote sensing reflectance (Rrs) resulting from inaccuracies in meteorological variables (total ozone, pressure, wind field, and total precipitable water) used as auxiliary data for atmospheric correction. However, there has been no investigation into the error in Rrs caused by the variability of the water vapor profile, despite it being a recognized error source. In this study, we used the Second Simulation of a Satellite Signal Vector version 2.1 simulation to compute errors in water vapor transmittance arising from variations in the water vapor profile within the GOCI-II observation area. Subsequently, we conducted an analysis of the associated errors in ocean color products. The observed water vapor profile not only exhibited a complex shape but also showed significant variations near the surface, leading to differences of up to 0.007 compared to the US standard 62 water vapor profile used in the GOCI-II atmospheric correction. The resulting variation in water vapor transmittance led to a difference in aerosol reflectance estimation, consequently introducing errors in Rrs across all GOCI-II bands. However, the error of Rrs in the 412-555 nm due to the difference in the water vapor profile band was found to be below 2%, which is lower than the required accuracy. Also, similar errors were shown in other ocean color products such as chlorophyll-a concentration, colored dissolved organic matter, and total suspended matter concentration. The results of this study indicate that the variability in water vapor profiles has minimal impact on the accuracy of atmospheric correction and ocean color products. Therefore, improving the accuracy of the input data related to the water vapor column concentration is even more critical for enhancing the accuracy of ocean color products in terms of water vapor absorption correction.