• Title/Summary/Keyword: 결측자료 보완

Search Result 55, Processing Time 0.024 seconds

Daily Runoff Simulation and Analysis Using Rainfall-Runoff Model on Nakdong River (강우-유출모형에 의한 낙동강수계 일유출모의와 분석)

  • Maeng Sung Jin;Lee Soon Hyuk;Ryoo Kyoung Sik;Song Gi Heon
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2005.05b
    • /
    • pp.619-622
    • /
    • 2005
  • 적용대상 유역은 낙동강수계로 하였으며 소유역 분할은 총 25개로 하였으며, 강우관측소의 선정과 Thiessen 계수의 산정은 최근에 한국수자원공사에서 새로 추가한 강우관측소를 위주로 대상 연도별로 달리하여 강우관측소를 선정하였다. 강우자료의 결측치는 RDS 방법을 사용하여 보완하였다. 대상연도별 소유역별로 일간 유역 평균 강우량을 산정하였다. 적용 모형의 선정은 한국수자원공사 실무부서에서의 적용사례가 빈번한 SSARR 모형을 최종적으로 선정하였다. SSARR 모형의 입력자료를 물리적 매개변수, 수문기상 매개변수 및 내부처리 매개변수로 구분하여 구축하였고 매개변수의 민감도분석과 함께 모형의 보정을 실시하였다. 민감도 분석 결과, 유역유출과 관련된 매개변수에서는 고수시와 저수시의 경우 지표수와 복류수의 분리하는 매개변수에서 민감도가 크게 나타났다. 저수시의 경우 지하수 중 회귀지하수가 차지하는 비율이 크게 나타났고, 지표수, 복류수, 지하수 및 회귀지하수의 저류시간에서 비교적 큰 민감도를 나타내었다. 1983년부터 2003년까지 21개년에 걸쳐 25개 소유역별로 일평균 자연유출량을 산정하여 이를 이용한 반순, 순, 월 및 연평균 자연유출량을 산정하였다.

  • PDF

지점우량 자료의 분포형 설정과 내용안전년수에 따르는 확률강우량에 관한 고찰 - 국내 3개지점 서울, 부산 및 대구를 중심으로 -

  • Lee, Won-Hwan;Lee, Gil-Chun;Jeong, Yeon-Gyu
    • Water for future
    • /
    • v.5 no.1
    • /
    • pp.27-36
    • /
    • 1972
  • This thesis is the study of the rainfall probability depth in the major areas of Korea, such as Seoul, Pusan and Taegu. The purpose of the paper is to analyze the rainfall in connection with the safe planning of the hydraulic structures and with the project life. The methodology used in this paper is the statistical treatment of the rainfall data in the above three areas. The scheme of the paper is the following. 1. The complementation of the rainfall data We tried to select the maximm values among the values gained by the three methods: Fourier Series Method, Trend Diagram Method and Mean Value Method. By the selection of the maximum values we tried to complement the rainfall data lacking in order to prevent calamities. 2. The statistical treatment of the data The data are ordered by the small numbers, transformed into log, $\sqrt{}, \sqrt[3]{}, \sqrt[4], and$\sqrt[5], and calculated their statistical values through the electronic computer. 3. The examination of the distribution types and the determination of the optimum distibution types By the $x^2-Test$ the distribution types of rainfall data are examined, and rejected some part of the data in order to seek the normal rainfall distribution types. In this way, the optimum distribution types are determined. 4. The computation of rainfall probability depth in the safety project life We tried to study the interrelation between the return period and the safety project life, and to present the rainfall probability depth of the safety project life. In conclusion we set up the optimum distribution types of the rainfall depths, formulated the optimum distributions, and presented the chart of the rainfall probability depth about the factor of safety and the project life.ct life.

  • PDF

A Model for Groundwater Time-series from the Well Field of Riverbank Filtration (강변여과 취수정 주변 지하수위를 위한 시계열 모형)

  • Lee, Sang-Il;Lee, Sang-Ki;Hamm, Se-Yeong
    • Journal of Korea Water Resources Association
    • /
    • v.42 no.8
    • /
    • pp.673-680
    • /
    • 2009
  • Alternatives to conventional water resources are being sought due to the scarcity and the poor quality of surface water. Riverbank filtration (RBF) is one of them and considered as a promising source of water supply in some cities. Changwon City has started RBF in 2001 and field data have been accumulated. This study is to develop a time-series model for groundwater level data collected from the pumping area of RBF. The site is Daesan-myeon, Changwon City, where groundwater level data have been measured for the last five years (Jan. 2003$\sim$Dec. 2007). Minute-based groundwater levels was averaged out to monthly data to see the long-term behavior. Time-series analysis was conducted according to the Box-Jenkins method. The resulted model turned out to be a seasonal ARIMA model, and its forecasting performance was satisfactory. We believe this study will provide a prototype for other riverbank filtration sites where the predictability of groundwater level is essential for the reliable supply of water.

Korean women wage analysis using selection models (표본 선택 모형을 이용한 국내 여성 임금 데이터 분석)

  • Jeong, Mi Ryang;Kim, Mijeong
    • Journal of the Korean Data and Information Science Society
    • /
    • v.28 no.5
    • /
    • pp.1077-1085
    • /
    • 2017
  • In this study, we have found the major factors which affect Korean women's wage analysing the data provided by 2015 Korea Labor Panel Survey (KLIPS). In general, wage data is difficult to analyze because random sampling is infeasible. Heckman sample selection model is the most widely used method for analysing the data with sample selection. Heckman proposed two kinds of selection models: the one is the model with maximum likelihood method and the other is the Heckman two stage model. Heckman two stage model is known to be robust to the normal assumption of bivariate error terms. Recently, Marchenko and Genton (2012) proposed the Heckman selectiont model which generalizes the Heckman two stage model and concluded that Heckman selection-t model is more robust to the error assumptions. Employing the two models, we carried out the analysis of the data and we compared those results.

A point-scale gap filling of the flux-tower data using the artificial neural network (인공신경망 기법을 이용한 청미천 유역 Flux tower 결측치 보정)

  • Jeon, Hyunho;Baik, Jongjin;Lee, Seulchan;Choi, Minha
    • Journal of Korea Water Resources Association
    • /
    • v.53 no.11
    • /
    • pp.929-938
    • /
    • 2020
  • In this study, we estimated missing evapotranspiration (ET) data at a eddy-covariance flux tower in the Cheongmicheon farmland site using the Artificial Neural Network (ANN). The ANN showed excellent performance in numerical analysis and is expanding in various fields. To evaluate the performance the ANN-based gap-filling, ET was calculated using the existing gap-filling methods of Mean Diagnostic Variation (MDV) and Food and Aggregation Organization Penman-Monteith (FAO-PM). Then ET was evaluated by time series method and statistical analysis (coefficient of determination, index of agreement (IOA), root mean squared error (RMSE) and mean absolute error (MAE). For the validation of each gap-filling model, we used 30 minutes of data in 2015. Of the 121 missing values, the ANN method showed the best performance by supplementing 70, 53 and 84 missing values, respectively, in the order of MDV, FAO-PM, and ANN methods. Analysis of the coefficient of determination (MDV, FAO-PM, and ANN methods followed by 0.673, 0.784, and 0.841, respectively.) and the IOA (The MDV, FAO-PM, and ANN methods followed by 0.899, 0.890, and 0.951 respectively.) indicated that, all three methods were highly correlated and considered to be fully utilized, and among them, ANN models showed the highest performance and suitability. Based on this study, it could be used more appropriately in the study of gap-filling method of flux tower data using machine learning method.

Linear interpolation and Machine Learning Methods for Gas Leakage Prediction Base on Multi-source Data Integration (다중소스 데이터 융합 기반의 가스 누출 예측을 위한 선형 보간 및 머신러닝 기법)

  • Dashdondov, Khongorzul;Jo, Kyuri;Kim, Mi-Hye
    • Journal of the Korea Convergence Society
    • /
    • v.13 no.3
    • /
    • pp.33-41
    • /
    • 2022
  • In this article, we proposed to predict natural gas (NG) leakage levels through feature selection based on a factor analysis (FA) of the integrating the Korean Meteorological Agency data and natural gas leakage data for considering complex factors. The paper has been divided into three modules. First, we filled missing data based on the linear interpolation method on the integrated data set, and selected essential features using FA with OrdinalEncoder (OE)-based normalization. The dataset is labeled by K-means clustering. The final module uses four algorithms, K-nearest neighbors (KNN), decision tree (DT), random forest (RF), Naive Bayes (NB), to predict gas leakage levels. The proposed method is evaluated by the accuracy, area under the ROC curve (AUC), and mean standard error (MSE). The test results indicate that the OrdinalEncoder-Factor analysis (OE-F)-based classification method has improved successfully. Moreover, OE-F-based KNN (OE-F-KNN) showed the best performance by giving 95.20% accuracy, an AUC of 96.13%, and an MSE of 0.031.

Imputation of missing precipitation data using machine learning algorithms (머신러닝 알고리즘을 이용한 결측 강우 데이터 추정에 관한 연구)

  • Heechan Han
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2023.05a
    • /
    • pp.320-320
    • /
    • 2023
  • 강우 데이터는 수문기상, 환경, 농업, 자연재해, 그리고 수자원 시스템 분야에서 가장 필수적인 기본 요소 중 하나이다. 또한 강우 데이터는 수문학적 분석에서 활용되는 필수 입력 자료 중 하나로 관측 데이터의 품질에 따라 수문 모형을 이용한 모의 결과물의 정확도가 결정된다고 할 수 있다. 따라서, 강우 관측소별로 강우 데이터의 품질을 어떻게 관리하느냐에 따라 수문 모형의 활용 범위 및 수자원 관리의 효율성이 결정될 수 있다. 강우의 시공간적 변동성은 수 많은 인자들과 직간접적으로 연계되어 있기 때문에 미계측 강우 자료에 대해 직접 관측이 아닌 수치 모형을 이용하여 강우의 발생과 강우량을 산정하는 것은 매우 복잡한 과제 중 하나이다. 현재 국내에서 운용되고 있는 강우 관측소의 경우에도 미계측 된 강우 데이터가 존재함으로써 강우 데이터의 활용에 제한이 생기는 경우가 있다. 따라서, 이러한 미계측 데이터의 추정 및 보완은 보다 효과적인 수재해 방지, 수자원 관리를 위한 필수 과제 중 하나이다. 일반적으로, 미계측 강우를 산정하기 위해서 Kriging, Thiessen, 등우선법, 그리고 역거리 관측법 등 다양한 수문학적 방법들이 적용되고 있다. 이러한 방법들은 산악효과나 강우 관측소의 분포 상태 등을 고려하지 못하기 때문에 측정하는 지역에 따라 강우 추정 오차가 커질 수 있다는 한계가 있다. 최근에는 데이터 관측 시스템과 빅데이터 기술의 발전과 활용 가능한 데이터의 양이 증가함에 따라 머신러닝을 활용한 사례가 증가하고 있다. 머신러닝은 데이터 사이의 관계를 기반으로 분류, 회귀, 그리고 예측 문제에 주로 사용되는 기법 중 하나이다. 따라서, 본 연구에서는 광주광역시 지역에 위치한 주요 강우 관측 지점들을 대상으로 미계측 된 시강우 데이터를 추정 및 복원하고자 한다. 여기서 데이터 추정 기술이란 미계측 강우의 발생 유무 및 강우량을 추정할 수 있는 기술을 의미한다. 이를 위해 대표적인 머신러닝 알고리즘인 인공신경망(Artificial Neural Network) 및 랜덤포레스트(Random Forest)를 적용하였다.

  • PDF

A Hydrological Study on Rainfall Frequency Atlas in Korea (한국 확률강우량도 작성을 위한 수문학적 연구)

  • 이원환
    • Water for future
    • /
    • v.10 no.1
    • /
    • pp.39-51
    • /
    • 1977
  • The purpose of this study is to make "The Rainfall Frequency Atlas in Korea" by the analytical method with new hydrological concepts. In this study, all of the rainfall datas in Korea was used for surveying of the basic data, and so we can get 103 sites for annual rainfall data and 100 sites for the max. in a day that are suitable to the purpose of the study. The above data groups are possible to estimate the normal standard period by the moving average method with $\pm$5% of significance level of variance ratio between the max. and min. moving average and arithmetic mean, but it may be impossible to study until 1990's for the short duration under 18-hr because the sites, having the short duration data, are only 12. The results of this study are as follows; 1. The normal standard period estimated by the moving average method is 20 year with $\pm$5% of significance level of variance ratio, and 30 year with$\pm$2-3%. 2. For the annual and max. rainfall in a day, it is possible to make the rainfall frequency atlas with 30normal standard period, but it may be impossible until 1990's for short duration. 3. "Y-k method" developed by writer is best suitable in the rainfal frequency analysis in Korea because of its convenience and reduction in the amount of calculation compared with other methods. 4. To improve the utilization of the rainfall frequency atlas, the larger-sized and the more detailed iso-precipitation atlas must be drawn.atlas must be drawn.

  • PDF

Study on the Retreatment Techniques for NOAA Sea Surface Temperature Imagery (NOAA 수온영상 재처리 기법에 관한 연구)

  • Kim, Sang-Woo;Kang, Yong-Q.;Ahn, Ji-Sook
    • Journal of the Korean Society of Marine Environment & Safety
    • /
    • v.17 no.4
    • /
    • pp.331-337
    • /
    • 2011
  • We described for the production of cloud-free satellite sea surface temperature(SST) data around Northeast Asian using NOAA AVHRR(Advanced Very High Resolution Radiometer) SST data during 1990-2005. As a result of Markov model, it was found that the value of Markov coefficient in the strong current region such as Kuroshio region showed smaller than that in the weak current. The variations of average SST and regional difference of seasonal day-to-day SST in spring and fall were larger than those in summer and winter. In particular, the distribution of the regional difference appeared large in the vicinity of continental in spring and fall. The difference of seasonal day-to-day SST was also small in Kuroshio region and southern part of East Sea due to the heat advection by warm currents.

Traffic Safety Countermeasures According to the Accident Area Patterns and Impact Factor Analysis of the Large-scale Traffic Accident Locations (대형 교통사고 발생지점 유형화와 영향요인 분석에 따른 교통안전대책 방안에 관한 연구)

  • Kim, Bong-Gi;Jeong, Heon-Yeong;Go, Sang-Seon
    • Journal of Korean Society of Transportation
    • /
    • v.24 no.1 s.87
    • /
    • pp.39-52
    • /
    • 2006
  • This study divided the large-scale traffic accident locations into its own characteristics by using Cluster Analysis. Also, Quantification II and Classification and Regression Tree methods were used enabling evaluation for the amount of affecting influence by the crash type. After these analyses, we tested the fitness of the results and suggested the simplification of the quantification index. With the results from the discussed procedure, obvious differences were observed by groups according to the characteristics of crash type from the Discrimination and Classification analysis of divided four groups. Thus, measures and supplementary measures for the traffic accidents could be suggested in groups systematically. However, a lot of missing values in variables caused a huge loss of data and made this study difficult for more detailed analysis, With this difficulty. recording mandatory log files with a standardized format is also recommended to Prevent this Problem in advance.