• Title/Summary/Keyword: Estimation techniques

Search Result 1,501, Processing Time 0.027 seconds

Estimation of CO2 Net Atmospheric Flux in the Middle and Lower Nakdong River, and Influence Factors Analysis (낙동강 중하류에서 이산화탄소 순배출 플럭스 산정 및 영향인자 분석)

  • Lee, Eunju;Chung, Sewoong;Park, Hyungseok;Kim, Sungjin;Park, Daeyeon
    • Journal of Korean Society on Water Environment
    • /
    • v.35 no.4
    • /
    • pp.316-331
    • /
    • 2019
  • Carbon dioxide($CO_2$) emission from rivers to the atmosphere is a key component in the global carbon cycle. Most of the rivers are supersaturated with $CO_2$. At a global scale, the amount of $CO_2$ emission from rivers is reported to be five-fold greater than that from lakes and reservoirs, but relevant data are rare in Korea. The objectives of this study is to estimate the $CO_2$ net atmospheric flux(NAF) from the upstream of Gangjeong-Goryeong Weir(GGW), Dalseong Weir(DSW), Hapcheon-Changnyeong Weir(HCW), and Changnyeong-Haman Weir(CHW) located in Nakdong River South Korea) using field and laboratory experiments and to apply data mining techniques to develop parsimonious prediction models that can be used to estimate $CO_2$ NAF with physical and water quality variables that can be collected easily. As a result, the study sites were all heterotrophic systems that often released $CO_2$ to the atmosphere, except when the algal photosynthesis was active.The median $CO_2$ NAF was minimum $391.5mg-CO_2/m^2$ day at GGW and maximum $1472.7mg-CO_2/m^2$ day at DSW. The $CO_2$ NAF showed a negative correlation with pH and Chl-a since the overgrowth of the algae consumed $CO_2$ in the water and increased the pH. As the parsimonious multiple regression model and random forest model developed, this study showed an excellent performance with the $Adj.R^2$ value higher than 0.77 in all weirs. Thus, these methods can be used to estimate $CO_2$ NAF in the river even if there is no $pCO_2$ measurement data.

A comparison of imputation methods using nonlinear models (비선형 모델을 이용한 결측 대체 방법 비교)

  • Kim, Hyein;Song, Juwon
    • The Korean Journal of Applied Statistics
    • /
    • v.32 no.4
    • /
    • pp.543-559
    • /
    • 2019
  • Data often include missing values due to various reasons. If the missing data mechanism is not MCAR, analysis based on fully observed cases may an estimation cause bias and decrease the precision of the estimate since partially observed cases are excluded. Especially when data include many variables, missing values cause more serious problems. Many imputation techniques are suggested to overcome this difficulty. However, imputation methods using parametric models may not fit well with real data which do not satisfy model assumptions. In this study, we review imputation methods using nonlinear models such as kernel, resampling, and spline methods which are robust on model assumptions. In addition, we suggest utilizing imputation classes to improve imputation accuracy or adding random errors to correctly estimate the variance of the estimates in nonlinear imputation models. Performances of imputation methods using nonlinear models are compared under various simulated data settings. Simulation results indicate that the performances of imputation methods are different as data settings change. However, imputation based on the kernel regression or the penalized spline performs better in most situations. Utilizing imputation classes or adding random errors improves the performance of imputation methods using nonlinear models.

Water resources potential assessment of ungauged catchments in Lake Tana Basin, Ethiopia

  • Damtew, Getachew Tegegne;Kim, Young-Oh
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2015.05a
    • /
    • pp.217-217
    • /
    • 2015
  • The objective of this study was mainly to evaluate the water resources potential of Lake Tana Basin (LTB) by using Soil and Water Assessment Tool (SWAT). From SWAT simulation of LTB, about 5236 km2 area of LTB is gauged watershed and the remaining 9878 km2 area is ungauged watershed. For calibration of model parameters, four gauged stations were considered namely: Gilgel Abay, Gummera, Rib, and Megech. The SWAT-CUP built-in techniques, particle swarm optimization (PSO) and generalized likelihood uncertainty estimation (GLUE) method was used for calibration of model parameters and PSO method were selected for the study based on its performance results in four gauging stations. However the level of sensitivity of flow parameters differ from catchment to catchment, the curve number (CN2) has been found the most sensitive parameters in all gauged catchments. To facilitate the transfer of data from gauged catchments to ungauged catchments, clustering of hydrologic response units (HRUs) were done based on physical similarity measured between gauged and ungauged catchment attributes. From SWAT land use/ soil use/slope reclassification of LTB, a total of 142 HRUs were identified and these HRUs are clustered in to 39 similar hydrologic groups. In order to transfer the optimized model parameters from gauged to ungauged catchments based on these clustered hydrologic groups, this study evaluates three parameter transfer schemes: parameters transfer based on homogeneous regions (PT-I), parameter transfer based on global averaging (PT-II), and parameter transfer by considering Gilgel Abay catchment as a representative catchment (PT-III) since its model performance values are better than the other three gauged catchments. The performance of these parameter transfer approach was evaluated based on values of Nash-Sutcliffe efficiency (NSE) and coefficient of determination (R2). The computed NSE values was found to be 0.71, 0.58, and 0.31 for PT-I, PT-II and PT-III respectively and the computed R2 values was found to be 0.93, 0.82, and 0.95 for PT-I, PT-II, and PT-III respectively. Based on the performance evaluation criteria, PT-I were selected for modelling ungauged catchments by transferring optimized model parameters from gauged catchment. From the model result, yearly average stream flow for all homogeneous regions was found 29.54 m3/s, 112.92 m3/s, and 130.10 m3/s for time period (1989 - 2005) for region-I, region-II, and region-III respectively.

  • PDF

A Study on the Variation of Water Quality and the Evaluation of Target Water Quality Using LDC in Major Tributaries of Nakdong River Basin (낙동강수계 주요 지류의 수질특성변화 및 LDC를 이용한 목표수질 평가에 관한 연구)

  • Lee, Sangsoo;Kang, Junmo;Park, Hyerim;Kang, Jeonghun;Kim, Shin;Kim, Jin-pil;Kim, Gyeonghoon
    • Journal of Korean Society on Water Environment
    • /
    • v.36 no.6
    • /
    • pp.521-534
    • /
    • 2020
  • In this study, the variation of water quality was analyzed for six sites in major tributaries of the Nakdong River Basin. Standard-FDC (Flow Duration Curve) was developed using PM (Percentile Method), one of the statistical FDC estimation methods. The LDC (Load Duration Curve) was obtained using the developed FDC. The current method and the LDC evaluation method were compared and analyzed to evaluate the achievement of TWQ (Target Water Quality). Regarding the monthly flow rate variation, the five sites showed the distribution of the lowest flow rate between May and June, indicating a high probability of dry weathering of the streams. The variation of water quality confirmed the vulnerable timing of flow rate in each site, and it is therefore deemed necessary to plan to reduce T-P and TOC. A comparison and evaluation of TWQ showed that there was a difference between the TWQ values achieved by the two techniques. In addition, the margin ratio to the 50% excess ratio can be found in the LDC evaluation. The results of the LDC evaluation by section and by month showed whether or not the water quality was exceeded by flow conditions, along with the vulnerable sections and timing. Accordingly, it is judged that this method can be used for water quality management in TMDLs (Total Maximum Daily Loads).

A Study on the Method of Computing Standard Wartime Maintenance Man-Hour Incorporating Wartime Maintenance Condition (전장 정비환경을 고려한 전시 표준정비인시 산출방안 연구)

  • Kim, Min-Hyuk
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.22 no.6
    • /
    • pp.477-483
    • /
    • 2021
  • In a military maintenance system, the standard maintenance man-hour of weapon systems is a tool to estimate the maintenance capabilities of maintenance units, provide standards for determining the maintenance needs and workload, and provide basic data for establishing a maintenance plan. The standard maintenance man-hours of major weapon systems have already been derived and used, but the standard maintenance man-hour in a wartime maintenance environment has not been computed. Therefore, the standard wartime maintenance man-hours need to be derived and This study proposes a process and method of computing the maintenance man-hours. In addition, this work suggests the criteria of collecting and screening data that is necessary for estimating the standard maintenance man-hours and introduces a methodology for analyzing the characteristics of maintenance man-hour distribution in the process. The proposed process first designs a model that reflects the wartime maintenance environment, selects statistical techniques, collects maintenance data, analyzes the descriptive statistics, estimates the distribution, and finally presents representative values of maintenance man-hour. Based on the proposed method, the standard wartime maintenance man-hours of the four weapon systems were calculated, and the distribution of the maintenance man-hours was analyzed to follow a lognormal distribution, and the method presented reliable results.

Estimation of regional flow duration curve applicable to ungauged areas using machine learning technique (머신러닝 기법을 이용한 미계측 유역에 적용 가능한 지역화 유황곡선 산정)

  • Jeung, Se Jin;Lee, Seung Pil;Kim, Byung Sik
    • Journal of Korea Water Resources Association
    • /
    • v.54 no.spc1
    • /
    • pp.1183-1193
    • /
    • 2021
  • Low flow affects various fields such as river water supply management and planning, and irrigation water. A sufficient period of flow data is required to calculate the Flow Duration Curve. However, in order to calculate the Flow Duration Curve, it is essential to secure flow data for more than 30 years. However, in the case of rivers below the national river unit, there is no long-term flow data or there are observed data missing for a certain period in the middle, so there is a limit to calculating the Flow Duration Curve for each river. In the past, statistical-based methods such as Multiple Regression Analysis and ARIMA models were used to predict sulfur in the unmeasured watershed, but recently, the demand for machine learning and deep learning models is increasing. Therefore, in this study, we present the DNN technique, which is a machine learning technique that fits the latest paradigm. The DNN technique is a method that compensates for the shortcomings of the ANN technique, such as difficult to find optimal parameter values in the learning process and slow learning time. Therefore, in this study, the Flow Duration Curve applicable to the unmeasured watershed is calculated using the DNN model. First, the factors affecting the Flow Duration Curve were collected and statistically significant variables were selected through multicollinearity analysis between the factors, and input data were built into the machine learning model. The effectiveness of machine learning techniques was reviewed through statistical verification.

Model Inversion Attack: Analysis under Gray-box Scenario on Deep Learning based Face Recognition System

  • Khosravy, Mahdi;Nakamura, Kazuaki;Hirose, Yuki;Nitta, Naoko;Babaguchi, Noboru
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.15 no.3
    • /
    • pp.1100-1118
    • /
    • 2021
  • In a wide range of ML applications, the training data contains privacy-sensitive information that should be kept secure. Training the ML systems by privacy-sensitive data makes the ML model inherent to the data. As the structure of the model has been fine-tuned by training data, the model can be abused for accessing the data by the estimation in a reverse process called model inversion attack (MIA). Although, MIA has been applied to shallow neural network models of recognizers in literature and its threat in privacy violation has been approved, in the case of a deep learning (DL) model, its efficiency was under question. It was due to the complexity of a DL model structure, big number of DL model parameters, the huge size of training data, big number of registered users to a DL model and thereof big number of class labels. This research work first analyses the possibility of MIA on a deep learning model of a recognition system, namely a face recognizer. Second, despite the conventional MIA under the white box scenario of having partial access to the users' non-sensitive information in addition to the model structure, the MIA is implemented on a deep face recognition system by just having the model structure and parameters but not any user information. In this aspect, it is under a semi-white box scenario or in other words a gray-box scenario. The experimental results in targeting five registered users of a CNN-based face recognition system approve the possibility of regeneration of users' face images even for a deep model by MIA under a gray box scenario. Although, for some images the evaluation recognition score is low and the generated images are not easily recognizable, but for some other images the score is high and facial features of the targeted identities are observable. The objective and subjective evaluations demonstrate that privacy cyber-attack by MIA on a deep recognition system not only is feasible but also is a serious threat with increasing alert state in the future as there is considerable potential for integration more advanced ML techniques to MIA.

Estimation of Employment Creation Center considering Spatial Autocorrelation: A Case of Changwon City (공간자기상관을 고려한 고용창출중심지 추정: 창원시 사례를 중심으로)

  • JEONG, Ha-Yeong;LEE, Tai-Hun;HWANG, In-Sik
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.25 no.1
    • /
    • pp.77-100
    • /
    • 2022
  • In the era of low growth, many provincial cities are experiencing population decline and aging. Population decline phenomena such as reduction of productive manpower, reduction of finances, deterioration of quality of life, and collapse of the community base are occurring in a chain and are being pushed to the brink of extinction of the cities. This study aims to propose a methodology to objectively estimate the employment creation centers and setting the basic unit of industrial-centered zoning by applying spatial statistical techniques and GIS for the application of the compact city plan as an efficient spatial management policy in a city with a declining population. In details, based on reviewing previous studies on compact city, 'employment complex index(ECI)' were defined considering the number of workers, the number of settlers, and the area of development land, the employment creation center was estimated by applying the 'Local Moran's I' and 'Getis-Ord's Hot-Spot Analysis'. As a case study, changes in the four years of 2013, 2015, 2017, and 2019 were compared and analyzed for Changwon City. As a result, it was confirmed that the employment creation center is becoming compacted and polycentric, which is a significant result that reflects the actual situation well. This results provide the basic data for functional and institutional territorial governance for the regional revitalization platform, and provide meaningful information necessary for spatial policy decision-making, such as population reduction, regional gross domestic product, and public facility arrangement that can respond to energy savings, transportation plans, and medical and health plans.

Characteristics of Measurement Errors due to Reflective Sheet Targets - Surveying for Sejong VLBI IVP Estimation (반사 타겟의 관측 오차 특성 분석 - 세종 VLBI IVP 결합 측량)

  • Hong, Chang-Ki;Bae, Tae-Suk
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.40 no.4
    • /
    • pp.325-332
    • /
    • 2022
  • Determination of VLBI IVP (Very Long Baseline Interferometry Invariant Point) position with high accuracy is required to compute local tie vectors between the space geodetic techniques. In general, reflective targets are attached on VLBI antenna and slant distances, horizontal and vertical angles are measured from the pillars. Then, adjustment computation is performed by using the mathematical model which connects measurements and unknown parameters. This indicates that the accuracy of the estimated solutions is affected by the accuracy of the measurements. One of issues in local tie surveying, however, is that the reflective targets are not in favorable condition, that is, the reflective sheet target cannot be perfectly aligned to the instrument perpendicularly. Deviation from the line of sight of an instrument may cause different type of measurement errors. This inherent limitation may lead to incorrect stochastic modeling for the measurements in adjustment computation procedures. In this study, error characteristics by measurement types and pillars are analyzed, respectively. The analysis on the studentized residuals is performed after adjustment computation. The normality of the residuals is tested and then equal variance test between the measurement types are performed. The results show that there are differences in variance according to the measurement types. Differences in variance between distances and angle measurements are observed when F-test is performed for the measurements from each pillar. Therefore, more detailed stochastic modeling is required for optimal solutions, especially in local tie survey.

Estimation of the Input Wave Height of the Wave Generator for Regular Waves by Using Artificial Neural Networks and Gaussian Process Regression (인공신경망과 가우시안 과정 회귀에 의한 규칙파의 조파기 입력파고 추정)

  • Jung-Eun, Oh;Sang-Ho, Oh
    • Journal of Korean Society of Coastal and Ocean Engineers
    • /
    • v.34 no.6
    • /
    • pp.315-324
    • /
    • 2022
  • The experimental data obtained in a wave flume were analyzed using machine learning techniques to establish a model that predicts the input wave height of the wavemaker based on the waves that have experienced wave shoaling and to verify the performance of the established model. For this purpose, artificial neural network (NN), the most representative machine learning technique, and Gaussian process regression (GPR), one of the non-parametric regression analysis methods, were applied respectively. Then, the predictive performance of the two models was compared. The analysis was performed independently for the case of using all the data at once and for the case by classifying the data with a criterion related to the occurrence of wave breaking. When the data were not classified, the error between the input wave height at the wavemaker and the measured value was relatively large for both the NN and GPR models. On the other hand, if the data were divided into non-breaking and breaking conditions, the accuracy of predicting the input wave height was greatly improved. Among the two models, the overall performance of the GPR model was better than that of the NN model.