• 제목/요약/키워드: missing data estimation method

검색결과 88건 처리시간 0.024초

텍스쳐 감지를 이용한 화소값 기울기 필터 및 중간값 필터 기반의 비디오 시퀀스 디인터레이싱 (Intensity Gradient filter and Median Filter based Video Sequence Deinterlacing Using Texture Detection)

  • 강근화;구수일;정제창
    • 한국통신학회논문지
    • /
    • 제34권4C호
    • /
    • pp.371-379
    • /
    • 2009
  • 본 논문에서는 텍스쳐 감지를 이용한 화소값 기울기 필터 및 중간값 필터 기반의 비디오 시퀀스 디인터레이싱 알고리듬을 제안한다. 먼저 보간 할 픽셀의 주변 픽셀들을 이용하여 현재 보간 할 영역이 텍스쳐가 존재하는 영역인지 아니면 평탄한 영역인지를 판단한다. 제안하는 알고리듬에서는 보간 할 영역이 평탄한 영역으로 판단되면 중간값 필터를 이용하여 보간을 하고, 텍스쳐 영역으로 판단되면 화소값 기울기 필터를 이용하여 보간을 하게 된다. 그러므로 현재의 보간 할 영역은 두 개의 카테고리로 분류 할 수 있다. 제안하는 알고리듬은 상황에 맞게 적응적으로 보간을 수행하므로 좀 더 선명하고 정확한 영상을 얻을 수 있다. 그리고 여러 가지 CIF 동영상에 대한 실험 결과는 제안하는 알고리듬이 기존의 알고리듬 보다 객관적, 주관적으로 우수함을 보여준다.

데이터베이스 정규화 이론을 이용한 국민건강영양조사 중 다년도 식이조사 자료 정제 및 통합 (Data Cleaning and Integration of Multi-year Dietary Survey in the Korea National Health and Nutrition Examination Survey (KNHANES) using Database Normalization Theory)

  • 권남지;서지혜;이헌주
    • 한국환경보건학회지
    • /
    • 제43권4호
    • /
    • pp.298-306
    • /
    • 2017
  • Objectives: Since 1998, the Korea National Health and Nutrition Examination Survey (KNHANES) has been conducted in order to investigate the health and nutritional status of Koreans. The food intake data of individuals in the KNHANES has also been utilized as source dataset for risk assessment of chemicals via food. To improve the reliability of intake estimation and prevent missing data for less-responded foods, the structure of integrated long-standing datasets is significant. However, it is difficult to merge multi-year survey datasets due to ineffective cleaning processes for handling extensive numbers of codes for each food item along with changes in dietary habits over time. Therefore, this study aims at 1) cleaning the process of abnormal data 2) generation of integrated long-standing raw data, and 3) contributing to the production of consistent dietary exposure factors. Methods: Codebooks, the guideline book, and raw intake data from KNHANES V and VI were used for analysis. The violation of the primary key constraint and the $1^{st}-3rd$ normal form in relational database theory were tested for the codebook and the structure of the raw data, respectively. Afterwards, the cleaning process was executed for the raw data by using these integrated codes. Results: Duplication of key records and abnormality in table structures were observed. However, after adjusting according to the suggested method above, the codes were corrected and integrated codes were newly created. Finally, we were able to clean the raw data provided by respondents to the KNHANES survey. Conclusion: The results of this study will contribute to the integration of the multi-year datasets and help improve the data production system by clarifying, testing, and verifying the primary key, integrity of the code, and primitive data structure according to the database normalization theory in the national health data.

표준통계분류를 이용한 내수시장 규모 추정방법에 관한 연구 (A Study on an Estimation Method of Domestic Market Size by Using the Standard Statistical Classifications)

  • 유형선;서주환;전승표;서진이
    • 기술혁신학회지
    • /
    • 제18권3호
    • /
    • pp.387-415
    • /
    • 2015
  • 본 연구에서는 표준통계분류체계 간 연계를 통해 산업 혹은 제품의 내수 시장규모를 추정하는 방법을 제안하고 실질적 활용 가능성을 타진하였다. 이를 위해 KSIC 분류로 조사된 통계청의 광업 제조업 조사 결과와 HS 분류로 조사된 무역데이터를 통계청과 UN 통계처에서 제공하는 연계표를 활용하여 연계하였다. KSIC-ISIC-HS 간 통합연계표를 이용하여 국내시장규모를 분석하는데 있어 가장 큰 문제는 분류체계 간 중복 연결 문제인데, 본 연구에서는 각 품목별 출하액과 무역액 사이에 강한 상관관계가 있음을 활용하여 출하액의 상대적인 비중을 가중치로 중복 연결된 HS 무역액을 배분하는 방법을 제시하였다. 이를 이용하면 제조업 분야의 총 125개 모든 ISIC 품목별 국내시장규모를 분석하고 이를 바탕으로 미래의 단기 시장 규모를 예측할 수 있다. 본 연구에서 제시한 방법은 ISIC 분류보다 세분화 된 품목에 대한 분석의 한계, 제조업 이외의 분야에 대한 적용 한계, 출하액 결측치로 인한 오차 등의 한계가 있으나, 내수 시장규모 정보를 가장 객관적이고 신뢰성 있으며 지속적으로 활용 가능한 데이터를 이용하여 분석 제공할 수 있는 방법을 제시한 점에 본 연구의 의의가 있다.

자동차 와이퍼 피봇의 각속도 및 각가속도 측정 (Measuring Angular Speed and Angular Acceleration for Automotive Windshield Wiper Pivot)

  • 이병수
    • 한국자동차공학회논문집
    • /
    • 제13권4호
    • /
    • pp.58-65
    • /
    • 2005
  • A method measuring angular speed and estimating angular acceleration of an automotive wind shield wiper pivot with limited resources has been proposed. Limited resources refer to the fact that processes cannot be operated in real-time with a regular notebook running a Microsoft Windows. Also, they refer to the fact that data acquisition cards have only two general purpose counters as many generic cards do. An optical incremental encoder has been employed for measuring angular motion. To measure the angular speed of the pivot, periods for the encoder's output pulses have been measured as the speed is related to the reciprocal of the period. Since only information acquired from one counter channel is the magnitude of the angular speed, sign correction is necessary. Also the information for the exact time when a pivot passes left and right dead points is also missing and the situation is inherent to the hardware setup. To find out the zero-crossing time of the angular speed, a linear interpolation technique has been employed. Lastly, to overcome the imperfection of the mechanical encoders, the angular speed has been curve fitted to a spline. Angular acceleration can be obtained by a differentiation of the angular speed.

Numerical Model for Cerebrovascular Hemodynamics with Indocyanine Green Fluorescence Videoangiography

  • Hwayeong Cheon;Young-Je Son;Sung Bae Park;Pyoung-Seop Shim;Joo-Hiuk Son;Hee-Jin Yang
    • Journal of Korean Neurosurgical Society
    • /
    • 제66권4호
    • /
    • pp.382-392
    • /
    • 2023
  • Objective : The use of indocyanine green videoangiography (ICG-VA) to assess blood flow in the brain during cerebrovascular surgery has been increasing. Clinical studies on ICG-VA have predominantly focused on qualitative analysis. However, quantitative analysis numerical modelling for time profiling enables a more accurate evaluation of blood flow kinetics. In this study, we established a multiple exponential modified Gaussian (multi-EMG) model for quantitative ICG-VA to understand accurately the status of cerebral hemodynamics. Methods : We obtained clinical data of cerebral blood flow acquired the quantitative analysis ICG-VA during cerebrovascular surgery. Varied asymmetric peak functions were compared to find the most matching function form with clinical data by using a nonlinear regression algorithm. To verify the result of the nonlinear regression, the mode function was applied to various types of data. Results : The proposed multi-EMG model is well fitted to the clinical data. Because the primary parameters-growth and decay rates, and peak center and heights-of the model are characteristics of model function, they provide accurate reference values for assessing cerebral hemodynamics in various conditions. In addition, the primary parameters can be estimated on the curves with partially missed data. The accuracy of the model estimation was verified by a repeated curve fitting method using manipulation of missing data. Conclusion : The multi-EMG model can possibly serve as a universal model for cerebral hemodynamics in a comparison with other asymmetric peak functions. According to the results, the model can be helpful for clinical research assessment of cerebrovascular hemodynamics in a clinical setting.

영산호 운영을 위한 홍수예보모형의 개발(I) -나주지점의 홍수유출 추정- (River Flow Forecasting Model for the Youngsan Estuary Reservoir Operations(I) -Estimation Runof Hydrographs at Naju Station)

  • 박창언;박승우
    • 한국농공학회지
    • /
    • 제36권4호
    • /
    • pp.95-102
    • /
    • 1994
  • The series of the papers consist of three parts to describe the development, calibration, and applications of the flood forecasting models for the Youngsan Estuarine Dam located at the mouth of the Youngsan river. And this paper discusses the hydrologic model for inflow simulation at Naju station, which constitutes 64 percent of the drainage basin of 3521 .6km$^2$ in area. A simplified TANK model was formulated to simulate hourly runoff from rainfall And the model parameters were optirnized using historical storm data, and validated with the records. The results of this paper were summarized as follows. 1. The simplified TANK model was formulated to conceptualize the hourly rainfall-run-off relationships at a watershed with four tanks in series having five runoff outlets. The runoff from each outlet was assumed to be proportional to the storage exceeding a threshold value. And each tank was linked with a drainage hole from the upper one. 2. Fifteen storm events from four year records from 1984 to 1987 were selected for this study. They varied from 81 to 289rn'm The watershed averaged, hourly rainfall data were determined from those at fifteen raingaging stations using a Thiessen method. Some missing and unrealistic records at a few stations were estimated or replaced with the values determined using a reciprocal distance square method from abjacent ones. 3. An univariate scheme was adopted to calibrate the model parameters using historical records. Some of the calibrated parameters were statistically related to antecedent precipitation. And the model simulated the streamflow close to the observed, with the mean coefficient of determination of 0.94 for all storm events. 4. The simulated streamflow were in good agreement with the historical records for ungaged condition simulation runs. The mean coefficient of determination for the runs was 0.93, nearly the same as calibration runs. This may indicates that the model performs very well in flood forecasting situations for the watershed.

  • PDF

Modified parity space averaging approaches for online cross-calibration of redundant sensors in nuclear reactors

  • Kassim, Moath;Heo, Gyunyoung
    • Nuclear Engineering and Technology
    • /
    • 제50권4호
    • /
    • pp.589-598
    • /
    • 2018
  • To maintain safety and reliability of reactors, redundant sensors are usually used to measure critical variables and estimate their averaged time-dependency. Nonhealthy sensors can badly influence the estimation result of the process variable. Since online condition monitoring was introduced, the online cross-calibration method has been widely used to detect any anomaly of sensor readings among the redundant group. The cross-calibration method has four main averaging techniques: simple averaging, band averaging, weighted averaging, and parity space averaging (PSA). PSA is used to weigh redundant signals based on their error bounds and their band consistency. Using the consistency weighting factor (C), PSA assigns more weight to consistent signals that have shared bands, based on how many bands they share, and gives inconsistent signals of very low weight. In this article, three approaches are introduced for improving the PSA technique: the first is to add another consistency factor, so called trend consistency (TC), to include a consideration of the preserving of any characteristic edge that reflects the behavior of equipment/component measured by the process parameter; the second approach proposes replacing the error bound/accuracy based weighting factor ($W^a$) with a weighting factor based on the Euclidean distance ($W^d$), and the third approach proposes applying $W^d$, TC, and C, all together. Cold neutron source data sets of four redundant hydrogen pressure transmitters from a research reactor were used to perform the validation and verification. Results showed that the second and third modified approaches lead to reasonable improvement of the PSA technique. All approaches implemented in this study were similar in that they have the capability to (1) identify and isolate a drifted sensor that should undergo calibration, (2) identify a faulty sensor/s due to long and continuous missing data range, and (3) identify a healthy sensor.

도시하천 소배수구역의 결측 강우량 산정 방법 비교 (Comparison of Estimation Methods for the Missing Rainfall data in a Urban Sub-drainage Area)

  • 김충수;김형섭
    • 한국수자원학회:학술대회논문집
    • /
    • 한국수자원학회 2006년도 학술발표회 논문집
    • /
    • pp.701-705
    • /
    • 2006
  • 강우자료는 수문 모델링 작업에서 가장 기초적인 수문학적 입력자료로 시간과 공간에 따른 변동성이 크므로 규명하기 복잡한 수문현상 중의 하나이다. 산악지역이 많은 우리나라의 지형학적 특성과 태풍, 장마 및 특히, 최근의 게릴라성 집중호우 등으로 인하여 이러한 변동성이 더욱 커지고 있는 실정이다. 장기간 실측된 수문기상 기초 자료가 부족한 우리나라의 실정상 홍수예보 및 수공구조물 설계를 위해 정확한 강우량 자료의 취득이 선행돼야 한다. 따라서 적절한 장소에 수문관측소 설치 및 관리를 통해 양호한 강우량 자료를 획득해야 하지만, 현장 여건상 등의 이유로 미계측 및 결측, 이상자료가 발생하고 있다. 따라서 이러한 미계측 혹은 결측지점의 우량을 추정할 수 있는 방법을 비교, 분석하여 적절한 보정과정을 수행할 필요가 있다. 그간의 연구에서는 미계측 지점 혹은 산악지역에서의 점 강우량 보정방법에 대한 연구가 진행되었지만, 본 연구에서는 '도시홍수재해관리기술연구사업단'에서 운영 중인 도시하천 유역 특히 소배수구역에서의 결측 자료에 대해 여러 추정 방법을 비교, 분석하여 적절한 방안을 찾고자 한다. 이를 위하여 중랑천 유역의 3개 소배수 구역(월계1 배수구역, 군자 배수구역, 어린이대공원 배수구역)에 설치된 3개 우량관측소와 건설교통부 관할 우량관측소 2개소의 우량자료를 사용하였다. 본 연구에서는 결측치 보간을 위하여 널리 이용되고 있는 산술평균법(Arithmetic Average method), 역거리법(Reciprocal Distance Squared method), 거리고도비율법(Ratio of Distance and Elevation method), 인근관측소와의 관계식 이용, 크리깅방법(Simple Kriging method)을 비교, 검토 적용하였다. 중랑천 유역의 소배수구역을 대상으로 연중 발생하는 큰 호우사상에 대해 임의의 강우관측소를 결측지점으로 가정하고 주변의 강우관측소로부터 각각의 방법을 이용해 가중치들을 산정하여 결측지점의 강우량 값을 보정하고자 하였다. 또한 각각의 방법을 이용하여 얻어진 결과에 대해 실측값과 보정값의 오차정도를 평균절대오차법(Mean Absolute Error)과 제곱평균제곱근오차법(Root Mean Squared Error)에 의해 산정하여 보정 방법간의 효율성을 검토하고자 하였다.

  • PDF

Terra MODIS NDVI 및 LST 자료와 RNN-LSTM을 활용한 토양수분 산정 (RNN-LSTM Based Soil Moisture Estimation Using Terra MODIS NDVI and LST)

  • 장원진;이용관;이지완;김성준
    • 한국농공학회논문집
    • /
    • 제61권6호
    • /
    • pp.123-132
    • /
    • 2019
  • This study is to estimate the spatial soil moisture using Terra MODIS (Moderate Resolution Imaging Spectroradiometer) satellite data and machine learning technique. Using the 3 years (2015~2017) data of MODIS 16 days composite NDVI (Normalized Difference Vegetation Index) and daily Land Surface Temperature (LST), ground measured precipitation and sunshine hour of KMA (Korea Meteorological Administration), the RDA (Rural Development Administration) 10 cm~30 cm average TDR (Time Domain Reflectometry) measured soil moisture at 78 locations was tested. For daily analysis, the missing values of MODIS LST by clouds were interpolated by conditional merging method using KMA surface temperature observation data, and the 16 days NDVI was linearly interpolated to 1 day interval. By applying the RNN-LSTM (Recurrent Neural Network-Long Short Term Memory) artificial neural network model, 70% of the total period was trained and the rest 30% period was verified. The results showed that the coefficient of determination ($R^2$), Root Mean Square Error (RMSE), and Nash-Sutcliffe Efficiency were 0.78, 2.76%, and 0.75 respectively. In average, the clay soil moisture was estimated well comparing with the other soil types of silt, loam, and sand. This is because the clay has the intrinsic physical property for having narrow range of soil moisture variation between field capacity and wilting point.

에너지분야 온실가스 인벤토리의 불확도에 관한 연구: Tier 1 에러전파방법을 이용한 추정 (An Analysis of Uncertainties in Energy Category: Estimation by using Tier 1 Method)

  • 황인창;진상현
    • 자원ㆍ환경경제연구
    • /
    • 제23권2호
    • /
    • pp.249-280
    • /
    • 2014
  • IPCC는 국가별 온실가스 배출량이 얼마나 확실한 값인가를 보여줄 수 있는 불확도를 함께 보고하도록 규정하고 있다. 그렇지만 한국 정부는 IPCC 기본값을 그대로 적용하고 있는 수준에 불과하며, 그나마도 결측된 값들이 있어서 전체적인 불확도를 산정하지 못한 채 항목별 불확도만을 나열하고 있을 뿐이다. 이에 본 논문에서는 국가 온실가스 배출량의 85.3%를 차지하는 에너지분야를 대상으로 Tier 1 수준의 에러전파방법을 이용해서 온실가스 인벤토리의 불확도를 추정하고 있다. 분석결과 국내 에너지분야 온실가스 배출량의 불확도는 3.4%였으며, 이는 핀란드와 유사한 수치인 것으로 밝혀졌다. 그렇지만 온실가스별로는 이산화탄소의 불확도가 2.7%에 불과했지만, 메탄은 116%, 아산화질소는 473%에 달할 정도로 차이가 큰 것으로 나타났다. 따라서 본 논문에서는 한국 정부가 에너지분야의 불확도를 낮추려면 이산화탄소 보다는 메탄과 아산화질소를 대상으로 활동도뿐만 아니라 배출계수의 개선이 필요하다는 정책적 함의가 제시될 수 있었다. 결론적으로는 IPCC 기본값 대신에 신뢰도 높은 한국 고유의 배출계수를 개발하는 작업이 필요함을 제안하고 있다.