• Title/Summary/Keyword: 가중확률

Search Result 181, Processing Time 0.028 seconds

Weighting Effect on the Weighted Mean in Finite Population (유한모집단에서 가중평균에 포함된 가중치의 효과)

  • Kim, Kyu-Seong
    • Survey Research
    • /
    • v.7 no.2
    • /
    • pp.53-69
    • /
    • 2006
  • Weights can be made and imposed in both sample design stage and analysis stage in a sample survey. While in design stage weights are related with sample data acquisition quantities such as sample selection probability and response rate, in analysis stage weights are connected with external quantities, for instance population quantities and some auxiliary information. The final weight is the product of all weights in both stage. In the present paper, we focus on the weight in analysis stage and investigate the effect of such weights imposed on the weighted mean when estimating the population mean. We consider a finite population with a pair of fixed survey value and weight in each unit, and suppose equal selection probability designs. Under the condition we derive the formulas of the bias as well as mean square error of the weighted mean and show that the weighted mean is biased and the direction and amount of the bias can be explained by the correlation between survey variate and weight: if the correlation coefficient is positive, then the weighted mein over-estimates the population mean, on the other hand, if negative, then under-estimates. Also the magnitude of bias is getting larger when the correlation coefficient is getting greater. In addition to theoretical derivation about the weighted mean, we conduct a simulation study to show quantities of the bias and mean square errors numerically. In the simulation, nine weights having correlation coefficient with survey variate from -0.2 to 0.6 are generated and four sample sizes from 100 to 400 are considered and then biases and mean square errors are calculated in each case. As a result, in the case or 400 sample size and 0.55 correlation coefficient, the amount or squared bias of the weighted mean occupies up to 82% among mean square error, which says the weighted mean might be biased very seriously in some cases.

  • PDF

Derivation of Design Floods by the Probability Weighted Moments in the Wakeby Distribution (Wakeby 분포모형의 확률가중모멘트기법에 의한 설계홍수량 유도)

  • 이순혁;송기헌;맹승진;류경식;지호근
    • Magazine of the Korean Society of Agricultural Engineers
    • /
    • v.42 no.6
    • /
    • pp.63-71
    • /
    • 2000
  • The purpose of this study is to derive optimal design floods by the Wakeby distribution model using the probability weighted moments. Parameters for the Wakeby distribution were estimated by the probability weighted moments for the annual flood flows of the applied watersheds. Design floods obtained by the Wakeby and GEV distributions were compared by the relative mean errors, relative absolute errors and root mean square errors. In general, it has shown that the design floods by the Wakeby distribution using the methods of the probability weighted moments are closer to those of the observed data in comparison with those obtained by the GEV distribution.

  • PDF

Evaluation of Extreme Rainfall based on Typhoon using Nonparametric Monte Carlo Simulation and Locally Weighted Polynomial Regression (비매개변수적 모의발생기법과 지역가중다항식을 이용한 태풍의 극치강우량 평가)

  • Oh, Tae-Suk;Moon, Young-Il;Chun, Si-Young;Kwon, Hyun-Han
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.29 no.2B
    • /
    • pp.193-205
    • /
    • 2009
  • Typhoons occurred in the tropical Pacific region, these might be affected the Korea moving toward north. The strong winds and the heavy rains by the typhoons caused a natural disaster in Korea. In the research, the heavy rainfall events based on typhoons were evaluated quantitative through various statistical techniques. First, probability precipitation and typhoon probability precipitation were compared using frequency analysis. Second, EST probability precipitation was calculated by Empirical Simulation Techniques (EST). Third, NL probability precipitation was estimated by coupled Nonparametric monte carlo simulation and Locally weighted polynomial regression. At the analysis results, the typhoons can be effected Gangneung and Mokpo stations more than other stations. Conversely, the typhoons can be effected Seoul and Inchen stations less than other stations. Also, EST and NL probability precipitation were estimated by the long-term simulation using observed data. Consequently, major hydrologic structures and regions where received the big typhoons impact should be review necessary. Also, EST and NL techniques can be used for climate change by the global warming. Because, these techniques used the relationship between the heavy rainfall events and the typhoons characteristics.

A comparison study of inverse censoring probability weighting in censored regression (중도절단 회귀모형에서 역절단확률가중 방법 간의 비교연구)

  • Shin, Jungmin;Kim, Hyungwoo;Shin, Seung Jun
    • The Korean Journal of Applied Statistics
    • /
    • v.34 no.6
    • /
    • pp.957-968
    • /
    • 2021
  • Inverse censoring probability weighting (ICPW) is a popular technique in survival data analysis. In applications of the ICPW technique such as the censored regression, it is crucial to accurately estimate the censoring probability. A simulation study is undertaken in this article to see how censoring probability estimate influences model performance in censored regression using the ICPW scheme. We compare three censoring probability estimators, including Kaplan-Meier (KM) estimator, Cox proportional hazard model estimator, and local KM estimator. For the local KM estimator, we propose to reduce the predictor dimension to avoid the curse of dimensionality and consider two popular dimension reduction tools: principal component analysis and sliced inverse regression. Finally, we found that the Cox proportional hazard model estimator shows the best performance as a censoring probability estimator in both mean and median censored regressions.

Approximate Variance of Least Square Estimators for Regression Coefficient under Inclusion Probability Proportional to Size Sampling (포함확률비례추출에서 회귀계수 최소제곱추정량의 근사분산)

  • Kim, Kyu-Seong
    • Communications for Statistical Applications and Methods
    • /
    • v.19 no.1
    • /
    • pp.23-32
    • /
    • 2012
  • This paper deals with the bias and variance of regression coefficient estimators in a finite population. We derive approximate formulas for the bias, variance and mean square error of two estimators when we select a fixed-size inclusion probability proportional to the size sample and then estimate regression coefficients by the ordinary least square estimator as well as the weighted least square estimator based on the selected sample data. Necessary and sufficient conditions for the comparison of the two estimators in terms of variance and mean square error are suggested. In addition, a simple example is introduced to numerically compare the variance and mean square error of the two estimators.

A Study on the Mixed Model Approach and Symbol Probability Weighting Function for Maximization of Inter-Speaker Variation (화자간 변별력 최대화를 위한 혼합 모델 방식과 심볼 확률 가중함수에 관한 연구)

  • Chin Se-Hoon;Kang Chul-Ho
    • The Journal of the Acoustical Society of Korea
    • /
    • v.24 no.7
    • /
    • pp.410-415
    • /
    • 2005
  • Recently, most of the speaker verification systems are based on the pattern recognition approach method. And performance of the pattern-classifier depends on how to classify a variety of speakers' feature parameters. In order to classify feature parameters efficiently and effectively, it is of great importance to enlarge variations between speakers and effectively measure distances between feature parameters. Therefore, this paper would suggest the positively mixed model scheme that can enlarge inter-speaker variation by searching the individual model with world model at the same time. During decision procedure, we can maximize inter-speaker variation by using the proposed mixed model scheme. We also make use of a symbol probability weighting function in this system so as to reduce vector quantization errors by measuring symbol probability derived from the distance rate of between the world codebook and individual codebook. As the result of our experiment using this method, we could halve the Detection Cost Function (DCF) of the system from $2.37\%\;to\;1.16\%$.

Spatiotemporal analysis of the effect of rainfall data periods on probability rainfall (강우자료 기간에 따른 확률강우량의 시공간적 분석)

  • Lee, Moonyoung;An, Heejin;Lee, Jiwan;Kim, Kewtae;Jung, Younghun;Kim, Seongjoon;Um, Myoung-Jin;Park, Daeryong
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2022.05a
    • /
    • pp.339-339
    • /
    • 2022
  • 본 연구는 최신 강우 자료를 사용하여 자료의 기간을 네 가지 경우로 나누어 기간별 확률강우량을 산정하고 각 기간에 따른 확률강우량의 변화 특성을 파악하고자 하였다. 2020년을 기준으로 시강우 자료 관측기간이 40년 이상이 되는 62개 국내 강우관측소를 연구 대상으로 선정하였으며, 지점별 강우자료의 분석 기간은 최근 10년, 20년, 30년, 40년의 경우로 나누어 분석하였다. 분석기간에 따른 확률강우량은 Gumbel 분포형에 확률가중모멘트법을 적용하여 산정하였고, 이를 연강수량과 함께 공간적으로 분포시킨 결과, 연강수량의 분포에서 나타나지 않는 변화들이 확률강우량의 분포에서 명확히 드러나는 것을 확인할 수 있었다. 또한, 지속기간의 시간이 증가되고 재현기간이 커질수록 경기 북부와 전라남북도 경계 및 영동지방의 확률강우량이 증가하는 경향을 보였고, 최근 40년과 비교하였을 경우, 최근 10년, 20년, 30년 확률강우량의 변화량 결과에서 전라남도 지역은 지속기간 길어질수록 변화 양상이 뚜렷하게 보였으며, 강원도 지역은 최근 10년, 20년 변화량이 상이하게 나타났다. 기간에 따라 확률강우량의 변화량이 크거나 작은 대표 지역들을 선정하여 기간별 확률강우량의 IDF 곡선을 도시하여 비교 및 분석하였다.

  • PDF

Application of Jackknife Method for Determination of Representative Probability Distribution of Annual Maximum Rainfall (연최대강우량의 대표확률분포형 결정을 위한 Jackknife기법의 적용)

  • Lee, Jae-Joon;Lee, Sang-Won;Kwak, Chang-Jae
    • Journal of Korea Water Resources Association
    • /
    • v.42 no.10
    • /
    • pp.857-866
    • /
    • 2009
  • In this study, basic data is consisted annual maximum rainfall at 56 stations that has the rainfall records more than 30years in Korea. The 14 probability distributions which has been widely used in hydrologic frequency analysis are applied to the basic data. The method of moments, method of maximum likelihood and probability weighted moments method are used to estimate the parameters. And 4-tests (chi-square test, Kolmogorov-Smirnov test, Cramer von Mises test, probability plot correlation coefficient (PPCC) test) are used to determine the goodness of fit of probability distributions. This study emphasizes the necessity for considering the variability of the estimate of T-year event in hydrologic frequency analysis and proposes a framework for evaluating probability distribution models. The variability (or estimation error) of T-year event is used as a criterion for model evaluation as well as three goodness of fit criteria (SLSC, MLL, and AIC) in the framework. The Jackknife method plays a important role in estimating the variability. For the annual maxima of rainfall at 56 stations, the Gumble distribution is regarded as the best one among probability distribution models with two or three parameters.

A Study of Frequency Analysis by using Locally Weighted Polynomial Method (지역가중다항식을 이용한 빈도해석에 관한 연구)

  • Moon, Young-Il;Jeong, Min-Su;Choi, Byung-Gyu;You, Seung-Yeon
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2006.05a
    • /
    • pp.804-808
    • /
    • 2006
  • 유량자료를 이용한 매개변수적 빈도해석 방법은 주관적인 분포형 선정문제를 안고 있다. 이러한 분포형 선택문제는 수문자료의 오랜 축척에 따른 통계적 분석을 통해 하나의 확률분포형을 선택할 수 있는 경우 극복될 수 있을 것이다. 그러나, 일반적으로 수문자료의 관측 기간이 짧아 하나의 분포형을 선택하는데 어려움을 갖고 있다. 반면에, 지역가중다항식을 이용한 빈도해석의 경우 단일분포형 선택문제가 아닌 자료로 부터 매개변수를 선택하고 추정함으로서 White noise를 제거 또는 감소하며 자연계의 이질적, 다중변수적 그리고 시공간적 특성을 잘 반영할 수 있는 것으로 알려져 있다. 따라서 본 연구에서는 단일 주관분포 선택문제가 아닌 자료로부터 매개변수의 선택 추정이 이루어지는 지역가중다항식을 이용한 빈도해석을 수행하였다. 분석에는 서울강우자료로 매개변수적 빈도해석을 수행하는 경우 Gumbel, GEV(Type I Extreme Value) 그리고 LN2 (Log-Normal 2) 등의 분포형을 적용하여 지역 가중다항 추정자의 산출 결과와 비교 검토하였다. 또한 각각의 방법을 적용해 이중첨두(bimodal) 분포형에 대한 모형의 적합성을 도시적으로 비교 산정하였다.

  • PDF

Frequency Analysis of Rainfall Data Using Advanced GEV Distribution (개선된 GEV 분포를 이용한 강우량 빈도분석)

  • Lee, Kil-Seong;Kang, Won-Gu;Park, Kyung-Shin;Sung, Jin-Young
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2009.05a
    • /
    • pp.1321-1326
    • /
    • 2009
  • 강우는 수자원 확보 측면에서 근원이 되는 요소이다. 그러므로 정확한 확률강우량 산정은 미래의 가용 수자원량을 예측하는데 있어 중요한 사항중 하나이며 무엇보다 신중한 결정이 요구된다. 또한 하천의 범람에 의한 침수를 예방하는 수공구조물 등의 설계에 있어서는 신뢰할 수 있는 확률강우량 산정이 선행되어야 한다. 본 연구에서는 최근 우리나라 극치강우확률분포로서 많은 연구가 이루어지고 있는 GEV 분포(GEV-O)를 기반으로 위치 매개변수에 시간의 함수를 고려한 개선된 GEV 분포(GEV-A)를 이용하여 서울지점에 적용함으로서 GEV-O 분포에 의한 확률강우량과 GEV-A 분포로 산정된 확률강우량을 비교 검토하였다. 먼저 임의의 난수 발생을 통해 최우도추정법과 확률가중모멘트법으로 매개변수를 추정한 GEV-O 분포와 최우도추정법으로 매개변수를 추정한 GEV-A 분포의 상대평균제곱근오차 (R-RMSE)를 계산하여 비교함으로서 GEV-A 분포의 효율성을 판단하였다. 사례연구는 1961년부터 2008년까지 서울강우관측소에서 측정된 연최대 1일 강우량으로 하였으며 $X^2$-검정, PPCC-검정으로 적합도 검정을 실시하였다. 강우빈도분석 결과 GEV-A 분포가 GEV-O 분포로 산정된 결과 보다 대체로 재현기간 200년 이상일 경우, 과다 산정되는 경향을 보였다. 추후 개선된 GEV 분포를 서울 인근 지점에 적용함으로서 지역빈도해석(Regional Frequency Analysis)을 실행하기 위한 연구가 진행되어야 할 것이다. 또한 확률홍수량 산정 등에도 개선된 GEV 분포를 이용함으로서 보다 정확하고 신뢰성 있는 확률수문량을 예측하여야 할 것이다.

  • PDF