• 제목/요약/키워드: quantiles

검색결과 181건 처리시간 0.021초

Discretization Method Based on Quantiles for Variable Selection Using Mutual Information

  • CHa, Woon-Ock;Huh, Moon-Yul
    • Communications for Statistical Applications and Methods
    • /
    • 제12권3호
    • /
    • pp.659-672
    • /
    • 2005
  • This paper evaluates discretization of continuous variables to select relevant variables for supervised learning using mutual information. Three discretization methods, MDL, Histogram and 4-Intervals are considered. The process of discretization and variable subset selection is evaluated according to the classification accuracies with the 6 real data sets of UCI databases. Results show that 4-Interval discretization method based on quantiles, is robust and efficient for variable selection process. We also visually evaluate the appropriateness of the selected subset of variables.

Sequential Confidence Intervals for Quantiles Based on Recursive Density Estimators

  • Kim, Sung-Kyun;Kim, Sung-Lai
    • Journal of the Korean Statistical Society
    • /
    • 제28권3호
    • /
    • pp.297-309
    • /
    • 1999
  • A sequential procedure of fixed-width confidence intervals for quantiles satisfying a condition of coverage probability is provided based on recursive density estimators. It is shown that the proposed sequential procedure is asymptotically efficient. In addition, the asymptotic normality for the proposed stopping time is derived.

  • PDF

Improving Sample Entropy Based on Nonparametric Quantile Estimation

  • Park, Sang-Un;Park, Dong-Ryeon
    • Communications for Statistical Applications and Methods
    • /
    • 제18권4호
    • /
    • pp.457-465
    • /
    • 2011
  • Sample entropy (Vasicek, 1976) has poor performance, and several nonparametric entropy estimators have been proposed as alternatives. In this paper, we consider a piecewise uniform density function based on quantiles, which enables us to evaluate entropy in each interval, and study the poor performance of the sample entropy in terms of the poor estimation of lower and upper quantiles. Then we propose some improved entropy estimators by simply modifying the quantile estimators, and compare their performances with some existing estimators.

Nonparametric Estimation using Regression Quantiles in a Regression Model

  • Han, Sang-Moon;Jung, Byoung-Cheol
    • 응용통계연구
    • /
    • 제25권5호
    • /
    • pp.793-802
    • /
    • 2012
  • One proposal is made to construct a nonparametric estimator of slope parameters in a regression model under symmetric error distributions. This estimator is based on the use of the idea of minimizing approximate variance of a proposed estimator using regression quantiles. This nonparametric estimator and some other L-estimators are studied and compared with well known M-estimators through a simulation study.

꼬리가 두꺼운 분포의 고분위수에 대한 준모수적 붓스트랩 신뢰구간 (Semi-parametric Bootstrap Confidence Intervals for High-Quantiles of Heavy-Tailed Distributions)

  • 김지현
    • Communications for Statistical Applications and Methods
    • /
    • 제18권6호
    • /
    • pp.717-732
    • /
    • 2011
  • 꼬리가 두꺼운 분포의 고분위수에 대한 신뢰구간을 구할 때 적절한 붓스트랩 방법은 무엇인가에 대해 알아보았다. 비모수적 방법과 모수적 방법, 그리고 준모수적 방법의 성능을 모의실험을 통해 비교하였다.

절사가 주어질때 회귀기울기의 점근적 최량 L-추정법 (Asymptotically Efficient L-Estimation for Regression Slope When Trimming is Given)

  • Sang Moon Han
    • 응용통계연구
    • /
    • 제7권2호
    • /
    • pp.173-182
    • /
    • 1994
  • Han(1993)의 임의의 오차분포하에서 회귀모형에의 기울기 추정법을 응용하여 회귀분위선(regression quantile)에 의해 적당한 상.하위절사가 주어질 때 점근적으로 최량의 회귀모형에서의 기울기 추정량을 구성할 수 있음을 보였다.

  • PDF

POT방법론을 이용한 자동차보험 손해율 추정 (Estimation of Car Insurance Loss Ratio Using the Peaks over Threshold Method)

  • 김수영;송종우
    • 응용통계연구
    • /
    • 제25권1호
    • /
    • pp.101-114
    • /
    • 2012
  • 자동차보험의 손해율이란 지급보험금의 수입보험료에 대한 비율을 의미한다. 손해율이 매우 큰 값을 갖는 대형손실이 일어나는 경우에는 보험회사의 재무적인 부분에 큰 악영향을 미치게 된다. 따라서 보험회사가 이에 대비할 수 있도록 하기 위하여 손해율의 극단 분위수(extreme quantile)를 추정하는 것은 매우 중요한 일이다. 다른 종류의 보험 관련 데이터와 같이 손해율의 분포는 오른쪽으로 긴 꼬리를 갖는 두꺼운 꼬리분포(heavy-tailed distribution)를 갖는다. 이런 자료에서 극단 분위수룰 추정하기 위하여 가장 많이 사용되는 방법론은 POT(Peaks over threshold)와 Hill 추정(Hill estimation)이다. 본 논문에서는 일반화파레토분포(generalized Pareto distribution; GPD)의 다양한 모수추정방법론의 성능을 모의실험과 실제 손해율 데이터를 사용하여 비교, 분석하였다. 또한 Hill 추정치를 사용하여 극단 분위수를 추정하였다. 그 결과 대부분의 경우에 POT 방법론이 Hill 추정치를 이용한 방법보다 정확한 분위수를 추정하였고, 모수추정방법론 중에서는 MLE, Zhang, NLS-2 방법론이 가장 좋은 결과를 보여주었다.

미계측 유역의 홍수피해분석을 위한 자연유량의 홍수지표 기반 확률홍수량 산정 (Estimation of Flood Quantile in Ungauged Watersheds for Flood Damage Analysis Based on Flood Index of Natural Flow)

  • 채병석;최시중;안재현;김태웅
    • 대한토목학회논문집
    • /
    • 제38권1호
    • /
    • pp.175-182
    • /
    • 2018
  • 본 연구에서는 설계 강우-유출 관계 분석법으로 산정된 값을 지역빈도해석 기법을 바탕으로 보정하여 미계측 유역에서의 확률홍수량을 산정하는 방법을 제안하였다. 홍수빈도해석법과 설계 강우-유출 관계 분석법을 비교 분석한 결과, 설계 강우-유출 관계 분석법으로 산정된 확률홍수량이 약 52% 과대 산정되는 것으로 나타났다. 또한, 미계측 유역의 확률홍수량을 산정하기 위해서 유역 특성인자를 자연유량으로 지표화 하여 지역빈도해석법을 수행하였다. 이와 같은 세 가지 방법의 설계홍수량 산정법을 기반으로 미계측 유역을 대상으로 적용할 수 있는 보정식을 도출하였다. 미계측 유역에 대한 적용성을 검토하기 위해 Leave-One-Out Cross-Validation 기법과 Skill Score 기법을 적용하였다. 그 결과, 정확도가 기존의 설계 강우-유출 관계 분석법보다 23.2% 증가한 것으로 나타났다.

Wage Determinants Analysis by Quantile Regression Tree

  • Chang, Young-Jae
    • Communications for Statistical Applications and Methods
    • /
    • 제19권2호
    • /
    • pp.293-301
    • /
    • 2012
  • Quantile regression proposed by Koenker and Bassett (1978) is a statistical technique that estimates conditional quantiles. The advantage of using quantile regression is the robustness in response to large outliers compared to ordinary least squares(OLS) regression. A regression tree approach has been applied to OLS problems to fit flexible models. Loh (2002) proposed the GUIDE algorithm that has a negligible selection bias and relatively low computational cost. Quantile regression can be regarded as an analogue of OLS, therefore it can also be applied to GUIDE regression tree method. Chaudhuri and Loh (2002) proposed a nonparametric quantile regression method that blends key features of piecewise polynomial quantile regression and tree-structured regression based on adaptive recursive partitioning. Lee and Lee (2006) investigated wage determinants in the Korean labor market using the Korean Labor and Income Panel Study(KLIPS). Following Lee and Lee, we fit three kinds of quantile regression tree models to KLIPS data with respect to the quantiles, 0.05, 0.2, 0.5, 0.8, and 0.95. Among the three models, multiple linear piecewise quantile regression model forms the shortest tree structure, while the piecewise constant quantile regression model has a deeper tree structure with more terminal nodes in general. Age, gender, marriage status, and education seem to be the determinants of the wage level throughout the quantiles; in addition, education experience appears as the important determinant of the wage level in the highly paid group.

Quantile regression analysis: A novel approach to determine distributional changes in rainfall over Sri Lanka

  • S.S.K, Chandrasekara;Uranchimeg, Sumiya;Kwon, Hyun-Han
    • 한국수자원학회:학술대회논문집
    • /
    • 한국수자원학회 2017년도 학술발표회
    • /
    • pp.228-232
    • /
    • 2017
  • Extreme hydrological events can cause serious threats to the society. Hence, the selection of probability distributions for extreme rainfall is a fundamental issue. For this reason, this study was focused on understanding possible distributional changes in annual daily maximum rainfalls (AMRs) over time in Sri Lanka using quantile regression. A simplified nine-category distributional-change scheme based on comparing empirical probability density function of two years (i.e. the first year and the last year), was used to determine the distributional changes in AMRs. Daily rainfall series of 13 station over Sri Lanka were analyzed for the period of 1960-2015. 4 distributional change categories were identified for the AMRs. 5 stations showed an upward trend in all the quantiles (i.e. 9 quantiles: from 0.05 to 0.95 with an increment of 0.01 for the AMR) which could give high probability of extreme rainfall. On the other hand, 8 stations showed a downward trend in all the quantiles which could lead to high probability of the low rainfall. Further, we identified a considerable spatial diversity in distributional changes of AMRs over Sri Lanka.

  • PDF