• Title/Summary/Keyword: 교차타당성 (cross-validation)

Bandwidth selections based on cross-validation for estimation of a discontinuity point in density (교차타당성을 이용한 확률밀도함수의 불연속점 추정의 띠폭 선택)

  • Huh, Jib
    • Journal of the Korean Data and Information Science Society / v.23 no.4 / pp.765-775 / 2012
  • Cross-validation is a popular method for selecting the bandwidth in all types of kernel estimation. Maximum likelihood cross-validation, least squares cross-validation, and biased cross-validation have been proposed for bandwidth selection in kernel density estimation. For the case where the probability density function has a discontinuity point, Huh (2012) proposed a bandwidth selection method based on maximum likelihood cross-validation. In this paper, two forms of cross-validation with a one-sided kernel function are proposed for selecting the bandwidth used to estimate the location and jump size of the discontinuity point of a density. These methods are motivated by least squares cross-validation and biased cross-validation. Simulated examples compare the finite-sample performance of the two proposed methods with that of Huh (2012).
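As a point of reference for the least squares cross-validation mentioned in this abstract, the following is a minimal sketch for an ordinary Gaussian kernel density estimate. It is not the one-sided-kernel criterion proposed in the paper; the function names, the simulated sample, and the bandwidth grid are all assumptions made for illustration.

```python
import math
import random

def lscv_score(data, h):
    """Least squares CV criterion: integral of f_hat^2 minus 2 * mean leave-one-out density."""
    n = len(data)
    int_f2 = 0.0   # accumulates the Gaussian kernel convolved with itself, i.e. the N(0, 2) density
    loo_sum = 0.0  # accumulates the Gaussian kernel over pairs i != j
    for i in range(n):
        for j in range(n):
            u = (data[i] - data[j]) / h
            int_f2 += math.exp(-0.25 * u * u) / (2.0 * math.sqrt(math.pi))
            if i != j:
                loo_sum += math.exp(-0.5 * u * u) / math.sqrt(2.0 * math.pi)
    int_f2 /= n * n * h
    loo_mean = loo_sum / (n * (n - 1) * h)
    return int_f2 - 2.0 * loo_mean

def select_bandwidth(data, grid):
    """Return the grid value with the smallest criterion."""
    return min(grid, key=lambda h: lscv_score(data, h))

# Hypothetical data: 100 draws from a standard normal distribution.
random.seed(0)
sample = [random.gauss(0.0, 1.0) for _ in range(100)]
grid = [0.05 * k for k in range(1, 31)]  # candidate bandwidths 0.05 .. 1.50
h_lscv = select_bandwidth(sample, grid)
print("selected bandwidth:", h_lscv)
```

A grid search keeps the sketch short; a numerical optimizer could replace it when a finer bandwidth is needed.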

A Study on Bandwidth Selection in Nonparametric Regression Function Estimation (비모수적 회귀함수 추정에서 평활량의 선택에 관한 연구)

  • 석경하
    • Communications for Statistical Applications and Methods / v.3 no.1 / pp.39-49 / 1996
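  • In nonparametric kernel regression function estimation, the choice of the bandwidth (smoothing parameter) is a very important problem. The bandwidth chosen by cross-validation is known to have a relative convergence rate to the optimal bandwidth of $n^{-1/10}$, which is quite slow. This study shows that the bandwidth selected by the plug-in method has a relative convergence rate of $n^{-2/7}$, faster than that of cross-validation. A simulation study further demonstrates that the plug-in method outperforms cross-validation even in small samples.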
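To give a feel for the gap between the two rates quoted in this abstract, the short sketch below evaluates $n^{-1/10}$ and $n^{-2/7}$ at a few sample sizes. The rates are orders of magnitude with unstated constants, so the numbers compare only how the two quantities scale with n, not actual errors; the function name and the chosen sample sizes are assumptions.

```python
def relative_rates(n):
    """Return (n^(-1/10), n^(-2/7)): the cross-validation and plug-in orders."""
    return n ** (-1 / 10), n ** (-2 / 7)

for n in (100, 10_000, 1_000_000):
    cv_rate, plugin_rate = relative_rates(n)
    print(f"n={n:>9,d}  cross-validation ~ {cv_rate:.3f}  plug-in ~ {plugin_rate:.4f}")
```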

A Study on Kernel Estimation of Derivatives of a Probability Density Function (확률밀도함수의 미분에 대한 커널추정법에 관한 연구)

  • Seok, Gyeong-Ha;Kim, Dae-Hak
    • Journal of the Korean Data and Information Science Society / v.7 no.2 / pp.211-217 / 1996
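  • This paper deals with kernel estimation of the l-th derivative of a probability density function. For two bandwidth selection methods that can be used in kernel estimation of density derivatives, the cross-validation method and the plug-in method, we establish the asymptotic distributions of the selected bandwidths, derive their respective relative convergence rates, and confirm the superiority of the plug-in method through a small-sample simulation study.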

Characteristics of Cross-Flow Heat Transfer Analysis for a Sodium Shell-and-Tube Heat Exchanger (소디움 관-통 형 열교환기의 교차류 열전달 해석 특성)

  • 심윤섭;김연식
    • Proceedings of the Korean Nuclear Society Conference / 1996.05b / pp.298-303 / 1996
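  • As a first step toward developing a heat transfer analysis model for the IHX of a liquid metal reactor, we studied the characteristics of cross-flow heat transfer models. The main contents are as follows: the properties of the boundary-point scheme, a new finite differencing technique, were analyzed to confirm its suitability; the requirements on the number of grid cells were analyzed as a function of IHX geometry and operating requirements; and, from these results, the validity of using a simplified two-dimensional analysis model for IHX analysis was confirmed.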

On variable bandwidth Kernel Regression Estimation (변수평활량을 이용한 커널회귀함수 추정)

  • Seog, Kyung-Ha;Chung, Sung-Suk;Kim, Dae-Hak
    • Journal of the Korean Data and Information Science Society / v.9 no.2 / pp.179-188 / 1998
  • Local polynomial regression estimation is the most popular of the kernel-type regression estimators. As in kernel estimation, bandwidth selection is a crucial problem in local polynomial regression function estimation. When the regression curve has a complicated structure, variable bandwidth selection is appropriate. In this paper, we propose a fully data-driven variable bandwidth selection method. The bandwidth is chosen by minimizing an estimated MSE, which is obtained from a pilot bandwidth study via cross-validation. A Monte Carlo simulation was conducted to show the superiority of the proposed bandwidth selection method.

Penalized logistic regression models for determining the discharge of dyspnea patients (호흡곤란 환자 퇴원 결정을 위한 벌점 로지스틱 회귀모형)

  • Park, Cheolyong;Kye, Myo Jin
    • Journal of the Korean Data and Information Science Society / v.24 no.1 / pp.125-133 / 2013
  • In this paper, penalized binary logistic regression models are employed as statistical models for determining the discharge of 668 patients with a chief complaint of dyspnea, based on the results of 11 blood tests. Specifically, the ridge model based on the $L^2$ penalty and the Lasso model based on the $L^1$ penalty are considered. In the comparison of prediction accuracy, our models are compared with the logistic regression model with all 11 explanatory variables and the model with variables chosen by a variable selection method. The results show that, based on 10-fold cross-validation, the ridge logistic regression model has the best prediction accuracy among the 4 models.
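The evaluation described in this abstract, comparing penalized logistic models by 10-fold cross-validated accuracy, can be sketched as follows. This is an illustration only: it covers the ridge ($L^2$) penalty but not the Lasso, it fits by plain gradient descent, and it runs on synthetic two-feature data rather than the 668-patient dataset. All function names, the penalty grid, and the step settings are assumptions.

```python
import math
import random

def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def fit_ridge_logistic(X, y, lam, steps=200, lr=0.5):
    """Gradient descent on mean log-loss + lam * ||w||^2 (intercept not penalized)."""
    n, p = len(X), len(X[0])
    w, b = [0.0] * p, 0.0
    for _ in range(steps):
        gw, gb = [0.0] * p, 0.0
        for xi, yi in zip(X, y):
            err = sigmoid(b + sum(wj * xj for wj, xj in zip(w, xi))) - yi
            gb += err
            for j in range(p):
                gw[j] += err * xi[j]
        w = [wj - lr * (gj / n + 2.0 * lam * wj) for wj, gj in zip(w, gw)]
        b -= lr * gb / n
    return w, b

def accuracy(model, X, y):
    """Share of cases where the sign of the linear score matches the label."""
    w, b = model
    hits = sum((b + sum(wj * xj for wj, xj in zip(w, xi)) > 0) == (yi == 1)
               for xi, yi in zip(X, y))
    return hits / len(y)

def kfold_accuracy(X, y, lam, k=10, seed=0):
    """Mean held-out accuracy over k folds of a shuffled index list."""
    idx = list(range(len(y)))
    random.Random(seed).shuffle(idx)
    scores = []
    for fold in (idx[i::k] for i in range(k)):
        held = set(fold)
        train = [i for i in idx if i not in held]
        model = fit_ridge_logistic([X[i] for i in train], [y[i] for i in train], lam)
        scores.append(accuracy(model, [X[i] for i in fold], [y[i] for i in fold]))
    return sum(scores) / k

# Synthetic stand-in data (hypothetical): two features and a noisy linear class boundary.
rng = random.Random(1)
X = [[rng.gauss(0, 1), rng.gauss(0, 1)] for _ in range(150)]
y = [1 if x0 - x1 + 0.3 * rng.gauss(0, 1) > 0 else 0 for x0, x1 in X]

cv_by_lambda = {lam: kfold_accuracy(X, y, lam) for lam in (0.0, 0.01, 0.1)}
best_lambda = max(cv_by_lambda, key=cv_by_lambda.get)
print(cv_by_lambda, "best:", best_lambda)
```

Using the same seed for every penalty value keeps the folds identical across candidates, so differences in accuracy reflect the penalty rather than the split.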

Optimal number of dimensions in linear discriminant analysis for sparse data (희박한 데이터에 대한 선형판별분석에서 최적의 차원 수 결정)

  • Shin, Ga In;Kim, Jaejik
    • The Korean Journal of Applied Statistics / v.30 no.6 / pp.867-876 / 2017
  • Datasets with small n and large p arise in various fields, and their analysis remains a challenge in statistics. Discriminant analysis models for such datasets have recently been developed for classification problems. One approach among those models tries to detect dimensions that distinguish well between groups, and the number of detected dimensions is typically smaller than p. In such models, the number of dimensions is important for the prediction and visualization of data, and it can usually be determined by K-fold cross-validation (CV). However, in sparse data scenarios, CV is not reliable for determining the optimal number of dimensions, since each fold may contain only a few observations. Thus, we propose a method to determine the number of dimensions using a measure based on the standardized distance between the mean values of each group in the reduced dimensions. The proposed method is verified through simulations.

Quantile regression using asymmetric Laplace distribution (비대칭 라플라스 분포를 이용한 분위수 회귀)

  • Park, Hye-Jung
    • Journal of the Korean Data and Information Science Society / v.20 no.6 / pp.1093-1101 / 2009
  • Quantile regression has become a widely used technique for describing the distribution of a response variable given a set of explanatory variables. This paper proposes a novel model for quantile regression using a doubly penalized kernel machine with support vector machine iteratively reweighted least squares (SVM-IRWLS). For making inferences about the shape of a population distribution, the widely popular regression would be inadequate if the distribution is not approximately Gaussian. We present a likelihood-based approach to the estimation of the regression quantiles that uses the asymmetric Laplace density.
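The link between the asymmetric Laplace density and quantile estimation can be illustrated in the simplest setting, a location-only model with no covariates. There, maximizing the asymmetric Laplace likelihood in the location parameter is the same as minimizing the quantile check loss. The sketch below assumes a fixed scale and searches only over the observed values, which suffices because the objective is convex and piecewise linear in the location. It is not the paper's SVM-IRWLS kernel machine, and the function names are hypothetical.

```python
import math

def check_loss(u, tau):
    """Quantile check function rho_tau(u) = u * (tau - 1{u < 0})."""
    return u * (tau - (1.0 if u < 0 else 0.0))

def ald_neg_loglik(y, mu, sigma, tau):
    """Negative log-likelihood of an asymmetric Laplace density with location mu."""
    n = len(y)
    const = -n * math.log(tau * (1.0 - tau) / sigma)
    return const + sum(check_loss((yi - mu) / sigma, tau) for yi in y)

def ald_location_mle(y, tau, sigma=1.0):
    """Minimize the negative log-likelihood over the observed values."""
    return min(y, key=lambda q: ald_neg_loglik(y, q, sigma, tau))

values = list(range(1, 10))  # toy data 1, 2, ..., 9
median_fit = ald_location_mle(values, tau=0.5)
lower_fit = ald_location_mle(values, tau=0.25)
print(median_fit, lower_fit)
```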

Varying coefficient model with errors in variables (가변계수 측정오차 회귀모형)

  • Sohn, Insuk;Shim, Jooyong
    • Journal of the Korean Data and Information Science Society / v.28 no.5 / pp.971-980 / 2017
  • The varying coefficient regression model has gained much attention because it can model dynamic changes of regression coefficients in many regression problems in science. In this paper, we propose a varying coefficient regression model that effectively accounts for errors in both the input and response variables. It uses the kernel method to estimate the varying coefficient, which is an unknown nonlinear function of the smoothing variables. We provide a generalized cross-validation method for choosing the hyperparameters that affect the performance of the proposed model. The proposed method is evaluated through numerical studies.
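For orientation, the usual form of the generalized cross-validation criterion for a linear smoother is shown below. It assumes fitted values of the form $\hat{y}(\theta) = H(\theta)\,y$ with hyperparameters $\theta$; the exact criterion used in the paper for the errors-in-variables model may differ.

$$\mathrm{GCV}(\theta) = \frac{\frac{1}{n}\sum_{i=1}^{n}\bigl(y_i - \hat{y}_i(\theta)\bigr)^2}{\bigl(1 - \operatorname{tr}(H(\theta))/n\bigr)^2}$$

The hyperparameters are then chosen to minimize $\mathrm{GCV}(\theta)$, which avoids refitting the model n times as leave-one-out cross-validation would.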

A Study on Velocity Distribution Around Ship Stern by Improved Power Law Flow Model (멱법칙 유동모델의 개선에 의한 선미 유동장내 속도분포 연구)

  • 김시영
    • Transactions of the Korean Society of Mechanical Engineers / v.16 no.7 / pp.1391-1397 / 1992
  • An improved power law flow model is suggested for calculating wake flow characteristics around a three-dimensional ship stern when a bilge vortex forms in the direction of the stern. In comparison with the power law and Coles flow models, the flow velocity calculated in this study was delayed around the boundary between the inner and outer layers in reverse flow. The improved power law flow model gave more accurate results for the velocity calculation around the ship stern. Accuracy was validated by comparison with other calculation results and experimental data.