• 제목/요약/키워드: study and auxiliary variables

검색결과 58건 처리시간 0.024초

Generalized Multi-Phase Multivariate Ratio Estimators for Partial Information Case Using Multi-Auxiliary Vatiables

  • Ahmad, Zahoor;Hanif, Muhammad;Ahmad, Munir
    • Communications for Statistical Applications and Methods
    • /
    • 제17권5호
    • /
    • pp.625-637
    • /
    • 2010
  • In this paper we propose generalized multi-phase multivariate ratio estimators in the presence of multiauxiliary variables for estimating population mean vector of variables of interest. Some special cases have been deduced from the suggested estimator in the form of remarks. The expressions for mean square errors of proposed estimators have also been derived. The suggested estimators are theoretically compared and an empirical study has also been conducted.

Efficient Use of Auxiliary Variables in Estimating Finite Population Variance in Two-Phase Sampling

  • Singh, Housila P.;Singh, Sarjinder;Kim, Jong-Min
    • Communications for Statistical Applications and Methods
    • /
    • 제17권2호
    • /
    • pp.165-181
    • /
    • 2010
  • This paper presents some chain ratio-type estimators for estimating finite population variance using two auxiliary variables in two phase sampling set up. The expressions for biases and mean squared errors of the suggested c1asses of estimators are given. Asymptotic optimum estimators(AOE's) in each class are identified with their approximate mean squared error formulae. The theoretical and empirical properties of the suggested classes of estimators are investigated. In the simulation study, we took a real dataset related to pulmonary disease available on the CD with the book by Rosner, (2005).

Improved Exponential Estimator for Estimating the Population Mean in the Presence of Non-Response

  • Kumar, Sunil
    • Communications for Statistical Applications and Methods
    • /
    • 제20권5호
    • /
    • pp.357-366
    • /
    • 2013
  • This paper defines an improvement for estimating the population mean of a study variable using auxiliary information and known values of certain population parameter(s), when there is a non-response in a study as well as on auxiliary variables. Under a simple random sampling without a replacement (SRSWOR) scheme, the mean square error (MSE) of all proposed estimators are obtained and compared with each other. Numerical illustration is also given.

Pliable regression spline estimator using auxiliary variables

  • Oh, Jae-Kwon;Jhong, Jae-Hwan
    • Communications for Statistical Applications and Methods
    • /
    • 제28권5호
    • /
    • pp.537-551
    • /
    • 2021
  • We conducted a study on a regression spline estimator with a few pre-specified auxiliary variables. For the implementation of the proposed estimators, we adapted a coordinate descent algorithm. This was implemented by considering a structure of the sum of the residuals squared objective function determined by the B-spline and the auxiliary coefficients. We also considered an efficient stepwise knot selection algorithm based on the Bayesian information criterion. This was to adaptively select smoothly functioning estimator data. Numerical studies using both simulated and real data sets were conducted to illustrate the proposed method's performance. An R software package psav is available.

Statistical micro matching using a multinomial logistic regression model for categorical data

  • Kim, Kangmin;Park, Mingue
    • Communications for Statistical Applications and Methods
    • /
    • 제26권5호
    • /
    • pp.507-517
    • /
    • 2019
  • Statistical matching is a method of combining multiple sources of data that are extracted or surveyed from the same population. It can be used in situation when variables of interest are not jointly observed. It is a low-cost way to expect high-effects in terms of being able to create synthetic data using existing sources. In this paper, we propose the several statistical micro matching methods using a multinomial logistic regression model when all variables of interest are categorical or categorized ones, which is common in sample survey. Under conditional independence assumption (CIA), a mixed statistical matching method, which is useful when auxiliary information is not available, is proposed. We also propose a statistical matching method with auxiliary information that reduces the bias of the conventional matching methods suggested under CIA. Through a simulation study, proposed micro matching methods and conventional ones are compared. Simulation study shows that suggested matching methods outperform the existing ones especially when CIA does not hold.

단위 무응답 보정에서 보조변수의 선택에 관한 연구 (A Study on Auxiliary Variable Selection in Unit Nonresponse Calibration)

  • 손창균;홍기학;이기성
    • 응용통계연구
    • /
    • 제16권1호
    • /
    • pp.33-44
    • /
    • 2003
  • 조사과정에서 필연적으로 발생하는 무응답을 보정하기 위해 보조정보를 사용해야 한다. 이 때, 이용 가능한 보조정보의 차원이 크면, 계산과정에서 많은 시간이 소요되며 데이터를 다루기가 매우 어렵다. 또한 추정량의 분산이 보조전보의 차원에 의존하기 때문에 과소추정의 문제가 발생한다. 이러한 문제를 해결하기 위해 무응답 보정에서 적절한 보조정보의 선택 방법을 제안하였고, 이에 대한 효율성을 모의실험을 통해 살펴보았다.

보조 정보에 의한 이중적 로버스트 대체법 (Doubly Robust Imputation Using Auxiliary Information)

  • 박현아;전종우;나성룡
    • Communications for Statistical Applications and Methods
    • /
    • 제18권1호
    • /
    • pp.47-55
    • /
    • 2011
  • 비대체와 회귀대체는 조사변수의 모형과 조사변수와 보조변수의 관계에 의존하며 모형이 성립되지 않는 경우 이들 대체법을 이용한 추정량의 불편성은 보장되지 않는다. 본 연구에서는 모형이 성립되지 않는 경우에도 추정량의 근사적 불편성이 성립되는 로버스트 대체법을 개발한다. 대체법 개발시 보조변수의 모수 정보를 이용하여 추정량의 효율 증대를 가져오게 한다. 모의실험을 실시하여 본 연구에 대한 이론적 결과의 타당성을 보인다.

Impact of Trend Estimates on Predictive Performance in Model Evaluation for Spatial Downscaling of Satellite-based Precipitation Data

  • Kim, Yeseul;Park, No-Wook
    • 대한원격탐사학회지
    • /
    • 제33권1호
    • /
    • pp.25-35
    • /
    • 2017
  • Spatial downscaling with fine resolution auxiliary variables has been widely applied to predict precipitation at fine resolution from coarse resolution satellite-based precipitation products. The spatial downscaling framework is usually based on the decomposition of precipitation values into trend and residual components. The fine resolution auxiliary variables contribute to the estimation of the trend components. The main focus of this study is on quantitative analysis of impacts of trend component estimates on predictive performance in spatial downscaling. Two regression models were considered to estimate the trend components: multiple linear regression (MLR) and geographically weighted regression (GWR). After estimating the trend components using the two models,residual components were predicted at fine resolution grids using area-to-point kriging. Finally, the sum of the trend and residual components were considered as downscaling results. From the downscaling experiments with time-series Tropical Rainfall Measuring Mission (TRMM) 3B43 precipitation data, MLR-based downscaling showed the similar or even better predictive performance, compared with GWR-based downscaling with very high explanatory power. Despite very high explanatory power of GWR, the relationships quantified from TRMM precipitation data with errors and the auxiliary variables at coarse resolution may exaggerate the errors in the trend components at fine resolution. As a result, the errors attached to the trend estimates greatly affected the predictive performance. These results indicate that any regression model with high explanatory power does not always improve predictive performance due to intrinsic errors of the input coarse resolution data. Thus, it is suggested that the explanatory power of trend estimation models alone cannot be always used for the selection of an optimal model in spatial downscaling with fine resolution auxiliary variables.

이상적(異常的) 가중치를 줄이는 가중치 조정 방법 연구 (Weighing adjustment avoiding extreme weights)

  • 김재광
    • 한국조사연구학회:학술대회논문집
    • /
    • 한국조사연구학회 2003년도 춘계학술발표대회
    • /
    • pp.19-28
    • /
    • 2003
  • 가중치 조정은 표본추출 확률의 역수로 계산된 기본 가중치 외에 보정승수를 곱해 줌으로써 추정단계에서 보조변수를 활용하여 추정치의 효율을 높이는 방법이다 가중치 조정의 대표적인 예로는 사후층화를 들 수가 있는데 이는 회귀추정의 특수한 경우이나 회귀추정보다 계산이 편리하여 실제로 많이 사용되고 있다. 이러한 경우 보조변수를 많이 사용하게 되면 보정승수가 지나치게 크거나 또는 지나치게 작아지게 되는 경우가 있는데 이렇게 되면 추정치의 편향(bias)이 커지게 된다. 본 연구에서는 적절히 추정치의 효율도 높이면서 편향을 줄이는 가중치 조정 방법을 제안한다. 또한 시뮬레이션을 통하여 제안된 추정치의 성질을 확인하였다.

  • PDF

GENERAL FAMILIES OF CHAIN RATIO TYPE ESTIMATORS OF THE POPULATION MEAN WITH KNOWN COEFFICIENT OF VARIATION OF THE SECOND AUXILIARY VARIABLE IN TWO PHASE SAMPLING

  • Singh Housila P.;Singh Sarjinder;Kim, Jong-Min
    • Journal of the Korean Statistical Society
    • /
    • 제35권4호
    • /
    • pp.377-395
    • /
    • 2006
  • In this paper we have suggested a family of chain estimators of the population mean $\bar{Y}$ of a study variate y using two auxiliary variates in two phase (double) sampling assuming that the coefficient of variation of the second auxiliary variable is known. It is well known that chain estimators are traditionally formulated when the population mean $\bar{X}_1$ of one of the two auxiliary variables, say $x_1$, is not known but the population mean $\bar{X}_2$ of the other auxiliary variate $x_2$ is available and $x_1$ has higher degree of positive correlation with the study variate y than $x_2$ has with y, $x_2$ being closely related to $x_1$. Here the classes are constructed when the population mean $\bar{X}_1\;of\;X_1$ is not known and the coefficient of variation $C_{x2}\;of\;X_2$ is known instead of population mean $\bar{X}_2$. Asymptotic expressions for the bias and mean square error (MSE) of the suggested family have been obtained. An asymptotic optimum estimator (AOE) is also identified with its MSE formula. The optimum sample sizes of the preliminary and final samples have been derived under a linear cost function. An empirical study has been carried out to show the superiority of the constructed estimator over others.