• Title/Summary/Keyword: 복합표본설계 (complex sample design)


Effect of complex sample design on Pearson test statistic for homogeneity (복합표본자료에서 동질성검정을 위한 피어슨 검정통계량의 효과)

  • Heo, Sun-Yeong;Chung, Young-Ae
    • Journal of the Korean Data and Information Science Society / v.23 no.4 / pp.757-764 / 2012
  • This research compares test statistics for homogeneity when the data are collected under a complex sample design. Survey data based on a complex sample design do not satisfy the independence condition required for the standard Pearson multinomial-based chi-squared test. Today, many data sets are collected under complex sample designs, yet tests for categorical data are still conducted using the standard Pearson chi-squared test. In this study, we compared the performance of three test statistics for homogeneity between two populations, using data from the 2009 customer satisfaction evaluation survey on the services of the Gyeongsangnam-do regional offices of education: the standard Pearson test, the unbiased Wald test, and the Pearson-type test with survey-based point estimates. Through empirical analyses, we first showed that the standard Pearson test greatly inflates the value of the test statistic, so its results are not reliable. Second, in the comparison of the Wald test and the Pearson-type test, we found that the test results are affected by the number of categories and by the mean and standard deviation of the eigenvalues of the design matrix.
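
The inflation described in this abstract can be illustrated with a minimal sketch. It computes the standard Pearson homogeneity statistic for two populations and then applies a simple first-order design-effect correction (a Rao-Scott-type adjustment). The correction is swapped in here for illustration only; it is not the unbiased Wald test or the Pearson-type test studied in the paper, and the function names and the `mean_deff` input are hypothetical.

```python
from typing import Sequence


def pearson_homogeneity(counts_a: Sequence[float], counts_b: Sequence[float]) -> float:
    """Standard Pearson chi-squared statistic for homogeneity of two populations.

    Treats the counts as independent multinomial draws, which is the assumption
    that a complex sample design violates.
    """
    if len(counts_a) != len(counts_b):
        raise ValueError("both populations need the same categories")
    n_a, n_b = sum(counts_a), sum(counts_b)
    if n_a <= 0 or n_b <= 0:
        raise ValueError("each population needs a positive total count")
    n = n_a + n_b
    stat = 0.0
    for a, b in zip(counts_a, counts_b):
        column_total = a + b
        if column_total == 0:
            continue  # an empty category contributes nothing
        for observed, row_total in ((a, n_a), (b, n_b)):
            expected = row_total * column_total / n
            stat += (observed - expected) ** 2 / expected
    return stat


def first_order_adjusted(stat: float, mean_deff: float) -> float:
    """Divide the Pearson statistic by an assumed mean design effect.

    mean_deff is a hypothetical input that would have to be estimated from the
    survey design; values above 1 shrink the statistic.
    """
    if mean_deff <= 0:
        raise ValueError("mean_deff must be positive")
    return stat / mean_deff


if __name__ == "__main__":
    x2 = pearson_homogeneity([30, 20], [20, 30])
    print(x2, first_order_adjusted(x2, mean_deff=2.0))
```

With an assumed mean design effect above 1, the adjusted statistic is smaller than the unadjusted one, which matches the direction of the inflation reported in the abstract.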

A Study of Composite Estimator in 2-level Rotation Design based on 3 Rotation Groups (3개의 교체그룹을 갖는 2수준 교체표본설계에서의 복합추정량에 관한 연구)

  • 박유성;문원기;김기환
    • The Korean Journal of Applied Statistics / v.15 no.1 / pp.45-55 / 2002
  • The 2-level rotation design based on 3 rotation groups is discussed in view of the Monthly Retail Trade Survey conducted by the U.S. Bureau of the Census, and composite estimators for population characteristics are considered. The generalized composite estimators and the recursive composite estimators are presented for the 2-level rotation design with a design gap, and variance formulas for the composite estimators are provided. In addition, under response variability related to the covariance and correlation structure arising from repeated responses, the relative efficiencies of the composite estimators are compared.

Comparison of Regression Model Approaches fitted to Complex Survey Data (복합표본조사 데이터 분석을 위한 회귀모형 접근법의 비교: 소규모사업체조사 데이터 분석을 중심으로)

  • 이기재
    • Survey Research / v.2 no.1 / pp.73-86 / 2001
  • In this paper, we conducted an empirical study to investigate the design and weighting effects on descriptive and analytic statistics. Through an analysis of these design and weighting effects, we compared regression models fitted with the design-based approach and the generalized estimating equations (GEEs) approach against the model-based approach.


Sample Design for the Farm Household Economy Survey (농가경제조사를 위한 표본설계)

  • Sin, Min-Ung;Lee, Gye-O;Hong, Gi-Hak;Lee, Gi-Jae
    • Proceedings of the Korean Statistical Society Conference / 2002.11a / pp.13-18 / 2002
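  • In this paper, we improved the current Farm Household Economy Survey, which was designed in 1997 and is still in use, so that it adequately reflects the rapidly changing rural environment. To select new sample enumeration districts, we analyzed in depth the 1999 and 2000 Farm Household Economy Survey data and the results of the Agriculture and Fisheries Census conducted in 2000. On this basis, we constructed a new survey population suited to current rural conditions, established stratification criteria that reflect the current rural environment, and selected the sample enumeration districts. In addition, to improve the precision of the Agricultural Production Cost Survey for six major crops, including paddy rice, the main production areas of each crop were selected as sample enumeration districts.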


Generalized Composite Estimators and Mean Squared Errors for l/G Rotation Design (l/G 교체표본디자인에서의 일반화복합추정량과 평균제곱오차에 관한 연구)

  • 김기환;박유성;남궁재은
    • The Korean Journal of Applied Statistics / v.17 no.1 / pp.61-73 / 2004
  • Rotation sampling designs may be classified into two categories. The first type uses the same sample unit for the entire life of the survey. The second type uses the sample unit only a fixed number of times. In both types of design, the entire sample is partitioned into a finite number (= G) of rotation groups. This paper generalizes the first type of design. Since the generalized design can be identified by only the G rotation groups and the recall level l, we denote this rotation system as the l/G rotation design. Under the l/G rotation design, the variance and mean squared error (MSE) of the generalized composite estimator are derived, incorporating two types of bias and an exponentially decaying correlation pattern. Comparing the MSEs of some selected l/G designs, we investigate design efficiency, the design gap effect, and the effects of correlation and bias.

Measuring stratification effects for multistage sampling (다단추출 표본설계의 층효율성 연구)

  • Taehoon Kim;KeeJae Lee;Inho Park
    • The Korean Journal of Applied Statistics / v.36 no.4 / pp.337-347 / 2023
  • Sampling designs often use stratified sampling, where elements or clusters of the study population are divided into strata and an independent sample is chosen from each stratum. The stratification strategy consists of stratification and sample allocation, which are important issues that are repeatedly considered in survey sampling. Although a stratified multistage sample design is often used in practice, the literature tends to discuss simple sampling in terms of stratum effects or stratum efficiency. This study examines an existing stratum efficiency measure for two-stage sampling and further proposes additional stratum efficiency measures using the design effect model. The proposed measures are used to evaluate the stratification strategy of the sample design for high school students of the 4th Korean National Environmental Health Survey (KoNEHS).
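
The abstract refers to a design effect model without stating its form. As a rough illustration only, the sketch below uses Kish's well-known approximation for cluster sampling, deff = 1 + (m - 1) * rho, where m is the mean cluster size and rho is the intraclass correlation. This is an assumed stand-in, not the stratum efficiency measures proposed in the paper, and the function names are hypothetical.

```python
def kish_design_effect(mean_cluster_size: float, icc: float) -> float:
    """Kish's approximate design effect for cluster sampling.

    mean_cluster_size: average number of sampled elements per cluster (m).
    icc: intraclass correlation (rho) of the study variable within clusters.
    """
    if mean_cluster_size < 1:
        raise ValueError("mean_cluster_size must be at least 1")
    return 1.0 + (mean_cluster_size - 1.0) * icc


def effective_sample_size(n: int, deff: float) -> float:
    """Size of a simple random sample with the same variance as the design."""
    if deff <= 0:
        raise ValueError("deff must be positive")
    return n / deff


if __name__ == "__main__":
    deff = kish_design_effect(mean_cluster_size=10, icc=0.05)
    print(deff, effective_sample_size(1000, deff))
```

A stratification that lowers the within-stratum variance pushes the overall design effect down, which is the kind of gain a stratum efficiency measure is meant to quantify.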

Error cause analysis of Pearson test statistics for k-population homogeneity test (k-모집단 동질성검정에서 피어슨검정의 오차성분 분석에 관한 연구)

  • Heo, Sunyeong
    • Journal of the Korean Data and Information Science Society / v.24 no.4 / pp.815-824 / 2013
  • The traditional Pearson chi-squared test is not appropriate for data collected under a complex sample design. When the traditional Pearson chi-squared test is applied to complex sample categorical data, it may give wrong test results, and the error may arise not only from biased variance estimators but also from biased point estimators of the cell proportions. In this study, a design-based consistent Wald test statistic was derived for the k-population homogeneity test, and the traditional Pearson chi-squared test statistic was partitioned into three parts according to the cause of error: the error due to the bias of the variance estimator, the error due to the bias of the cell proportion estimator, and the unseparated error due to both biases together. The relative size of each error component in the Pearson chi-squared test statistic was analyzed empirically, using the second-year data from the fourth Korea National Health and Nutrition Examination Survey (KNHANES IV-2). The empirical results show that the error from the bias of the variance estimator was larger than the error from the bias of the cell proportion estimator, although the degree differed from variable to variable.

Comparison of Regression Model Approaches fitted to Complex Survey Data (복합표본조사 데이터 분석을 위한 회귀모형 접근법의 비교: 소규모사업체조사 데이터 분석을 중심으로)

  • 이기재
    • Proceedings of the Korean Association for Survey Research Conference / 2001.04a / pp.73-86 / 2001
  • In this paper, we conducted an empirical study to investigate the design and weighting effects on descriptive and analytic statistics. Through an analysis of these design and weighting effects, we compared regression models fitted with the design-based approach and the generalized estimating equations (GEEs) approach against the model-based approach.

The Analysis of the Relationship among Physical Activity Level, Subjective Health Status, COVID-19 Fear applying the Complex Sampling Design

  • Park, Jae-Ahm
    • Journal of the Korea Society of Computer and Information / v.27 no.6 / pp.139-147 / 2022
  • This study analyzed the relationships among physical activity level, subjective health status, and fear of COVID-19. It used the 2020 Community Health Survey, which includes data on 229,269 adults over 19 years old. The complex sampling design was applied, including weight, stratification, and cluster variables. Using complex-sample frequency analysis, complex-sample chi-square tests, and complex-sample regression in the SPSS statistics program, the study found the following. First, the group with a high level of physical activity showed a higher level of subjective health status than the group with a low level of physical activity. Second, the group with a high level of physical activity showed a lower level of COVID-19 fear than the group with a low level of physical activity. Third, the group with a high level of subjective health status showed a lower level of COVID-19 fear than the group with a low level of subjective health status. However, the study is limited in that it did not check whether participants had been diagnosed with COVID-19.

A Composite Estimator for Cut-off Sampling using Cost Function (절사표본 설계에서 비용함수를 고려한 복합추정량)

  • Sim, Hyo-Seon;Shin, Key-Il
    • The Korean Journal of Applied Statistics / v.27 no.1 / pp.43-59 / 2014
  • Cut-off sampling has been widely used for highly skewed populations, such as those in business surveys, by discarding a part of the population, the so-called take-nothing stratum. For a more accurate estimate of the population total, Hwang and Shin (2013) suggested a composite estimator of the take-nothing stratum total that combines the survey results of the take-nothing stratum and a take-some sub-stratum (a part of the take-some stratum). In this paper, we propose a new cut-off sampling scheme that considers a cost function, together with a composite estimator based on the proposed scheme. Small simulation studies compare the performance of known composite estimators with that of the new composite estimator. We also use Briquette Consumption Survey data for a real data analysis.