DOI QR코드

DOI QR Code

Effect of Bias on the Pearson Chi-squared Test for Two Population Homogeneity Test

  • Heo, Sunyeong (Department of statistics, Changwon National University)
  • Received : 2012.09.18
  • Accepted : 2012.12.21
  • Published : 2012.09.30

Abstract

Categorical data collected based on complex sample design is not proper for the standard Pearson multinomial-based chi-squared test because the observations are not independent and identically distributed. This study investigates effects of bias of point estimator of population proportion and its variance estimator to the standard Pearson chi-squared test statistics when the sample is collected based on complex sampling scheme. This study examines the effect under two population homogeneity test. The standard Pearson test statistic can be partitioned into two parts; the first part is the weighted sum of ${\chi}^2_1$ with eigenvalues of design matrix as their weights, and the additional second part which is added due to the biases of the point estimator and its variance estimator. Our empirical analysis shows that even though the bias of point estimator is small, Pearson test statistic is very much inflated due to underestimate the variance of point estimator. In the connection of design-based variance estimator and its design matrix, the bigger the average of eigenvalues of design matrix is, the larger relative size of which the first component part to Pearson test statistic is taking.

Keywords

References

  1. P. J. Lavrakas, "Encyclopedia of survey research methods", Sage, London, Vol. 2, p. 113, 2008.
  2. D. Holt, A. J. Scott, and P. D. Ewings, "Chi-squared tests with survey data", J. the R. Stat. Soc. A, Vol. 143, pp. 302-320, 1980.
  3. J. N. K. Rao and A. J. Scott., "The analysis of categorical data from complex sample surveys: Chisquared tests for goodness of fit the independence in two-way tables", J. Am. Stat. Assoc., Vol. 76, pp. 221-230, 1981. https://doi.org/10.1080/01621459.1981.10477633
  4. J. N. K. Rao and A. J. Scott, "On chi-squared test for multiway contingency tables with cell proportions estimated from survey data", The Annals of Statistics, Vol. 12, pp. 46-60, 1984. https://doi.org/10.1214/aos/1176346391
  5. J. N. K. Rao and A. J. Scott, "On simple adjustments to chi-square tests with sample survey data", The Annals of Statistics, Vol. 15, pp. 385-397, 1987. https://doi.org/10.1214/aos/1176350273
  6. D. R. Thomas and J. N. K. Rao, "Small-sample comparisons of level and power for simple goodness- of-fit statistics under cluster sampling", J. Am. Stat. Assoc., Vol. 82, pp. 630-636, 1987. https://doi.org/10.1080/01621459.1987.10478476
  7. S. Heo, "Power analysis of the Rao-Scott first-order adjustment to the Pearson test for homogeneity", Joint Statistical Meetings Proceedings, Seattle, U.S.A., pp. 3126-3129, 2006.
  8. J. Shao, "Resampling methods in sample surveys (with discussion)", Statistics, Vol. 27, 203-254, 1996. https://doi.org/10.1080/02331889708802523