Search | Korea Science

Lee, Sun-Ho;Lee, Kwang-Hyun
- Journal of the Korean Data and Information Science Society / v.23 no.1 / pp.1-11 / 2012
When microarray experiments were first developed, the main interest was limited to detecting differentially expressed genes associated with a phenotype of interest. However, as human diseases are thought to arise through the interactions of multiple genes within the same functional category, the unit of analysis in microarray experiments has expanded to sets of genes. For the phenotype of censored survival time, Gene Set Enrichment Analysis (GSEA), the global test, and the Wald-type test are widely used. In this paper, we modify the Wald-type test by adopting a normal score transformation of gene expression values and develop a parametric test that requires much less computation than the others. The proposed method is compared with other methods using a real ovarian cancer data set and a simulated data set.
https://doi.org/10.7465/jkdi.2012.23.1.001
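The normal score transformation mentioned in the abstract above can be illustrated with a minimal sketch. The abstract does not say which variant the authors use, so the rank-to-probability rule `rank / (n + 1)`, the average-rank handling of ties, and the function name `normal_scores` are all assumptions made here for illustration only.

```python
from statistics import NormalDist


def normal_scores(values):
    """Replace each value by a normal score based on its rank.

    Assumption: scores are Phi^{-1}(rank / (n + 1)), with tied values
    sharing their average rank. The paper may use a different variant.
    """
    n = len(values)
    order = sorted(range(n), key=lambda i: values[i])
    ranks = [0.0] * n
    i = 0
    while i < n:
        # Find the run of tied values starting at sorted position i.
        j = i
        while j + 1 < n and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1  # mean of the 1-based ranks i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        i = j + 1
    std_normal = NormalDist()  # standard normal: mean 0, sd 1
    return [std_normal.inv_cdf(r / (n + 1)) for r in ranks]


if __name__ == "__main__":
    # Hypothetical expression values for one gene across five samples.
    print(normal_scores([2.3, 0.1, 5.7, 0.1, 1.4]))
```

Because the scores depend only on ranks, extreme expression values no longer dominate, which is one plausible reason such a transformation suits a parametric test.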

Heo, Sun-Yeong;Yi, Su-Cheol
- Journal of the Korean Data and Information Science Society / v.16 no.3 / pp.609-620 / 2005
In secondary analysis of categorical data, situations often arise in which the estimated cell variances are available but the full variance matrix is not. In this case researchers are often inclined to use Pearson-type test statistics for homogeneity. However, for a complex sample, the observed cell proportions do not follow a multinomial distribution, and the Pearson-type test statistic generally does not have an asymptotic chi-square distribution. This paper evaluates the power of the Wald test, the Pearson-type test, and the first-order corrected Pearson-type test for homogeneity. The resulting power curves indicate that as the misspecification effect increases, the inflation of the significance level and the loss of power of the Pearson-type test become more severe.

Heo, Sun-Yeong;Chung, Young-Ae
- Journal of the Korean Data and Information Science Society / v.23 no.4 / pp.757-764 / 2012
This study compares test statistics for homogeneity when the data are collected under a complex sample design. Survey data from a complex sample design do not satisfy the independence condition required by the standard Pearson multinomial-based chi-squared test. Today, many data sets are collected using complex sample designs, yet tests for categorical data are still conducted using the standard Pearson chi-squared test. In this study, we compare the performance of three test statistics for homogeneity between two populations, using data from the 2009 customer satisfaction evaluation survey on the services of the Gyeongsangnam-do regional offices of education: the standard Pearson test, the unbiased Wald test, and the Pearson-type test with survey-based point estimates. Through empirical analyses, we first show that the standard Pearson test greatly inflates the values of the test statistic, so its results are not reliable. Second, in comparing the Wald test and the Pearson-type test, we find that the test results are affected by the number of categories and by the mean and standard deviation of the eigenvalues of the design matrix.
https://doi.org/10.7465/jkdi.2012.23.4.757

Kang, Shin-Soo;Larsen, Michael D.
- Journal of the Korean Data and Information Science Society / v.26 no.4 / pp.971-984 / 2015
Imputation procedures fill in missing values, thereby enabling complete-data analyses. Fully efficient fractional imputation (FEFI) and multiple imputation (MI) create multiple versions of the missing observations, thereby reflecting uncertainty about their true values. Methods have been described for hypothesis testing with multiple imputation. Fractional imputation assigns weights to the observed data to compensate for missing values. The focus of this article is the development of tests of independence using FEFI for partially classified two-way contingency tables. Wald and deviance tests of independence under FEFI are proposed. Simulations are used to compare type I error rates and power. The partially observed marginal information is useful for estimating the joint distribution of cell probabilities, but it is not useful for testing association. FEFI compares favorably to other methods in simulations.
https://doi.org/10.7465/jkdi.2015.26.4.971

Lee, Seung-Chun
- Communications for Statistical Applications and Methods / v.18 no.4 / pp.445-455 / 2011
Although misclassified binary data occur frequently in practice, the statistical methodology available for such data is rather limited. In particular, interval estimation of the population proportion has relied on the classical Wald method. Recently, Lee and Choi (2009) developed a new confidence interval by applying the Agresti-Coull approach and demonstrated its efficiency numerically, but a theoretical justification has not yet been explored. Therefore, a Bayesian model for misclassified binary data is developed to examine the Agresti-Coull confidence interval from a theoretical point of view. It is shown that the Agresti-Coull confidence interval is essentially a Bayesian confidence interval.
https://doi.org/10.5351/CKSS.2011.18.4.445
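For orientation, the contrast between the classical Wald interval and the Agresti-Coull interval can be sketched for an ordinary binomial proportion. This sketch deliberately ignores misclassification, so it is not the interval of Lee and Choi (2009) or the Bayesian model of the paper; the function names and the example counts are illustrative assumptions.

```python
from math import sqrt
from statistics import NormalDist


def wald_interval(x, n, level=0.95):
    """Classical Wald interval: p_hat +/- z * sqrt(p_hat * (1 - p_hat) / n)."""
    z = NormalDist().inv_cdf(0.5 + level / 2)
    p_hat = x / n
    half = z * sqrt(p_hat * (1 - p_hat) / n)
    return p_hat - half, p_hat + half


def agresti_coull_interval(x, n, level=0.95):
    """Agresti-Coull interval: add z^2/2 successes and z^2/2 failures,
    then apply the Wald formula to the adjusted counts."""
    z = NormalDist().inv_cdf(0.5 + level / 2)
    n_adj = n + z * z
    p_adj = (x + z * z / 2) / n_adj
    half = z * sqrt(p_adj * (1 - p_adj) / n_adj)
    return p_adj - half, p_adj + half


if __name__ == "__main__":
    # Hypothetical data: 0 successes in 20 trials.
    print(wald_interval(0, 20))           # degenerate: zero width
    print(agresti_coull_interval(0, 20))  # non-degenerate interval
```

With zero observed successes the Wald interval collapses to a single point, while the Agresti-Coull adjustment shifts the center toward 1/2 and keeps a positive width. That shrinkage toward 1/2 resembles what a prior does to a posterior estimate, which is the flavor of the Bayesian reading the abstract describes.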

정형철;김대학
- The Korean Journal of Applied Statistics / v.15 no.2 / pp.355-366 / 2002
Nonparametric statistical inference for the parameters of the logit model is examined. A nonparametric approach usually requires milder assumptions than a parametric approach based on normal theory. We compare two nonparametric methods for the logit model, the bootstrap and random permutation, in terms of coverage probability. A Monte Carlo simulation is conducted for small-sample cases. The empirical power of the hypothesis tests and the coverage probability of the confidence intervals are presented for the simple and multiple logit models, respectively. An example is also given.
https://doi.org/10.5351/KJAS.2002.15.2.355