DOI QR코드

DOI QR Code

Robust inference with order constraint in microarray study

  • Kang, Joonsung (Department of Information Statistics, Gangneung-Wonju national University)
  • 투고 : 2018.07.17
  • 심사 : 2018.08.30
  • 발행 : 2018.09.30

초록

Gene classification can involve complex order-restricted inference. Examining gene expression pattern across groups with order-restriction makes standard statistical inference ineffective and thus, requires different methods. For this problem, Roy's union-intersection principle has some merit. The M-estimator adjusting for outlier arrays in a microarray study produces a robust test statistic with distribution-insensitive clustering of genes. The M-estimator in conjunction with a union-intersection principle provides a nonstandard robust procedure. By exact permutation distribution theory, a conditionally distribution-free test based on the proposed test statistic generates corresponding p-values in a small sample size setup. We apply a false discovery rate (FDR) as a multiple testing procedure to p-values in simulated data and real microarray data. FDR procedure for proposed test statistics controls the FDR at all levels of ${\alpha}$ and ${\pi}_0$ (the proportion of true null); however, the FDR procedure for test statistics based upon normal theory (ANOVA) fails to control FDR.

키워드

참고문헌

  1. Benjamini Y and Hochberg Y (1995). Controlling the false discovery rate: a practical and powerful approach to multiple testing, Journal of the Royal Statistical Society: Series B, 57, 289-300.
  2. Choi D, Choi H, and Park C (2016). Classification of ratings in online reviews, Journal of the Korean Data and Information Science Society, 27, 845-854. https://doi.org/10.7465/jkdi.2016.27.4.845
  3. Dudoit S, Shaffer JP, and Boldrick JC (2003). Multiple hypothesis testing in microarray experiments, Statistical Science, 18, 71-103. https://doi.org/10.1214/ss/1056397487
  4. Hajek J (1965). Extension of the Kolmogorov-Smirnov test to regression alternatives. In Proceedings of Bernoulli-Bayes-Laplace Seminar (L. LeCam, ed.), 45-60.
  5. Huber PJ (1981). Robust Statistics, John Wiley & Sons, New York.
  6. Iyer VR, Eisen MB, Ross DT, et al. (1999). The transcriptional program in the response of human fibroblasts to serum, Sciences, 283, 83-87. https://doi.org/10.1126/science.283.5398.83
  7. Jang E, Choi S, and Kim D (2018). Robust Bayesian beta regression analysis, Journal of the Korean Data and Information Science Society, 29, 27-36. https://doi.org/10.7465/jkdi.2018.29.1.27
  8. Jureckova J and Sen PK (1996). Robust Statistical Procedures, Asymptotics and Interrelations, Wiley, New York.
  9. Kim G and Park C (2015). Analysis of English abstracts in journal of the Korean data and information science society using topic models and social network analysis, Journal of the Korean Data and Information Science Society, 26, 151-159. https://doi.org/10.7465/jkdi.2015.26.1.151
  10. Lim Y (2018). M-estimation of the long-memory parameter by Laplace periodogram, Journal of the Korean Data and Information Science Society, 29, 523-532. https://doi.org/10.7465/jkdi.2018.29.2.523
  11. Peddada SD, Lobenhofer EK, Li L, Afshari CA, Weinberg CR, and Umbach DM (2003). Gene selection and clustering for time-course and dose-response microarray experiments using order-restricted inference, Bioinformatics, 19, 834-841. https://doi.org/10.1093/bioinformatics/btg093
  12. Robertson T, Wright FT, and Dykstra RL (1988). Order Restricted Statistical Inference, Wiley series in probability and Statistics, New York.
  13. Roy SN (1953). On a heuristic method of test construction and its use in multivariate analysis, Annals of Mathematical Statistics, 24, 220-238. https://doi.org/10.1214/aoms/1177729029
  14. Sarkar SK (2006). False discovery and false nondiscovery rates in single-step multiple testing procedures, The Annals of Statistics, 34, 394-415. https://doi.org/10.1214/009053605000000778
  15. Son N and Kim M (2017). A study on robust regression estimators in heteroscedastic error models, Journal of the Korean Data and Information Science Society, 28, 1191-1204.
  16. Storey JD (2002). A direct approach to false discovery rates, Journal of the Royal Statistical Society: Series B, 64, 479-498. https://doi.org/10.1111/1467-9868.00346