DOI QR코드

DOI QR Code

EFMDR-Fast: An Application of Empirical Fuzzy Multifactor Dimensionality Reduction for Fast Execution

  • Received : 2018.12.06
  • Accepted : 2018.12.16
  • Published : 2018.12.31

Abstract

Gene-gene interaction is a key factor for explaining missing heritability. Many methods have been proposed to identify gene-gene interactions. Multifactor dimensionality reduction (MDR) is a well-known method for the detection of gene-gene interactions by reduction from genotypes of single-nucleotide polymorphism combinations to a binary variable with a value of high risk or low risk. This method has been widely expanded to own a specific objective. Among those expansions, fuzzy-MDR uses the fuzzy set theory for the membership of high risk or low risk and increases the detection rates of gene-gene interactions. Fuzzy-MDR is expanded by a maximum likelihood estimator as a new membership function in empirical fuzzy MDR (EFMDR). However, EFMDR is relatively slow, because it is implemented by R script language. Therefore, in this study, we implemented EFMDR using RCPP ($c^{{+}{+}}$ package) for faster executions. Our implementation for faster EFMDR, called EMMDR-Fast, is about 800 times faster than EFMDR written by R script only.

Keywords

References

  1. Welter D, MacArthur J, Morales J, Burdett T, Hall P, Junkins H, et al. The NHGRI GWAS Catalog, a curated resource of SNP-trait associations. Nucleic Acids Res 2014;42:D1001-D1006. https://doi.org/10.1093/nar/gkt1229
  2. Manolio TA, Collins FS, Cox NJ, Goldstein DB, Hindorff LA, Hunter DJ, et al. Finding the missing heritability of complex diseases. Nature 2009;461:747-753. https://doi.org/10.1038/nature08494
  3. Ritchie MD, Hahn LW, Roodi N, Bailey LR, Dupont WD, Parl FF, et al. Multifactor-dimensionality reduction reveals high-order interactions among estrogen-metabolism genes in sporadic breast cancer. Am J Hum Genet 2001;69:138-147. https://doi.org/10.1086/321276
  4. Velez DR, White BC, Motsinger AA, Bush WS, Ritchie MD, Williams SM, et al. A balanced accuracy function for epistasis modeling in imbalanced datasets using multifactor dimensionality reduction. Genet Epidemiol 2007;31:306-315. https://doi.org/10.1002/gepi.20211
  5. Lou XY, Chen GB, Yan L, Ma JZ, Zhu J, Elston RC, et al. A generalized combinatorial approach for detecting gene-by-gene and gene-by-environment interactions with application to nicotine dependence. Am J Hum Genet 2007;80:1125-1137. https://doi.org/10.1086/518312
  6. Gui J, Moore JH, Kelsey KT, Marsit CJ, Karagas MR, Andrew AS. A novel survival multifactor dimensionality reduction method for detecting gene-gene interactions with application to bladder cancer prognosis. Hum Genet 2011;129:101-110. https://doi.org/10.1007/s00439-010-0905-5
  7. Lee S, Kwon MS, Oh JM, Park T. Gene-gene interaction analysis for the survival phenotype based on the Cox model. Bioinformatics 2012;28:i582-i588. https://doi.org/10.1093/bioinformatics/bts415
  8. Chung Y, Lee SY, Elston RC, Park T. Odds ratio based multifactor-dimensionality reduction method for detecting genegene interactions. Bioinformatics 2007;23:71-76. https://doi.org/10.1093/bioinformatics/btl557
  9. Oh S, Lee J, Kwon MS, Weir B, Ha K, Park T. A novel method to identify high order gene-gene interactions in genome-wide association studies: gene-based MDR. BMC Bioinformatics 2012;13 Suppl 9:S5.
  10. Huh I, Park T. Multifactor dimensionality reduction analysis of multiple binary traits for gene-gene interaction. Int J Data Min Bioinform 2016;14:293-304. https://doi.org/10.1504/IJDMB.2016.075810
  11. Kim Y, Park T. Robust gene-gene interaction analysis in genome wide association studies. PLoS One 2015;10:e0135016. https://doi.org/10.1371/journal.pone.0135016
  12. Yee J, Kwon MS, Park T, Park M. A modified entropy-based approach for identifying gene-gene interactions in case-control study. PLoS One 2013;8:e69321. https://doi.org/10.1371/journal.pone.0069321
  13. Lee SY, Chung Y, Elston RC, Kim Y, Park T. Log-linear model-based multifactor dimensionality reduction method to detect gene gene interactions. Bioinformatics 2007;23:2589-2595. https://doi.org/10.1093/bioinformatics/btm396
  14. Yu W, Lee S, Park T. A unified model based multifactor dimensionality reduction framework for detecting gene-gene interactions. Bioinformatics 2016;32:i605-i610. https://doi.org/10.1093/bioinformatics/btv638
  15. Jung HY, Leem S, Lee S, Park T. A novel fuzzy set based multifactor dimensionality reduction method for detecting gene-gene interaction. Comput Biol Chem 2016;65:193-202. https://doi.org/10.1016/j.compbiolchem.2016.09.006
  16. Jung HY, Leem S, Park T. Fuzzy set-based generalized multifactor dimensionality reduction analysis of gene-gene interactions. BMC Med Genomics 2018;11(Suppl 2):32. https://doi.org/10.1186/s12920-018-0343-0
  17. Leem S, Park T. An empirical fuzzy multifactor dimensionality reduction method for detecting gene-gene interactions. BMC Genomics 2017;18(Suppl 2):115. https://doi.org/10.1186/s12864-017-3496-x
  18. Eddelbuettel D, Francois R. Rcpp: Seamless R and C++ Integratio. J Stat Softw 2011;40:1-18.