DOI QR코드

DOI QR Code

IVAG: An Integrative Visualization Application for Various Types of Genomic Data Based on R-Shiny and the Docker Platform

  • Lee, Tae-Rim (Department of Bioinformatics and Life Science, Soongsil University) ;
  • Ahn, Jin Mo (Department of Bioinformatics and Life Science, Soongsil University) ;
  • Kim, Gyuhee (Department of Bioinformatics and Life Science, Soongsil University) ;
  • Kim, Sangsoo (Department of Bioinformatics and Life Science, Soongsil University)
  • Received : 2017.10.30
  • Accepted : 2017.11.17
  • Published : 2017.12.31

Abstract

Next-generation sequencing (NGS) technology has become a trend in the genomics research area. There are many software programs and automated pipelines to analyze NGS data, which can ease the pain for traditional scientists who are not familiar with computer programming. However, downstream analyses, such as finding differentially expressed genes or visualizing linkage disequilibrium maps and genome-wide association study (GWAS) data, still remain a challenge. Here, we introduce a dockerized web application written in R using the Shiny platform to visualize pre-analyzed RNA sequencing and GWAS data. In addition, we have integrated a genome browser based on the JBrowse platform and an automated intermediate parsing process required for custom track construction, so that users can easily build and navigate their personal genome tracks with in-house datasets. This application will help scientists perform series of downstream analyses and obtain a more integrative understanding about various types of genomic data by interactively visualizing them with customizable options.

Keywords

References

  1. van Dijk EL, Auger H, Jaszczyszyn Y, Thermes C. Ten years of next-generation sequencing technology. Trends Genet 2014;30:418-426. https://doi.org/10.1016/j.tig.2014.07.001
  2. D'Agostino N, Tripodi P. NGS-based genotyping, high-throughput phenotyping and genome-wide association studies laid the foundations for next-generation breeding in horticultural crops. Diversity 2017;9:38. https://doi.org/10.3390/d9030038
  3. Seemann T. Ten recommendations for creating usable bioinformatics command line software. Gigascience 2013;2:15. https://doi.org/10.1186/2047-217X-2-15
  4. Goecks J, Nekrutenko A, Taylor J; Galaxy Team. Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences. Genome Biol 2010;11:R86. https://doi.org/10.1186/gb-2010-11-8-r86
  5. Korean Bioinformation Center. Bioexpress. Daejeon: Korean Bioinformation Center, 2017. Accessed 2017 Nov 1. Available from: https:// bioexpress.kobic.re.kr.
  6. R Core Team. R: a language and environment for statistical computing. Vienna: R Foundation for Statistical Computing, 2017.
  7. Merkel D. Docker: lightweight Linux containers for consistent development and deployment. Linux J 2014;2014:2.
  8. Chang W, Cheng J, Allaire JJ, Xie Y, McPherson J. shiny: Web Application Framework for R. R package version 1.0.5. The Comprehensive R Archive Network, 2017. Accessed 2017 Nov 1. Available from: https://CRAN.R-project.org/package=shiny.
  9. Danecek P, Auton A, Abecasis G, Albers CA, Banks E, DePristo MA, et al. The variant call format and VCFtools. Bioinformatics 2011;27:2156-2158. https://doi.org/10.1093/bioinformatics/btr330
  10. Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 2007;81:559-575. https://doi.org/10.1086/519795
  11. Buels R, Yao E, Diesh CM, Hayes RD, Munoz-Torres M, Helt G, et al. JBrowse: a dynamic web platform for genome visualization and analysis. Genome Biol 2016;17:66. https://doi.org/10.1186/s13059-016-0924-1
  12. Trapnell C, Williams BA, Pertea G, Mortazavi A, Kwan G, van Baren MJ, et al. Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nat Biotechnol 2010;28:511-515. https://doi.org/10.1038/nbt.1621
  13. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 2009;25:2078-2079. https://doi.org/10.1093/bioinformatics/btp352
  14. Anders S, Pyl PT, Huber W. HTSeq: a Python framework to work with high-throughput sequencing data. Bioinformatics 2015;31:166-169. https://doi.org/10.1093/bioinformatics/btu638
  15. Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 2010;26:139-140. https://doi.org/10.1093/bioinformatics/btp616
  16. Young MD, Wakefield MJ, Smyth GK, Oshlack A. Gene ontology analysis for RNA-seq: accounting for selection bias. Genome Biol 2010;11:R14. https://doi.org/10.1186/gb-2010-11-2-r14
  17. Li Y, Andrade J. DEApp: an interactive web interface for differential expression analysis of next generation sequence data. Source Code Biol Med 2017;12:2. https://doi.org/10.1186/s13029-017-0063-4
  18. Nelson JW, Sklenar J, Barnes AP, Minnier J. The START App: a web-based RNAseq analysis and visualization resource. Bioinformatics 2017;33:447-449.
  19. Cuellar-Partida G, Renteria ME, MacGregor S. LocusTrack:Integrated visualization of GWAS results and genomic annotation. Source Code Biol Med 2015;10:1. https://doi.org/10.1186/s13029-015-0032-8
  20. Ziegler GR, Hartsock RH, Baxter I. Zbrowse: an interactive GWAS results browser. PeerJ Comput Sci 2015;1:e3. https://doi.org/10.7717/peerj-cs.3