DOI QR코드

DOI QR Code

Prediction of Genes Related to Positive Selection Using Whole-Genome Resequencing in Three Commercial Pig Breeds

  • Kim, HyoYoung (Department of Agricultural Biotechnology, Seoul National University) ;
  • Caetano-Anolles, Kelsey (Department of Animal Sciences, University of Illinois) ;
  • Seo, Minseok (Interdisciplinary Program in Bioinformatics, Seoul National University) ;
  • Kwon, Young-jun (Interdisciplinary Program in Bioinformatics, Seoul National University) ;
  • Cho, Seoae (C&K Genomics Inc., Seoul National University Research Park) ;
  • Seo, Kangseok (Department of Animal Science and Technology, College of Life Science and Natural Resources, Sunchon National University) ;
  • Kim, Heebal (Department of Agricultural Biotechnology, Seoul National University)
  • Received : 2015.07.29
  • Accepted : 2015.11.21
  • Published : 2015.12.31

Abstract

Selective sweep can cause genetic differentiation across populations, which allows for the identification of possible causative regions/genes underlying important traits. The pig has experienced a long history of allele frequency changes through artificial selection in the domestication process. We obtained an average of 329,482,871 sequence reads for 24 pigs from three pig breeds: Yorkshire (n = 5), Landrace (n = 13), and Duroc (n = 6). An average read depth of 11.7 was obtained using whole-genome resequencing on an Illumina HiSeq2000 platform. In this study, cross-population extended haplotype homozygosity and cross-population composite likelihood ratio tests were implemented to detect genes experiencing positive selection for the genome-wide resequencing data generated from three commercial pig breeds. In our results, 26, 7, and 14 genes from Yorkshire, Landrace, and Duroc, respectively were detected by two kinds of statistical tests. Significant evidence for positive selection was identified on genes ST6GALNAC2 and EPHX1 in Yorkshire, PARK2 in Landrace, and BMP6, SLA-DQA1, and PRKG1 in Duroc. These genes are reportedly relevant to lactation, reproduction, meat quality, and growth traits. To understand how these single nucleotide polymorphisms (SNPs) related positive selection affect protein function, we analyzed the effect of non-synonymous SNPs. Three SNPs (rs324509622, rs80931851, and rs80937718) in the SLA-DQA1 gene were significant in the enrichment tests, indicating strong evidence for positive selection in Duroc. Our analyses identified genes under positive selection for lactation, reproduction, and meat-quality and growth traits in Yorkshire, Landrace, and Duroc, respectively.

Keywords

References

  1. Hafez ES. Adaptation of Domestic Animals. Philadelphia: Lea & Febiger, 1968.
  2. Larson G, Albarella U, Dobney K, Rowley-Conwy P, Schibler J, Tresset A, et al. Ancient DNA, pig domestication, and the spread of the Neolithic into Europe. Proc Natl Acad Sci USA 2007;104:15276-15281. https://doi.org/10.1073/pnas.0703411104
  3. Williams NH, Stahly TS, Zimmerman DR. Effect of chronic immune system activation on the rate, efficiency, and composition of growth and lysine needs of pigs fed from 6 to 27 kg. J Anim Sci 1997;75:2463-2471. https://doi.org/10.2527/1997.7592463x
  4. Wilkinson S, Lu ZH, Megens HJ, Archibald AL, Haley C, Jackson IJ, et al. Signatures of diversifying selection in European pig breeds. PLoS Genet 2013;9:e1003453. https://doi.org/10.1371/journal.pgen.1003453
  5. Church PC, Goscinski A, Lefevre C. EXP-PAC: providing comparative analysis and storage of next generation gene expression data. Genomics 2012;100:8-13. https://doi.org/10.1016/j.ygeno.2012.05.007
  6. Nakamura M, Tomita A, Nakatani H, Matsuda T, Nadano D. Antioxidant and antibacterial genes are upregulated in early involution of the mouse mammary gland: sharp increase of ceruloplasmin and lactoferrin in accumulating breast milk. DNA Cell Biol 2006;25:491-500. https://doi.org/10.1089/dna.2006.25.491
  7. Krege JH, Hodgin JB, Couse JF, Enmark E, Warner M, Mahler JF, et al. Generation and reproductive phenotypes of mice lacking estrogen receptor beta. Proc Natl Acad Sci USA 1998; 95:15677-15682. https://doi.org/10.1073/pnas.95.26.15677
  8. Fonseca S, Wilsons IJ, Horgan GW, Maltin CA. Slow fiber cluster pattern in pig longissimus thoracis muscle: implications for myogenesis. J Anim Sci 2003;81:973-983. https://doi.org/10.2527/2003.814973x
  9. Chen H, Patterson N, Reich D. Population differentiation as a test for selective sweeps. Genome Res 2010;20:393-402. https://doi.org/10.1101/gr.100545.109
  10. Sabeti PC, Reich DE, Higgins JM, Levine HZ, Richter DJ, Schaffner SF, et al. Detecting recent positive selection in the human genome from haplotype structure. Nature 2002; 419:832-837. https://doi.org/10.1038/nature01140
  11. Szpiech ZA, Hernandez RD. selscan: an efficient multithreaded program to perform EHH-based scans for positive selection. Mol Biol Evol 2014;31:2824-2827. https://doi.org/10.1093/molbev/msu211
  12. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 2009;25:2078-2079. https://doi.org/10.1093/bioinformatics/btp352
  13. McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res 2010;20:1297-1303. https://doi.org/10.1101/gr.107524.110
  14. Danecek P, Auton A, Abecasis G, Albers CA, Banks E, DePristo MA, et al. The variant call format and VCFtools. Bioinformatics 2011;27:2156-2158. https://doi.org/10.1093/bioinformatics/btr330
  15. Browning BL, Browning SR. A unified approach to genotype imputation and haplotype-phase inference for large data sets of trios and unrelated individuals. Am J Hum Genet 2009; 84:210-223. https://doi.org/10.1016/j.ajhg.2009.01.005
  16. Park W, Kim J, Kim HJ, Choi J, Park JW, Cho HW, et al. Investigation of de novo unique differentially expressed genes related to evolution in exercise response during domestication in Thoroughbred race horses. PLoS One 2014;9: e91418. https://doi.org/10.1371/journal.pone.0091418
  17. Harris MA, Clark J, Ireland A, Lomax J, Ashburner M, Foulger R, et al. The Gene Ontology (GO) database and informatics resource. Nucleic Acids Res 2004;32:D258-D261. https://doi.org/10.1093/nar/gkh036
  18. Kanehisa M, Goto S, Kawashima S, Nakaya A. The KEGG databases at GenomeNet. Nucleic Acids Res 2002;30:42-46. https://doi.org/10.1093/nar/30.1.42
  19. Dennis G Jr, Sherman BT, Hosack DA, Yang J, Gao W, Lane HC, et al. DAVID: Database for annotation, visualization, and integrated discovery. Genome Biol 2003;4:P3. https://doi.org/10.1186/gb-2003-4-5-p3
  20. Reumers J, Schymkowitz J, Ferkinghoff-Borg J, Stricher F, Serrano L, Rousseau F. SNPeffect: a database mapping molecular phenotypic effects of human non-synonymous coding SNPs. Nucleic Acids Res 2005;33:D527-D532.
  21. Pareek CS. Emerging trends on functional genomics in farm animals. In: Advances in Applied Biotechnology (Parihar P, ed.). Jodhpur: Agrobios, 2010. pp. 23-34.
  22. Lee T, Shin DH, Cho S, Kang HS, Kim SH, Lee HK, et al. Genome-wide association study of integrated meat quality-related traits of the duroc pig breed. Asian-Australas J Anim Sci 2014;27:303-309. https://doi.org/10.5713/ajas.2013.13385
  23. Ma Y, Zhang H, Zhang Q, Ding X. Identification of selection footprints on the X chromosome in pig. PLoS One 2014;9:e94911. https://doi.org/10.1371/journal.pone.0094911