DOI QR코드

DOI QR Code

Effects of Single Nucleotide Polymorphism Marker Density on Haplotype Block Partition

  • Kim, Sun Ah (Department of Mathematics Education, Seoul National University) ;
  • Yoo, Yun Joo (Department of Mathematics Education, Seoul National University)
  • Received : 2016.09.08
  • Accepted : 2016.12.06
  • Published : 2016.12.31

Abstract

Many researchers have found that one of the most important characteristics of the structure of linkage disequilibrium is that the human genome can be divided into non-overlapping block partitions in which only a small number of haplotypes are observed. The location and distribution of haplotype blocks can be seen as a population property influenced by population genetic events such as selection, mutation, recombination and population structure. In this study, we investigate the effects of the density of markers relative to the full set of all polymorphisms in the region on the results of haplotype partitioning for five popular haplotype block partition methods: three methods in Haploview (confidence interval, four gamete test, and solid spine), MIG++ implemented in PLINK 1.9 and S-MIG++. We used several experimental datasets obtained by sampling subsets of single nucleotide polymorphism (SNP) markers of chromosome 22 region in the 1000 Genomes Project data and also the HapMap phase 3 data to compare the results of haplotype block partitions by five methods. With decreasing sampling ratio down to 20% of the original SNP markers, the total number of haplotype blocks decreases and the length of haplotype blocks increases for all algorithms. When we examined the marker-independence of the haplotype block locations constructed from the datasets of different density, the results using below 50% of the entire SNP markers were very different from the results using the entire SNP markers. We conclude that the haplotype block construction results should be used and interpreted carefully depending on the selection of markers and the purpose of the study.

Keywords

References

  1. Slatkin M. Linkage disequilibrium: understanding the evolutionary past and mapping the medical future. Nat Rev Genet 2008;9:477-485.
  2. Reich DE, Cargill M, Bolk S, Ireland J, Sabeti PC, Richter DJ, et al. Linkage disequilibrium in the human genome. Nature 2001;411:199-204. https://doi.org/10.1038/35075590
  3. Daly MJ, Rioux JD, Schaffner SF, Hudson TJ, Lander ES. High-resolution haplotype structure in the human genome. Nat Genet 2001;29:229-232. https://doi.org/10.1038/ng1001-229
  4. Sabeti PC, Varilly P, Fry B, Lohmueller J, Hostetter E, Cotsapas C, et al. Genome-wide detection and characterization of positive selection in human populations. Nature 2007;449:913-918. https://doi.org/10.1038/nature06250
  5. Greenspan G, Geiger D. Model-based inference of haplotype block variation. J Comput Biol 2004;11:493-504. https://doi.org/10.1089/1066527041410300
  6. Jeffreys AJ, Kauppi L, Neumann R. Intensely punctate meiotic recombination in the class II region of the major histocompatibility complex. Nat Genet 2001;29:217-222. https://doi.org/10.1038/ng1001-217
  7. Twells RC, Mein CA, Phillips MS, Hess JF, Veijola R, Gilbey M, et al. Haplotype structure, LD blocks, and uneven recombination within the LRP5 gene. Genome Res 2003;13:845-855. https://doi.org/10.1101/gr.563703
  8. Gabriel SB, Schaffner SF, Nguyen H, Moore JM, Roy J, Blumenstiel B, et al. The structure of haplotype blocks in the human genome. Science 2002;296:2225-2229. https://doi.org/10.1126/science.1069424
  9. Wang N, Akey JM, Zhang K, Chakraborty R, Jin L. Distribution of recombination crossovers and the origin of haplotype blocks: the interplay of population history, recombination, and mutation. Am J Hum Genet 2002;71:1227-1234. https://doi.org/10.1086/344398
  10. Barrett JC, Fry B, Maller J, Daly MJ. Haploview: analysis and visualization of LD and haplotype maps. Bioinformatics 2005; 21:263-265. https://doi.org/10.1093/bioinformatics/bth457
  11. Ardlie KG, Kruglyak L, Seielstad M. Patterns of linkage disequilibrium in the human genome. Nat Rev Genet 2002;3: 299-309. https://doi.org/10.1038/nrg777
  12. Taliun D, Gamper J, Pattaro C. Efficient haplotype block recognition of very long and dense genetic sequences. BMC Bioinformatics 2014;15:10. https://doi.org/10.1186/1471-2105-15-10
  13. Taliun D, Gamper J, Leser U, Pattaro C. Fast sampling-based whole-genome haplotype block recognition. IEEE/ACM Trans Comput Biol Bioinform 2016;13:315-325. https://doi.org/10.1109/TCBB.2015.2456897
  14. Chang CC, Chow CC, Tellier LC, Vattikuti S, Purcell SM, Lee JJ. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 2015;4:7. https://doi.org/10.1186/s13742-015-0047-8
  15. 1000 Genomes Project Consortium, Abecasis GR, Auton A, Brooks LD, DePristo MA, Durbin RM, et al. An integrated map of genetic variation from 1,092 human genomes. Nature 2012; 491:56-65. https://doi.org/10.1038/nature11632
  16. International HapMap 3 Consortium, Altshuler DM, Gibbs RA, Peltonen L, Altshuler DM, Gibbs RA, et al. Integrating common and rare genetic variation in diverse human populations. Nature 2010;467:52-58. https://doi.org/10.1038/nature09298
  17. Lewontin RC. The interaction of selection and linkage. I. General considerations: heterotic models. Genetics 1964;49: 49-67.
  18. Zapata C, Alvarez G, Carollo C. Approximate variance of the standardized measure of gametic disequilibrium D'. Am J Hum Genet 1997;61:771-774. https://doi.org/10.1016/S0002-9297(07)64342-0
  19. Wall JD, Pritchard JK. Assessing the performance of the haplotype block model of linkage disequilibrium. Am J Hum Genet 2003;73:502-515. https://doi.org/10.1086/378099
  20. Hill WG. Estimation of linkage disequilibrium in randomly mating populations. Heredity (Edinb) 1974;33:229-239. https://doi.org/10.1038/hdy.1974.89
  21. Gaunt TR, Rodriguez S, Day IN. Cubic exact solutions for the estimation of pairwise haplotype frequencies: implications for linkage disequilibrium analyses and a web tool 'CubeX'. BMC Bioinformatics 2007;8:428. https://doi.org/10.1186/1471-2105-8-428