• Title/Summary/Keyword: Genome Wide Association

Search Result 336, Processing Time 0.028 seconds

Efficient strategy for the genetic analysis of related samples with a linear mixed model (선형혼합모형을 이용한 유전체 자료분석방안에 대한 연구)

  • Lim, Jeongmin;Sung, Joohon;Won, Sungho
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.5
    • /
    • pp.1025-1038
    • /
    • 2014
  • Linear mixed model has often been utilized for genetic association analysis with family-based samples. The correlation matrix for family-based samples is constructed with kinship coefficient and assumes that parental phenotypes are independent and the amount of correlations between parent and offspring is same as that of correlations between siblings. However, for instance, there are positive correlations between parental heights, which indicates that the assumption for correlation matrix is often violated. The statistical validity and power are affected by the appropriateness of assumed variance covariance matrix, and in this thesis, we provide the linear mixed model with flexible variance covariance matrix. Our results show that the proposed method is usually more efficient than existing approaches, and its application to genome-wide association study of body mass index illustrates the practical value in real data analysis.

The identification of novel regions for reproduction trait in Landrace and Large White pigs using a single step genome-wide association study

  • Suwannasing, Rattikan;Duangjinda, Monchai;Boonkum, Wuttigrai;Taharnklaew, Rutjawate;Tuangsithtanon, Komson
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.31 no.12
    • /
    • pp.1852-1862
    • /
    • 2018
  • Objective: The purpose of this study was to investigate a single step genome-wide association study (ssGWAS) for identifying genomic regions affecting reproductive traits in Landrace and Large White pigs. Methods: The traits included the number of pigs weaned per sow per year (PWSY), the number of litters per sow per year (LSY), pigs weaned per litters (PWL), born alive per litters (BAL), non-productive day (NPD) and wean to conception interval per litters (W2CL). A total of 321 animals (140 Landrace and 181 Large White pigs) were genotyped with the Illumina Porcine SNP 60k BeadChip, containing 61,177 single nucleotide polymorphisms (SNPs), while multiple traits single-step genomic BLUP method was used to calculate variances of 5 SNP windows for 11,048 Landrace and 13,985 Large White data records. Results: The outcome of ssGWAS on the reproductive traits identified twenty-five and twenty-two SNPs associated with reproductive traits in Landrace and Large White, respectively. Three known genes were identified to be candidate genes in Landrace pigs including retinol binding protein 7, and ubiquitination factor E4B genes for PWL, BAL, W2CL, and PWSY and one gene, solute carrier organic anion transporter family member 6A1, for LSY and NPD. Meanwhile, five genes were identified to be candidate genes in Large White, two of which, aldehyde dehydrogenase 1 family member A3 and leucine rich repeat kinase 1, associated with all of six reproduction traits and three genes; retrotransposon Gag like 4, transient receptor potential cation channel subfamily C member 5, and LHFPL tetraspan subfamily member 1 for five traits except W2CL. Conclusion: The genomic regions identified in this study provided a start-up point for marker assisted selection and estimating genomic breeding values for improving reproductive traits in commercial pig populations.

Mechanistic insight into the progressive retinal atrophy disease in dogs via pathway-based genome-wide association analysis

  • Sheet, Sunirmal;Krishnamoorthy, Srikanth;Park, Woncheoul;Lim, Dajeong;Park, Jong-Eun;Ko, Minjeong;Choi, Bong-Hwan
    • Journal of Animal Science and Technology
    • /
    • v.62 no.6
    • /
    • pp.765-776
    • /
    • 2020
  • The retinal degenerative disease, progressive retinal atrophy (PRA) is a major reason of vision impairment in canine population. Canine PRA signifies an inherently dissimilar category of retinal dystrophies which has solid resemblances to human retinis pigmentosa. Even though much is known about the biology of PRA, the knowledge about the intricate connection among genetic loci, genes and pathways associated to this disease in dogs are still remain unknown. Therefore, we have performed a genome wide association study (GWAS) to identify susceptibility single nucleotide polymorphisms (SNPs) of PRA. The GWAS was performed using a case-control based association analysis method on PRA dataset of 129 dogs and 135,553 markers. Further, the gene-set and pathway analysis were conducted in this study. A total of 1,114 markers associations with PRA trait at p < 0.01 were extracted and mapped to 640 unique genes, and then selected significant (p < 0.05) enriched 35 gene ontology (GO) terms and 5 Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways contain these genes. In particular, apoptosis process, homophilic cell adhesion, calcium ion binding, and endoplasmic reticulum GO terms as well as pathways related to focal adhesion, cyclic guanosine monophosphate)-protein kinase G signaling, and axon guidance were more likely associated to the PRA disease in dogs. These data could provide new insight for further research on identification of potential genes and causative pathways for PRA in dogs.

Genome wide association study for growth in Pakistani dromedary camels using genotyping-by-sequencing

  • Sajida Sabahat;Asif Nadeem;Rudiger Brauning;Peter C. Thomson;Mehar S. Khatkar
    • Animal Bioscience
    • /
    • v.36 no.7
    • /
    • pp.1010-1021
    • /
    • 2023
  • Objective: Growth performance and growth-related traits have a crucial role in livestock due to their influence on productivity. This genome-wide association study (GWAS) in Pakistani dromedary camels was conducted to identify single nucleotide polymorphisms (SNPs) associated with growth at specific camel ages, and for selected SNPs, to investigate in detail how their effects change with increasing camel age. This is the first GWAS conducted on dromedary camels in this region. Methods: Two Pakistani breeds, Marecha and Lassi, were selected for this study. A genotyping-by-sequencing method was used, and a total of 65,644 SNPs were identified. For GWAS, weight records data with several body weight traits, namely, birthweight, weaning weight, and weights of camels at 1, 2, 4, and 6 years of age were analysed by using model-based growth curve analysis. Age-specific weight data were analysed with a linear mixed model that included fixed effects of SNP genotype as well as sex. Results: Based on the q-value method for false discovery control, for Marecha camels, five SNPs at q<0.01 and 96 at q<0.05 were significantly associated with the weight traits considered, while three (q<0.01) and seven (q<0.05) SNP associations were identified for Lassi camels. Several candidate genes harbouring these SNP were discovered. Conclusion: These results will help to better understand the genetic architecture of growth including how these genes are expressed at different phases of their life. This will serve to lay the foundations for applied breeding programs of camels by allowing the genetic selection of superior animals.

Comparison of Two Meta-Analysis Methods: Inverse-Variance-Weighted Average and Weighted Sum of Z-Scores

  • Lee, Cue Hyunkyu;Cook, Seungho;Lee, Ji Sung;Han, Buhm
    • Genomics & Informatics
    • /
    • v.14 no.4
    • /
    • pp.173-180
    • /
    • 2016
  • The meta-analysis has become a widely used tool for many applications in bioinformatics, including genome-wide association studies. A commonly used approach for meta-analysis is the fixed effects model approach, for which there are two popular methods: the inverse variance-weighted average method and weighted sum of z-scores method. Although previous studies have shown that the two methods perform similarly, their characteristics and their relationship have not been thoroughly investigated. In this paper, we investigate the optimal characteristics of the two methods and show the connection between the two methods. We demonstrate that the each method is optimized for a unique goal, which gives us insight into the optimal weights for the weighted sum of z-scores method. We examine the connection between the two methods both analytically and empirically and show that their resulting statistics become equivalent under certain assumptions. Finally, we apply both methods to the Wellcome Trust Case Control Consortium data and demonstrate that the two methods can give distinct results in certain study designs.

A Scheme for Filtering SNPs Imputed in 8,842 Korean Individuals Based on the International HapMap Project Data

  • Lee, Ki-Chan;Kim, Sang-Soo
    • Genomics & Informatics
    • /
    • v.7 no.2
    • /
    • pp.136-140
    • /
    • 2009
  • Genome-wide association (GWA) studies may benefit from the inclusion of imputed SNPs into their dataset. Due to its predictive nature, the imputation process is typically not perfect. Thus, it would be desirable to develop a scheme for filtering out the imputed SNPs by maximizing the concordance with the observed genotypes. We report such a scheme, which is based on the combination of several parameters that are calculated by PLINK, a popular GWA analysis software program. We imputed the genotypes of 8,842 Korean individuals, based on approximately 2 million SNP genotypes of the CHB+JPT panel in the International HapMap Project Phase II data, complementing the 352k SNPs in the original Affymetrix 5.0 dataset. A total of 333,418 SNPs were found in both datasets, with a median concordance rate of 98.7%. The concordance rates were calculated at different ranges of parameters, such as the number of proxy SNPs (NPRX), the fraction of successfully imputed individuals (IMPUTED), and the information content (INFO). The poor concordance that was observed at the lower values of the parameters allowed us to develop an optimal combination of the cutoffs (IMPUTED${\geq}$0.9 and INFO${\geq}$0.9). A total of 1,026,596 SNPs passed the cutoff, of which 94,364 were found in both datasets and had 99.4% median concordance. This study illustrates a conservative scheme for filtering imputed SNPs that would be useful in GWA studies.

Analysis of differences in human leukocyte antigen between the two Wellcome Trust Case Control Consortium control datasets

  • Jang, Chloe Soohyun;Choi, Wanson;Cook, Seungho;Han, Buhm
    • Genomics & Informatics
    • /
    • v.17 no.3
    • /
    • pp.29.1-29.8
    • /
    • 2019
  • The Wellcome Trust Case Control Consortium (WTCCC) study was a large genome-wide association study that aimed to identify common variants associated with seven diseases. That study combined two control datasets (58C and UK Blood Services) as shared controls. Prior to using the combined controls, the WTCCC performed analyses to show that the genomic content of the control datasets was not significantly different. Recently, the analysis of human leukocyte antigen (HLA) genes has become prevalent due to the development of HLA imputation technology. In this project, we extended the between-control homogeneity analysis of the WTCCC to HLA. We imputed HLA information in the WTCCC control dataset and showed that the HLA content was not significantly different between the two control datasets, suggesting that the combined controls can be used as controls for HLA fine-mapping analysis based on HLA imputation.

In silico approach to calculate the transcript capacity

  • Lee, Young-Sup;Won, Kyung-Hye;Oh, Jae-Don;Shin, Donghyun
    • Genomics & Informatics
    • /
    • v.17 no.3
    • /
    • pp.31.1-31.7
    • /
    • 2019
  • We sought the novel concept, transcript capacity (TC) and analyzed TC. Our approach to estimate TC was through an in silico method. TC refers to the capacity that a transcript exerts in a cell as enzyme or protein function after translation. We used the genome-wide association study (GWAS) beta effect and transcription level in RNA-sequencing to estimate TC. The trait was body fat percent and the transcript reads were obtained from the human protein atlas. The assumption was that the GWAS beta effect is the gene's effect and TC was related to the corresponding gene effect and transcript reads. Further, we surveyed gene ontology (GO) in the highest TC and the lowest TC genes. The most frequent GOs with the highest TC were neuronal-related and cell projection organization related. The most frequent GOs with the lowest TC were wound-healing related and embryo development related. We expect that our analysis contributes to estimating TC in the diverse species and playing a benevolent role to the new bioinformatic analysis.