• Title/Summary/Keyword: genome wide association

Search Result 331, Processing Time 0.026 seconds

Bayesian bi-level variable selection for genome-wide survival study

  • Eunjee Lee;Joseph G. Ibrahim;Hongtu Zhu
    • Genomics & Informatics
    • /
    • v.21 no.3
    • /
    • pp.28.1-28.13
    • /
    • 2023
  • Mild cognitive impairment (MCI) is a clinical syndrome characterized by the onset and evolution of cognitive impairments, often considered a transitional stage to Alzheimer's disease (AD). The genetic traits of MCI patients who experience a rapid progression to AD can enhance early diagnosis capabilities and facilitate drug discovery for AD. While a genome-wide association study (GWAS) is a standard tool for identifying single nucleotide polymorphisms (SNPs) related to a disease, it fails to detect SNPs with small effect sizes due to stringent control for multiple testing. Additionally, the method does not consider the group structures of SNPs, such as genes or linkage disequilibrium blocks, which can provide valuable insights into the genetic architecture. To address the limitations, we propose a Bayesian bi-level variable selection method that detects SNPs associated with time of conversion from MCI to AD. Our approach integrates group inclusion indicators into an accelerated failure time model to identify important SNP groups. Additionally, we employ data augmentation techniques to impute censored time values using a predictive posterior. We adapt Dirichlet-Laplace shrinkage priors to incorporate the group structure for SNP-level variable selection. In the simulation study, our method outperformed other competing methods regarding variable selection. The analysis of Alzheimer's Disease Neuroimaging Initiative (ADNI) data revealed several genes directly or indirectly related to AD, whereas a classical GWAS did not identify any significant SNPs.

IVAG: An Integrative Visualization Application for Various Types of Genomic Data Based on R-Shiny and the Docker Platform

  • Lee, Tae-Rim;Ahn, Jin Mo;Kim, Gyuhee;Kim, Sangsoo
    • Genomics & Informatics
    • /
    • v.15 no.4
    • /
    • pp.178-182
    • /
    • 2017
  • Next-generation sequencing (NGS) technology has become a trend in the genomics research area. There are many software programs and automated pipelines to analyze NGS data, which can ease the pain for traditional scientists who are not familiar with computer programming. However, downstream analyses, such as finding differentially expressed genes or visualizing linkage disequilibrium maps and genome-wide association study (GWAS) data, still remain a challenge. Here, we introduce a dockerized web application written in R using the Shiny platform to visualize pre-analyzed RNA sequencing and GWAS data. In addition, we have integrated a genome browser based on the JBrowse platform and an automated intermediate parsing process required for custom track construction, so that users can easily build and navigate their personal genome tracks with in-house datasets. This application will help scientists perform series of downstream analyses and obtain a more integrative understanding about various types of genomic data by interactively visualizing them with customizable options.

Construction of an Analysis System Using Digital Breeding Technology for the Selection of Capsicum annuum

  • Donghyun Jeon;Sehyun Choi;Yuna Kang;Changsoo Kim
    • Proceedings of the Korean Society of Crop Science Conference
    • /
    • 2022.10a
    • /
    • pp.233-233
    • /
    • 2022
  • As the world's population grows and food needs diversify, the demand for horticultural crops for beneficial traits is increasing. In order to meet this demand, it is necessary to develop suitable cultivars and breeding methods accordingly. Breeding methods have changed over time. With the recent development of sequencing technology, the concept of genomic selection (GS) has emerged as large-scale genome information can be used. GS shows good predictive ability even for quantitative traits by using various markers, breaking away from the limitations of Marker Assisted Selection (MAS). Moreover, GS using machine learning (ML) and deep learning (DL) has been studied recently. In this study, we aim to build a system that selects phenotype-related markers using the genomic information of the pepper population and trains a genomic selection model to select individuals from the validation population. We plan to establish an optimal genome wide association analysis model by comparing and analyzing five models. Validation of molecular markers by applying linkage markers discovered through genome wide association analysis to breeding populations. Finally, we plan to establish an optimal genome selection model by comparing and analyzing 12 genome selection models. Then We will use the genome selection model of the learning group in the breeding group to verify the prediction accuracy and discover a prediction model.

  • PDF

Genome-wide association study identifies 22 new loci for body dimension and body weight traits in a White Duroc×Erhualian F2 intercross population

  • Ji, Jiuxiu;Zhou, Lisheng;Guo, Yuanmei;Huang, Lusheng;Ma, Junwu
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.30 no.8
    • /
    • pp.1066-1073
    • /
    • 2017
  • Objective: Growth-related traits are important economic traits in the swine industry. However, the genetic mechanism of growth-related traits is little known. The aim of this study was to screen the candidate genes and molecular markers associated with body dimension and body weight traits in pigs. Methods: A genome-wide association study (GWAS) on body dimension and body weight traits was performed in a White $Duroc{\times}Erhualian$ $F_2$ intercross by the illumina PorcineSNP60K Beadchip. A mixed linear model was used to assess the association between single nucleotide polymorphisms (SNPs) and the phenotypes. Results: In total, 611 and 79 SNPs were identified significantly associated with body dimension traits and body weight respectively. All SNPs but 62 were located into 23 genomic regions (quantitative trait loci, QTLs) on 14 autosomal and X chromosomes in Sus scrofa Build 10.2 assembly. Out of the 23 QTLs with the suggestive significance level ($5{\times}10^{-4}$), three QTLs exceeded the genome-wide significance threshold ($1.15{\times}10^{-6}$). Except the one on Sus scrofa chromosome (SSC) 7 which was reported previously all the QTLs are novel. In addition, we identified 5 promising candidate genes, including cell division cycle 7 for abdominal circumference, pleiomorphic adenoma gene 1 and neuropeptides B/W receptor 1 for both body weight and cannon bone circumference on SSC4, phosphoenolpyruvate carboxykinase 1, and bone morphogenetic protein 7 for hip circumference on SSC17. Conclusion: The results have not only demonstrated a number of potential genes/loci associated with the growth-related traits in pigs, but also laid a foundation for studying the genes' role and further identifying causative variants underlying these loci.

Prediction of Quantitative Traits Using Common Genetic Variants: Application to Body Mass Index

  • Bae, Sunghwan;Choi, Sungkyoung;Kim, Sung Min;Park, Taesung
    • Genomics & Informatics
    • /
    • v.14 no.4
    • /
    • pp.149-159
    • /
    • 2016
  • With the success of the genome-wide association studies (GWASs), many candidate loci for complex human diseases have been reported in the GWAS catalog. Recently, many disease prediction models based on penalized regression or statistical learning methods were proposed using candidate causal variants from significant single-nucleotide polymorphisms of GWASs. However, there have been only a few systematic studies comparing existing methods. In this study, we first constructed risk prediction models, such as stepwise linear regression (SLR), least absolute shrinkage and selection operator (LASSO), and Elastic-Net (EN), using a GWAS chip and GWAS catalog. We then compared the prediction accuracy by calculating the mean square error (MSE) value on data from the Korea Association Resource (KARE) with body mass index. Our results show that SLR provides a smaller MSE value than the other methods, while the numbers of selected variables in each model were similar.

Relevance Epistasis Network of Gastritis for Intra-chromosomes in the Korea Associated Resource (KARE) Cohort Study

  • Jeong, Hyun-hwan;Sohn, Kyung-Ah
    • Genomics & Informatics
    • /
    • v.12 no.4
    • /
    • pp.216-224
    • /
    • 2014
  • Gastritis is a common but a serious disease with a potential risk of developing carcinoma. Helicobacter pylori infection is reported as the most common cause of gastritis, but other genetic and genomic factors exist, especially single-nucleotide polymorphisms (SNPs). Association studies between SNPs and gastritis disease are important, but results on epistatic interactions from multiple SNPs are rarely found in previous genome-wide association (GWA) studies. In this study, we performed computational GWA case-control studies for gastritis in Korea Associated Resource (KARE) data. By transforming the resulting SNP epistasis network into a gene-gene epistasis network, we also identified potential gene-gene interaction factors that affect the susceptibility to gastritis.

Genome-wide Association Study of Chicken Plumage Pigmentation

  • Park, Mi Na;Choi, Jin Ae;Lee, Kyung-Tai;Lee, Hyun-Jeong;Choi, Bong-Hwan;Kim, Heebal;Kim, Tae-Hun;Cho, Seoae;Lee, Taeheon
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.26 no.11
    • /
    • pp.1523-1528
    • /
    • 2013
  • To increase plumage color uniformity and understand the genetic background of Korean chickens, we performed a genome-wide association study of different plumage color in Korean native chickens. We analyzed 60K SNP chips on 279 chickens with GEMMA methods for GWAS and estimated the genetic heritability for plumage color. The estimated heritability suggests that plumage coloration is a polygenic trait. We found new loci associated with feather pigmentation at the genome-wide level and from the results infer that there are additional genetic effect for plumage color. The results will be used for selecting and breeding chicken for plumage color uniformity.

Novel Genetic Variants Associated with Lumbar Spondylosis in Koreans : A Genome-Wide Association Study

  • Kim, Hyun Ah;Heo, Seong Gu;Park, Ji Wan;Jung, Young Ok
    • Journal of Korean Neurosurgical Society
    • /
    • v.61 no.1
    • /
    • pp.66-74
    • /
    • 2018
  • Objective : The aim of this study was to identify the susceptibility genes responsible for lumbar spondylosis (LS) in Korean patients. Methods : Data from 1427 subjects were made available for radiographic grading and genome wide association studies (GWAS) analysis. Lateral lumbar spine radiographs were obtained and the various degrees of degenerative change were semi-quantitatively scored. A pilot GWAS was performed using the AffymetrixGenome-Wide Human single-nucleotide polymorphisms (SNPs), 500K array. A total of 352228 SNPs were analyzed and the association between the SNPs and case-control status was analyzed by stepwise logistic regression analyses. Results : The top 100 SNPs with a cutoff p-value of less than $3.7{\times}10^{-4}$ were selected for joint space narrowing, while a cutoff p-value of $6.0{\times}10^{-4}$ was applied to osteophytes and the Kellgren-Lawrence (K-L) osteoarthritis grade. The SNPs with the strongest effect on disc space narrowing, osteophytes, and K-L grade were serine incorporator 1 (rs155467, odds ratio [OR]=17.58, $p=1.6{\times}10^{-4}$), stromal interaction molecule 2 (STIM1, rs210781, OR=5.53, $p=5{\times}10^{-4}$), and transient receptor potential cation channel, subfamily C (rs11224760, OR=3.99, $p=4.8{\times}10^{-4}$), respectively. Leucine-rich repeat-containing G protein-coupled receptor 4 was significantly associated with both disc space narrowing and osteophytes (rs1979400, OR=2.01, $p=1.1{\times}10^{-4}$ for disc space narrowing, OR=1.79, $p=3{\times}10^{-4}$ for osteophytes), while zinc finger and BTB domain containing 7C was significantly and negatively associated with both osteophytes and a K-L grade >2 (rs12457004,OR=0.25, $p=5.8{\times}10^{-4}$ and OR=0.27, $p=5.3{\times}10^{-4}$, respectively). Conclusion : We identified SNPs that potentially contribute to the pathogenesis of LS. This is the first report of a GWAS in an Asian population.

Genome Wide Association Studies Using Multiple-lactation Breeding Value in Holsteins

  • Cho, Kwang-Hyun;Oh, Jae-Don;Kim, Hee-Bal;Park, Kyung-Do;Lee, Joon-Ho
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.28 no.3
    • /
    • pp.328-333
    • /
    • 2015
  • A genome wide association study was conducted using estimated breeding value (EBV) for milk production traits from 1st to 4th lactation. Significant single nucleotide polymorphism (SNP) markers were selected for each trait and the differences were compared by lactation. DNA samples were taken from 456 animals with EBV which are Holstein proven bulls whose semen is being sold or the daughters of old proven bulls whose semen is no longer being sold in Korea. High density genome wide SNP genotype was investigated and the significance of markers associated with traits was tested using the breeding value estimated by a multiple lactation model as a dependent variant. As the result of significance comparisons by lactations, several differences were found between the first lactation and subsequent lactations (from second to 4th lactation). A similar trend was noted in mean deviation and correlation of the estimated effects by lactation. Since there was a difference in the genes associated with EBV for each trait between first and subsequent lactations, a multi-lactation model in which lactation is considered as a different trait is genetically useful. Also, significant markers in all lactations and common markers for different traits were detected, which can be used as markers for quantitative trait loci exploration and marker assisted selection in milk production traits.

Genome-wide association study for intramuscular fat content in Chinese Lulai black pigs

  • Wang, Yanping;Ning, Chao;Wang, Cheng;Guo, Jianfeng;Wang, Jiying;Wu, Ying
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.32 no.5
    • /
    • pp.607-613
    • /
    • 2019
  • Objective: Intramuscular fat (IMF) content plays an important role in meat quality. Identification of single nucleotide polymorphisms (SNPs) and genes related to pig IMF, especially using pig populations with high IMF content variation, can help to establish novel molecular breeding tools for optimizing IMF in pork and unveil the mechanisms that underlie fat metabolism. Methods: We collected muscle samples of 453 Chinese Lulai black pigs, measured IMF content by Soxhlet petroleum-ether extraction method, and genotyped genome-wide SNPs using GeneSeek Genomic Profiler Porcine HD BeadChip. Then a genome-wide association study was performed using a linear mixed model implemented in the GEMMA software. Results: A total of 43 SNPs were identified to be significantly associated with IMF content by the cutoff p<0.001. Among these significant SNPs, the greatest number of SNPs (n = 19) were detected on Chr.9, and two linkage disequilibrium blocks were formed among them. Additionally, 17 significant SNPs are mapped to previously reported quantitative trait loci (QTLs) of IMF and confirmed previous QTLs studies. Forty-two annotated genes centering these significant SNPs were obtained from Ensembl database. Overrepresentation test of pathways and gene ontology (GO) terms revealed some enriched reactome pathways and GO terms, which mainly involved regulation of basic material transport, energy metabolic process and signaling pathway. Conclusion: These findings improve our understanding of the genetic architecture of IMF content in pork and facilitate the follow-up study of fine-mapping genes that influence fat deposition in muscle.