• Title/Summary/Keyword: Genome Wide Association

Search Result 336, Processing Time 0.022 seconds

Design of a Fast Algorithm for Computing Contingency Tables that are Used to Construct Epistasis Networks of SNPs (단일염기다형성 상위성 네트워크를 구성하기 위한 분할표를 생성하는 빠른 알고리즘의 설계)

  • Wang, Sehee;Wee, Kyubum
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2016.07a
    • /
    • pp.21-24
    • /
    • 2016
  • 전장유전체 연관성 연구에서 상위성 탐색은 많은 단일염기다형성 수로 인해 계산이 어렵기 때문에 네트워크에서의 탐색을 이용한 방법이 사용되고 있다. 그러나 전장유전체 연관성 연구에서 단일염기다형성들의 상위성 네트워크의 구성 역시 큰 계산 비용을 필요로 한다. 본 논문에서는 단일염기다형성과 표현형의 상호정보량을 이용한 네트워크를 구성하는데 드는 시간을 줄이는 알고리즘을 제안한다. 또한 표본 크기별로 계산 시간을 실험해 보았으며, 기존의 방법과 비교해 실행 속도가 향상됨을 보였다.

  • PDF

The Genetics and Pathogenesis of Inflammatory Bowel Disease (염증성 장질환의 유전학과 병인론)

  • Ko, Jae Sung
    • Pediatric Gastroenterology, Hepatology & Nutrition
    • /
    • v.11 no.sup2
    • /
    • pp.59-66
    • /
    • 2008
  • Genome-wide association studies using large case-control samples and several hundred thousand genetic markers efficiently and powerfully assay common genetic variations. The application of these studies to inflammatory bowel disease has led to the identification of susceptibility genes and affirmed the importance of innate and adaptive immunity in the pathogenesis of disease. Efforts directed towards the identification of environmental factors have implicated commensal bacteria as determinants of dysregulated immunity and inflammatory bowel disease. Host genetic polymorphisms most likely interact with functional bacterial changes to stimulate aggressive immune responses that lead to chronic tissue injury.

  • PDF

Multi-omics integration strategies for animal epigenetic studies - A review

  • Kim, Do-Young;Kim, Jun-Mo
    • Animal Bioscience
    • /
    • v.34 no.8
    • /
    • pp.1271-1282
    • /
    • 2021
  • Genome-wide studies provide considerable insights into the genetic background of animals; however, the inheritance of several heritable factors cannot be elucidated. Epigenetics explains these heritabilities, including those of genes influenced by environmental factors. Knowledge of the mechanisms underlying epigenetics enables understanding the processes of gene regulation through interactions with the environment. Recently developed next-generation sequencing (NGS) technologies help understand the interactional changes in epigenetic mechanisms. There are large sets of NGS data available; however, the integrative data analysis approaches still have limitations with regard to reliably interpreting the epigenetic changes. This review focuses on the epigenetic mechanisms and profiling methods and multi-omics integration methods that can provide comprehensive biological insights in animal genetic studies.

StrokeBase: A Database of Cerebrovascular Disease-related Candidate Genes

  • Kim, Young-Uk;Kim, Il-Hyun;Bang, Ok-Sun;Kim, Young-Joo
    • Genomics & Informatics
    • /
    • v.6 no.3
    • /
    • pp.153-156
    • /
    • 2008
  • Complex diseases such as stroke and cancer have two or more genetic loci and are affected by environmental factors that contribute to the diseases. Due to the complex characteristics of these diseases, identifying candidate genes requires a system-level analysis of the following: gene ontology, pathway, and interactions. A database and user interface, termed StrokeBase, was developed; StrokeBase provides queries that search for pathways, candidate genes, candidate SNPs, and gene networks. The database was developed by using in silico data mining of HGNC, ENSEMBL, STRING, RefSeq, UCSC, GO, HPRD, KEGG, GAD, and OMIM. Forty candidate genes that are associated with cerebrovascular disease were selected by human experts and public databases. The networked cerebrovascular disease gene maps also were developed; these maps describe genegene interactions and biological pathways. We identified 1127 genes, related indirectly to cerebrovascular disease but directly to the etiology of cerebrovascular disease. We found that a protein-protein interaction (PPI) network that was associated with cerebrovascular disease follows the power-law degree distribution that is evident in other biological networks. Not only was in silico data mining utilized, but also 250K Affymetrix SNP chips were utilized in the 320 control/disease association study to generate associated markers that were pertinent to the cerebrovascular disease as a genome-wide search. The associated genes and the genes that were retrieved from the in silico data mining system were compared and analyzed. We developed a well-curated cerebrovascular disease-associated gene network and provided bioinformatic resources to cerebrovascular disease researchers. This cerebrovascular disease network can be used as a frame of systematic genomic research, applicable to other complex diseases. Therefore, the ongoing database efficiently supports medical and genetic research in order to overcome cerebrovascular disease.

A genomic and bioinformatic-based approach to identify genetic variants for liver cancer across multiple continents

  • Muhammad Ma'ruf;Lalu Muhammad Irham;Wirawan Adikusuma;Made Ary Sarasmita;Sabiah Khairi;Barkah Djaka Purwanto;Rockie Chong;Maulida Mazaya;Lalu Muhammad Harmain Siswanto
    • Genomics & Informatics
    • /
    • v.21 no.4
    • /
    • pp.48.1-48.8
    • /
    • 2023
  • Liver cancer is the fourth leading cause of death worldwide. Well-known risk factors include hepatitis B virus and hepatitis C virus, along with exposure to aflatoxins, excessive alcohol consumption, obesity, and type 2 diabetes. Genomic variants play a crucial role in mediating the associations between these risk factors and liver cancer. However, the specific variants involved in this process remain under-explored. This study utilized a bioinformatics approach to identify genetic variants associated with liver cancer from various continents. Single-nucleotide polymorphisms associated with liver cancer were retrieved from the genome-wide association studies catalog. Prioritization was then performed using functional annotation with HaploReg v4.1 and the Ensembl database. The prevalence and allele frequencies of each variant were evaluated using Pearson correlation coefficients. Two variants, rs2294915 and rs2896019, encoded by the PNPLA3 gene, were found to be highly expressed in the liver tissue, as well as in the skin, cell-cultured fibroblasts, and adipose-subcutaneous tissue, all of which contribute to the risk of liver cancer. We further found that these two SNPs (rs2294915 and rs2896019) were positively correlated with the prevalence rate. Positive associations with the prevalence rate were more frequent in East Asian and African populations. We highlight the utility of this population-specific PNPLA3 genetic variant for genetic association studies and for the early prognosis and treatment of liver cancer. This study highlights the potential of integrating genomic databases with bioinformatic analysis to identify genetic variations involved in the pathogenesis of liver cancer. The genetic variants investigated in this study are likely to predispose to liver cancer and could affect its progression and aggressiveness. We recommend future research prioritizing the validation of these variations in clinical settings.

Comparison of genome-wide association and genomic prediction methods for milk production traits in Korean Holstein cattle

  • Lee, SeokHyun;Dang, ChangGwon;Choy, YunHo;Do, ChangHee;Cho, Kwanghyun;Kim, Jongjoo;Kim, Yousam;Lee, Jungjae
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.32 no.7
    • /
    • pp.913-921
    • /
    • 2019
  • Objective: The objectives of this study were to compare identified informative regions through two genome-wide association study (GWAS) approaches and determine the accuracy and bias of the direct genomic value (DGV) for milk production traits in Korean Holstein cattle, using two genomic prediction approaches: single-step genomic best linear unbiased prediction (ss-GBLUP) and Bayesian Bayes-B. Methods: Records on production traits such as adjusted 305-day milk (MY305), fat (FY305), and protein (PY305) yields were collected from 265,271 first parity cows. After quality control, 50,765 single-nucleotide polymorphic genotypes were available for analysis. In GWAS for ss-GBLUP (ssGWAS) and Bayes-B (BayesGWAS), the proportion of genetic variance for each 1-Mb genomic window was calculated and used to identify informative genomic regions. Accuracy of the DGV was estimated by a five-fold cross-validation with random clustering. As a measure of accuracy for DGV, we also assessed the correlation between DGV and deregressed-estimated breeding value (DEBV). The bias of DGV for each method was obtained by determining regression coefficients. Results: A total of nine and five significant windows (1 Mb) were identified for MY305 using ssGWAS and BayesGWAS, respectively. Using ssGWAS and BayesGWAS, we also detected multiple significant regions for FY305 (12 and 7) and PY305 (14 and 2), respectively. Both single-step DGV and Bayes DGV also showed somewhat moderate accuracy ranges for MY305 (0.32 to 0.34), FY305 (0.37 to 0.39), and PY305 (0.35 to 0.36) traits, respectively. The mean biases of DGVs determined using the single-step and Bayesian methods were $1.50{\pm}0.21$ and $1.18{\pm}0.26$ for MY305, $1.75{\pm}0.33$ and $1.14{\pm}0.20$ for FY305, and $1.59{\pm}0.20$ and $1.14{\pm}0.15$ for PY305, respectively. Conclusion: From the bias perspective, we believe that genomic selection based on the application of Bayesian approaches would be more suitable than application of ss-GBLUP in Korean Holstein populations.

Genome Wide Association Study for Phytophthora sojae Resistance with the Two Races Collected from Main Soybean Production Area in Korea with 210 Soybean Natural Population

  • Beom-Kyu Kang;Su-Vin Heo;Ji-Hee Park;Jeong-Hyun Seo;Man-Soo Choi;Jun-Hoi Kim;Jae-Bok Hwang;Ji-Yeon Ko;Yun-Woo Jang;Young-Nam Yun;Choon-Song Kim
    • Proceedings of the Korean Society of Crop Science Conference
    • /
    • 2022.10a
    • /
    • pp.202-202
    • /
    • 2022
  • Recently days, soybean production in paddy field is increasing, from 4,422 ha in 2016 to 10,658 ha in 2021 in Korea. It is easy for Phytophthora stem and root rot (PSR) occurring in paddy field condition, when it is poorly drained soils with a high clay content, and temporary flooding and ponding. Therefore PSR resistant soybean cultivar is required. The objective of this study is to identify QTL region and candidate genes relating to PSR resistance of the race in main soybean cultivation area in Korea. 210 soybean materials including cultivars and germplasm were used for inoculation and genome-wide association study (GWAS). Inoculation was conducted using stem-scar method with 2 replications in 2-year for the race 3053 from Kimje and 3617 from Andong. 210 materials were genotyped with Soya SNP 180K chip, and structure analysis and association mapping were conducted with QTLMAX V2. The results of inoculation showed that survival ratio ranged from 0% to 96.7% and mean 9.7% for 3053 and ranged from 0% to 100% and mean 7.6% for 3617. Structure analysis showed linkage disequillibrium (LD) was decayed below r2=0.5 at 335kb of SNP distance. Significant SNPs (LOD>7.0) were identified in Chr 1, 2, 3, 4, 5, 11, 14, 15 for 3053 and Chr 1, 2, 3, 7, 10, 14 for 3617. Especially, LD blocks (AX-90455181;15,056,628bp~AX-90475572;15,298,872bp) in Chr 2 for 3053 and 3067 were duplicated. 29 genes were identified on these genetic regions including Glyma.02gl47000 relating to ribosome recycling factor and defense response to fungus in Soybase.

  • PDF

Lipoprotein Lipase Polymorphism rs10503669 is Associated with High-density Lipoprotein Cholesterol Levels in Korean Population

  • Sull, Jae Woong;Eom, Yong-Bin;Jee, Sun Ha
    • Biomedical Science Letters
    • /
    • v.20 no.4
    • /
    • pp.221-226
    • /
    • 2014
  • High-density lipoprotein (HDL) cholesterol levels are associated with decreased risk of coronary artery disease. Several genome-wide association studies (GWAS) for HDL cholesterol levels have implicated Lipoprotein lipase (LPL) as possibly being causal. Herein, the association between single nucleotide polymorphism (SNP) rs10503669 in the LPL gene and HDL cholesterol levels and triglyceride levels was tested in the Korean population. A total of 994 subjects from Seoul City were included in a replication study with LPL SNP rs10503669. SNP rs10503669 in the LPL gene was associated with mean HDL cholesterol levels (effect per allele 3.13 mg/dL, P<0.0001) and triglyceride levels (effect per allele -18.0 mg/dL, P=0.0026). Subjects with the CA/AA genotype had a 0.42-fold (range 0.23~0.77-fold) lower risk of having abnormal HDL cholesterol levels (<40 mg/dL) than subjects with the CC genotype. When analyzed by gender, the association of LPL was stronger in men than in women. This study clearly demonstrates that genetic variants in LPL influence HDL cholesterol levels and triglyceride levels in Korean adults.

Effect of single nucleotide polymorphisms on intramuscular fat content in Hungarian Simmental cattle

  • Anton, Istvan;Huth, Balazs;Fuller, Imre;Rozsa, Laszlo;Hollo, Gabriella;Zsolnai, Attila
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.31 no.9
    • /
    • pp.1415-1419
    • /
    • 2018
  • Objective: To estimate effect of single nucleotide polymorphisms on the intramuscular fat content (IMF) of Hungarian Simmental bulls. Methods: Genotypes were determined on high-density Illumina Bovine DNA Chip. After slaughtering of animals, chemical percentage of intramuscular fat was determined from longissimus dorsi muscle. A multi-locus mixed-model was applied for statistical analyses. Results: Analyses revealed four loci (rs43284251, rs109210955, rs41630030, and rs41642251) to be highly associated ($-{\log}_{10}P$>12) with IMF located on chromosome 1, 6, 13, and 17, respectively. The frequency of their minor alleles was 0.426, 0.221, 0.162, and 0.106. Conclusion: The loci above can be useful in selection programs and gives the possibility to assist selection by molecular tools.

Gene expression and SNP identification related to leaf angle traits using a genome-wide association study in rice (Oryza sativa L.) (GWAS 분석을 이용한 벼 지엽각 관련 SNP 동정 및 발현 분석)

  • Kim, Me-Sun;Yu, Yeisoo;Kang, Kwon-Kyoo;Cho, Yong-Gu
    • Journal of Plant Biotechnology
    • /
    • v.45 no.1
    • /
    • pp.17-29
    • /
    • 2018
  • This study was conducted to investigate a morphological trait in 294 rice accessions including Korean breeding lines. We also carried out a genome-wide association study (GWAS) to detect significant single nucleotide polymorphism markers and candidate genes affecting major agronomic traits. A Manhattan plot analysis of GWAS using morphological traits showed that phenotypic and statistical significance was associated with a chromosome in each group. The significance of SNPs that were detected in this study was investigated by comparing them with those found previously studied QTL regions related to agronomic traits. As a result, SNP (S8-19815442), which is significant with regard to leaf angle, was located in the known QTL regions. To observe gene mutations related to leaf angle in a candidate gene, Os08g31950, its sequences were compared with sequences in previously selected rice varieties. In Os08g31950, a single nucleotide mutation occurred in one region. To compare relative RNA expression levels of candidate gene Os08g31950, obtained from GWAS analysis of 294 rice accessions and related to lateral leaf angle, we investigated relative levels by selecting 10 erect leaf angle varieties and 10 horizontal leaf angle varieties and examining real-time PCR. In Os08g31950, a high level of expression and various expression patterns were observed in all tissues. Also, Os08g31950 showed higher expression levels in the erect leaf angle variety group and higher expression rates in the leaf than in the root. The candidate gene detected through GWAS would be useful in developing new rice varieties with improved yield potential through future molecular breeding.