• Title/Summary/Keyword: genome-wide search


COCAW: A Genome-wide Pattern Search System for Designing Microbial Probes

  • Ryu, Seung-Hee;Park, Kie-Jung;Lee, Do-Hoon;Kim, Cheol-Min
    • Genomics & Informatics / v.7 no.3 / pp.178-180 / 2009
  • A few bioinformatics tools have been used to find conserved regions for use as probes. We have developed a web-based system that uses a heuristic method to find conserved regions across microbial genomes. The system runs in real time by computing relative entropy within limited, narrow regions and by detecting similar regions between pairs of regions with local alignment. The system could be useful for finding conserved regions on a genome-wide scale.
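As a rough illustration of how relative entropy can score conservation, the sketch below computes the Kullback-Leibler divergence of each alignment column from a uniform background and averages it over narrow windows. The abstract does not give COCAW's actual formula, so the function names, the uniform background, and the assumption of pre-aligned, equal-length sequences are all hypothetical.

```python
import math
from collections import Counter


def column_relative_entropy(column, background=None):
    """Relative entropy (in bits) of the base frequencies in one column
    against a background distribution (uniform over ACGT by default)."""
    if background is None:
        background = {base: 0.25 for base in "ACGT"}
    # Ignore gaps and ambiguity codes; this is an assumption of the sketch.
    counts = Counter(b for b in column.upper() if b in background)
    total = sum(counts.values())
    if total == 0:
        return 0.0
    score = 0.0
    for base, q in background.items():
        p = counts[base] / total
        if p > 0:
            score += p * math.log2(p / q)
    return score


def window_scores(sequences, width):
    """Mean per-column relative entropy for every window of `width` columns."""
    length = min(len(s) for s in sequences)
    per_column = [
        column_relative_entropy("".join(s[i] for s in sequences))
        for i in range(length)
    ]
    return [
        sum(per_column[i:i + width]) / width
        for i in range(length - width + 1)
    ]


if __name__ == "__main__":
    aligned = ["AAAC", "AAAG", "AAAT", "AAAA"]
    print(window_scores(aligned, 2))
```

A fully conserved column scores 2 bits under a uniform background, while a column with all four bases in equal proportion scores 0, so high-scoring windows are candidate conserved regions.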

Genome-wide Linkage Study for Plasma HDL Cholesterol Level in an Isolated Population of Mongolia

  • Park, Han-Soo;Kim, Jong-Il;Cho, Sung-Il;Sung, Joo-Hon;Kim, Hyung-Lae;Ju, Young-Seok;Bayasgalan, Gombojav;Lee, Mi-Kyeong;Seo, Jeong-Sun
    • Genomics & Informatics / v.6 no.1 / pp.8-13 / 2008
  • High-density lipoprotein (HDL), whose primary role is to transport cholesterol from peripheral tissues to the liver, is associated with the incidence of coronary heart disease. We analyzed HDL cholesterol levels in a genetically isolated population of extended Mongolian families. A total of 1002 individuals (54.5% women) from 95 families were enrolled. After genotyping with 1000 microsatellite markers, we performed a genome-wide linkage search with variance component analysis. The estimated heritability of HDL cholesterol was 0.45, indicating that HDL cholesterol is under significant genetic influence. We found peak evidence of linkage (LOD score=1.88) for HDL cholesterol level on chromosome 6 (nearest marker D6S1660) and potential evidence of linkage on chromosomes 1, 12 and 19, with LOD scores of 1.32, 1.44 and 1.14, respectively. These results should pave the way for the discovery of the relevant genes by fine mapping and association analysis.

Web-Based Database and Viewer of East Asian Copy Number Variations

  • Kim, Ji-Hong;Hu, Hae-Jin;Chung, Yeun-Jun
    • Genomics & Informatics / v.10 no.1 / pp.65-67 / 2012
  • In a previous study, we discovered copy number variations (CNVs) in 3,578 Korean individuals with the Affymetrix Genome-Wide SNP array 5.0 and defined 4,003 copy number variation regions (CNVRs). To make the details of these variants easy to explore in related studies, we built a database cataloging the CNVs and related information. The system helps researchers browse these variants with gene and structural variant annotations. Users can easily find specific regions with search options and verify them in system-integrated genome browsers with annotations.

A Pilot Genome-wide Association Study of Breast Cancer Susceptibility Loci in Indonesia

  • Haryono, Samuel J;Datasena, I Gusti Bagus;Santosa, Wahyu Budi;Mulyarahardja, Raymond;Sari, Kartika
    • Asian Pacific Journal of Cancer Prevention / v.16 no.6 / pp.2231-2235 / 2015
  • Genome-wide association studies (GWASs) provide a systematic approach for revealing novel genetic susceptibility loci for breast cancer. However, genetic association studies have hitherto been conducted primarily in women of European ancestry. Therefore, we performed a pilot GWAS with the Affymetrix® single nucleotide polymorphism (SNP) array 5.0 platform, which contains 443,813 SNPs, to search for new genetic risk factors in 89 breast cancer cases and 46 healthy women of Indonesian ancestry. The case-control association of the GWAS finding set was evaluated using PLINK. The strengths of allelic and genotypic associations were assessed using logistic regression analysis and reported as odds ratios (ORs) and P values; P values less than $1.00{\times}10^{-8}$ and $5.00{\times}10^{-5}$ were required for significant association and suggestive association, respectively. After analyzing 292,887 SNPs, we recognized 11 chromosome loci with suggestive associations with breast cancer risk. Of these, however, only four chromosome loci had identified genes: chromosome 2p.12 with the CTNNA2 gene (OR=1.20, 95% confidence interval (CI)=1.13-1.33, $P=1.08{\times}10^{-7}$); chromosome 18p11.2 with the SOGA2 gene (OR=1.32, 95% CI=1.17-1.44, $P=6.88{\times}10^{-6}$); chromosome 5q14.1 with the SSBP2 gene (OR=1.22, 95% CI=1.11-1.34, $P=4.00{\times}10^{-5}$); and chromosome 9q31.1 with the TEX10 gene (OR=1.24, 95% CI=1.12-1.35, $P=4.68{\times}10^{-5}$). This study identified 11 chromosome loci that exhibited suggestive associations with the risk of breast cancer among Indonesian women.

Multiple Genes Related to Muscle Identified through a Joint Analysis of a Two-stage Genome-wide Association Study for Racing Performance of 1,156 Thoroughbreds

  • Shin, Dong-Hyun;Lee, Jin Woo;Park, Jong-Eun;Choi, Ik-Young;Oh, Hee-Seok;Kim, Hyeon Jeong;Kim, Heebal
    • Asian-Australasian Journal of Animal Sciences / v.28 no.6 / pp.771-781 / 2015
  • The Thoroughbred, a relatively recent horse breed, is best known for its use in horse racing. Although myostatin (MSTN) variants have been reported to be highly associated with horse racing performance, the trait is more likely to be polygenic in nature. The purpose of this study was to identify genetic variants strongly associated with racing performance by using the estimated breeding value (EBV) for race time as a phenotype. We conducted a two-stage genome-wide association study to search for genetic variants associated with the EBV. In the first stage, a relatively large number of markers (~54,000 single-nucleotide polymorphisms, SNPs) were evaluated in a small number of samples (240 horses). In the second stage, a relatively small number of markers identified as having large effects (170 SNPs) were evaluated in a much larger number of samples (1,156 horses). We also validated the SNPs related to MSTN that are known to have large effects on racing performance and found significant associations in the stage-two analysis, but not in stage one. We identified 28 significant SNPs related to 17 genes. Among these, six genes have a function related to myogenesis and five genes are involved in muscle maintenance. To our knowledge, these genes are newly reported as genetically associated with the racing performance of Thoroughbreds. This study complements recent horse genome-wide association studies of racing performance that identified other SNPs and genes as the most significant variants. These results will help to expand our knowledge of the polygenic nature of racing performance in Thoroughbreds.

A local search algorithm for predicting epistatic interactions of SNPs

  • Hong, Won-Pyo;Wee, Kyubum
    • Proceedings of the Korea Information Processing Society Conference / 2010.11a / pp.1395-1398 / 2010
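  • Genome-wide association studies (GWAS) have recently made hundreds of thousands of SNPs available. Because the volume of SNP data is so large, however, examining every SNP combination is not only computationally expensive but also carries a risk of overfitting. In this paper, we improve the speed of SNPHarvester, a filtering-based algorithm, and experiment with replacing its evaluation function with mutual information. Compared with the original SNPHarvester, speed improved by 50%, and with the new evaluation function the performance was the same as that of the original SNPHarvester.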
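To make the mutual-information evaluation function concrete, here is a minimal sketch that scores a SNP (or a SNP pair treated as one joint variable) against a case/control phenotype. It is not the authors' implementation and says nothing about SNPHarvester's search procedure; the function name and the toy genotype data are assumptions made for illustration.

```python
import math
from collections import Counter


def mutual_information(xs, ys):
    """Mutual information I(X; Y) in bits between two discrete sequences."""
    n = len(xs)
    if n == 0 or n != len(ys):
        raise ValueError("xs and ys must be non-empty and the same length")
    joint = Counter(zip(xs, ys))
    count_x = Counter(xs)
    count_y = Counter(ys)
    mi = 0.0
    for (x, y), c in joint.items():
        p_xy = c / n
        # p_xy / (p_x * p_y) rewritten with raw counts.
        mi += p_xy * math.log2(p_xy * n * n / (count_x[x] * count_y[y]))
    return mi


if __name__ == "__main__":
    # Toy data (hypothetical): the phenotype is the XOR of two SNPs, so
    # neither SNP is informative alone but the pair is fully informative.
    snp_a = [0, 0, 1, 1]
    snp_b = [0, 1, 0, 1]
    phenotype = [0, 1, 1, 0]
    print(mutual_information(snp_a, phenotype))
    print(mutual_information(snp_b, phenotype))
    print(mutual_information(list(zip(snp_a, snp_b)), phenotype))
```

The toy case shows why pairwise scoring matters for epistasis: a purely interactive effect is invisible to single-SNP scores but appears once the genotypes are scored jointly.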

Genome-wide association study of rice core set related selenium content

  • Choi, Buung;Lee, Sang Beom;Kim, Gyeong Jin;Kim, Kyu Won;Yoo, Ji Hyock;Oh, Kyeong Seok;Moon, Byeong Churl;Park, Yong Jin;Park, Sang Won
    • Proceedings of the Korean Society of Crop Science Conference / 2017.06a / pp.158-158 / 2017
  • The purpose of this study was to identify candidate genes involved in selenium content in brown rice. Rice (Oryza sativa L.) is an important crop that contains diverse functional substances such as carbohydrates, protein, lysine, tocopherol, and minerals. Selenium in particular is a nutritionally important mineral known to activate the immune system, act as an antioxidant, and inhibit carcinogenesis. The recommended daily requirement for selenium in the United States and the United Kingdom is 55 to 90 µg. In this study, the selenium content of brown rice from the core set was therefore analyzed using ICP-MS (inductively coupled plasma mass spectrometry), and a GWAS (genome-wide association study) was conducted to search for candidate genes. The new natural variants identified through haplotype analysis should be useful for developing new rice varieties with improved storage of this valuable mineral through future molecular breeding.


Genome-wide association study identifies positional candidate genes affecting back fat thickness trait in pigs

  • Lee, Jae-Bong;Kang, Ho-Chan;Kim, Eun-Ho;Kim, Yoon-Joo;Yoo, Chae-Kyoung;Choi, Tae-Jeong;Lim, Hyun-Tae
    • Korean Journal of Agricultural Science / v.45 no.4 / pp.707-713 / 2018
  • This study searched for positional candidate genes associated with the back fat thickness trait using a genome-wide association study (GWAS) in purebred Yorkshires (N = 1755). Genotype and phenotype analyses were done for 1,642 samples. Associations with back fat thickness were tested using the Gemma program (ver. 0.93), with the genome-wide suggestive threshold determined using the Bonferroni method ($p=1.61{\times}10^{-5}$). Single nucleotide polymorphism (SNP) markers with suggestive significance were identified at 1 SNP marker on chromosome 2 (MARC0053928; $p=3.65{\times}10^{-6}$), 2 SNP markers on chromosome 14 (ALGA0083078; $p=7.85{\times}10^{-6}$, INRA0048453; $p=1.27{\times}10^{-5}$), and 1 SNP marker on chromosome 18 (ALGA0120564; $p=1.44{\times}10^{-5}$). We selected positional candidate genes (KCNQ1, DOCK1, LOC106506151, and LOC110257583) located close to the SNP markers. Among these, the potassium voltage-gated channel subfamily Q member gene (KCNQ1) and the dedicator of cytokinesis 1 (DOCK1) gene are associated with obesity and type 2 diabetes. The SNPs and haplotypes of the KCNQ1 and DOCK1 genes can contribute to understanding the genetic structure of back fat thickness. They may also provide basic data for marker-assisted selection for a meat quality trait in pigs.
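The thresholding step can be sketched as follows. The abstract reports the threshold and the four marker p-values but not the number of tests or the error rate behind the threshold, so `bonferroni_threshold` and its example arguments are illustrative assumptions; only the values in `REPORTED_THRESHOLD` and `reported_p` come from the abstract.

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test threshold for a family-wise rate `alpha` over `n_tests` tests."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests


# Values quoted in the abstract.
REPORTED_THRESHOLD = 1.61e-5
reported_p = {
    "MARC0053928": 3.65e-6,   # chromosome 2
    "ALGA0083078": 7.85e-6,   # chromosome 14
    "INRA0048453": 1.27e-5,   # chromosome 14
    "ALGA0120564": 1.44e-5,   # chromosome 18
}

# Markers whose p-value falls below the reported suggestive threshold.
suggestive = sorted(m for m, p in reported_p.items() if p < REPORTED_THRESHOLD)

if __name__ == "__main__":
    # Hypothetical inputs, only to show how the function is called.
    print(bonferroni_threshold(0.05, 1000))
    print(suggestive)
```

All four reported markers fall below the reported threshold, which is consistent with the abstract describing them as suggestively significant.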

A Genome-Wide Analysis of Antibiotic Producing Genes in Streptomyces globisporus SP6C4

  • Kim, Da-Ran;Kwak, Youn-Sig
    • The Plant Pathology Journal / v.37 no.4 / pp.389-395 / 2021
  • Soil is the major source of plant-associated microbes. Several fungal and bacterial species live within plant tissues. Actinomycetes are well known for producing a variety of antibiotics, and they contribute to improving plant health. In our previous report, Streptomyces globisporus SP6C4 colonized plant tissues and was able to move to other tissues from the initially colonized ones. This strain has excellent antifungal and antibacterial activities and provides a suppressive effect upon various plant diseases. Here, we report the genome-wide analysis of antibiotic producing genes in S. globisporus SP6C4. A total of 15 secondary metabolite biosynthetic gene clusters were predicted using antiSMASH. We used the CRISPR/Cas9 mutagenesis system, and each biosynthetic gene was predicted via protein basic local alignment search tool (BLAST) and rapid annotation using subsystems technology (RAST) server. Three gene clusters were shown to exhibit antifungal or antibacterial activity, viz. cluster 16 (lasso peptide), cluster 17 (thiopeptide-lantipeptide), and cluster 20 (lantipeptide). The results of the current study showed that SP6C4 has a variety of antimicrobial activities, and this strain is beneficial in agriculture.

CONVIRT: A web-based tool for transcriptional regulatory site identification using a conserved virtual chromosome

  • Ryu, Tae-Woo;Lee, Se-Joon;Hur, Cheol-Goo;Lee, Do-Heon
    • BMB Reports / v.42 no.12 / pp.823-828 / 2009
  • Techniques for analyzing protein-DNA interactions on a genome-wide scale have recently established regulatory roles for distal enhancers. However, the large sizes of higher eukaryotic genomes have made identification of these elements difficult. Information regarding sequence conservation, exon annotation and repetitive regions can be used to reduce the size of the search region. However, previously developed resources are inadequate for consolidating such information. CONVIRT is a web resource for the identification of transcription factor binding sites and also features comparative genomics. Genomic information on ortholog-independent conserved regions, exons, repeats and sequences is integrated into the virtual chromosome, and statistically over-represented single or combinations of transcription factor binding sites are sought. CONVIRT provides regulatory network analysis for several organisms with long promoter regions and permits inter-species genome alignments. CONVIRT is freely available at http://biosoft.kaist.ac.kr/convirt.