• 제목/요약/키워드: pathway genome database

검색결과 50건 처리시간 0.025초

Higher Order Knowledge Processing: Pathway Database and Ontologies

  • Fukuda, Ken Ichiro
    • Genomics & Informatics
    • /
    • 제3권2호
    • /
    • pp.47-51
    • /
    • 2005
  • Molecular mechanisms of biological processes are typically represented as 'pathways' that have a graph­analogical network structure. However, due to the diversity of topics that pathways cover, their constituent biological entities are highly diverse and the semantics is embedded implicitly. The kinds of interactions that connect biological entities are likewise diverse. Consequently, how to model or process pathway data is not a trivial issue. In this review article, we give an overview of the challenges in pathway database development by taking the INOH project as an example.

A Unified Object Database for Biochemical Pathways

  • Jung, T.S.;Oh, J.S.;Jang, H.K.;Ahn, M.S.;Roh, D.H.;Cho, W.S.
    • 한국생물정보학회:학술대회논문집
    • /
    • 한국생물정보시스템생물학회 2005년도 BIOINFO 2005
    • /
    • pp.383-387
    • /
    • 2005
  • One of the most important issues in post-genome era is identifying functions of genes and understanding the interaction among them. Such interactions from complex biochemical pathways, which are very useful to understand the organism system. We present an integrated biochemical pathway database system with a set of software tools for reconstruction, visualization, and simulation of the pathways from the database. The novel features of the presented system include: (a) automatic integration of the heterogeneous biochemical pathway databases, (b) gene ontology for high quality of database in the integration and query (c) various biochemical simulations on the pathway database, (d) dynamic pathway reconstruction for the gene list or sequence data, (e) graphical tools which enable users to view the reconstructed pathways in a dynamic form, (f) importing/exporting SBML documents, a data exchange standard for systems biology.

  • PDF

미선나무 품종 옥황 1호의 유전체를 활용한 Acteoside 생화학 합성과정 예측 및 확인 (Prediction and Identification of Biochemical Pathway of Acteoside from Whole Genome Sequences of Abeliophyllum Distichum Nakai, Cultivar Ok Hwang 1ho)

  • 박재호;시홍;한지윤;이정민;김용성;이준미;손장혁;안정좌;장태원;최지수;박종선
    • 융합정보논문지
    • /
    • 제10권3호
    • /
    • pp.76-91
    • /
    • 2020
  • 최근에 한국 고유종인 미선나무 (Abeliophyllum distichum Nakai; Oleaceae) 품종 옥황1호의 유전체가 성공적으로 해독되었다. Acteoside는 다양한 활성을 가지는 물질이며, 여러개의 생화학합성과정이 제시되어왔고, 이들을 통합 검토하여 정확한 생화학합성과정을 완성하였다. 유전체 데이터로부터 2차대사산물을 예측할 수 있는 MetaPre-AITM와 정확한 acteoside 생화학합성과정, InfoBoss Pathway Database를 활용하여, acteoside에 관여하는 모든 효소의 유전자를 옥황1호 유전체로부터 성공적으로 확인하였다. 이는 옥황1호는 acteoside 물질을 생산할 수 있는 가능성이 있음을 의미한다. 이에 고성능액체크로마토그래피를 사용하여 옥황1호의 캘러스 세포를 분석하여 acteoside과 이의 유도체인 isoacteoside를 확인하였다. 본 연구는 MetaPre-AITM은 유전체로부터 2차대사산물을 성공적으로 예측하였다. 이 방법은 화학물질보다 안정적인 DNA를 분석하여 2차 대사산물을 예측하는 효율적인 방법이 될 것이다.

EST Knowledge Integrated Systems (EKIS): An Integrated Database of EST Information for Research Application

  • Kim, Dae-Won;Jung, Tae-Sung;Choi, Young-Sang;Nam, Seong-Hyeuk;Kwon, Hyuk-Ryul;Kim, Dong-Wook;Choi, Han-Suk;Choi, Sang-Heang;Park, Hong-Seog
    • Genomics & Informatics
    • /
    • 제7권1호
    • /
    • pp.38-40
    • /
    • 2009
  • The EST Knowledge Integrated System, EKIS (http://ekis.kribb.re.kr), was established as a part of Korea's Ministry of Education, Science and Technology initiative for genome sequencing and application research of the biological model organisms (GEAR) project. The goals of the EKIS are to collect EST information from GEAR projects and make an integrated database to provide transcriptomic and metabolomic information for biological scientists. The EKIS constitutes five independent categories and several retrieval systems in each category for incorporating massive EST data from high-throughput sequencing of 65 different species. Through the EKIS database, scientists can freely access information including BLAST functional annotation as well as Genechip and pathway information for KEGG. By integrating complex data into a framework of existing EST knowledge information, the EKIS provides new insights into specialized metabolic pathway information for an applied industrial material.

Gene annotation by the "interactome"analysis in KEGG

  • Kanehisa, Minoru
    • 한국생물정보학회:학술대회논문집
    • /
    • 한국생물정보시스템생물학회 2000년도 International Symposium on Bioinformatics
    • /
    • pp.56-58
    • /
    • 2000
  • Post-genomics may be defined in different ways depending on how one views the challenges after the genome. A popular view is to follow the concept of the central dogma in molecular biology, namely from genome to transcriptome to proteome. Projects are going on to analyze gene expression profiles both at the mRNA and protein levels and to catalog protein 3D structure families, which will no doubt help the understanding of information in the genome. However complete, such catalogs of genes, RNAs, and proteins only tell us about the building blocks of life. They do not tell us much about the wiring (interaction) of building blocks, which is essential for uncovering systemic functional behaviors of the cell or the organism. Thus, an alternative view of post-genomics is to go up from the molecular level to the cellular level, and to understand, what I call, the "interactome"or a complete picture of molecular interactions in the cell. KEGG (http://www.genome.ad.jp/kegg/) is our attempt to computerize current knowledge on various cellular processes as a collection of "generalized"protein-protein interaction networks, to develop new graph-based algorithms for predicting such networks from the genome information, and to actually reconstruct the interactomes for all the completely sequenced genomes and some partial genomes. During the reconstruction process, it becomes readily apparent that certain pathways and molecular complexes are present or absent in each organism, indicating modular structures of the interactome. In addition, the reconstruction uncovers missing components in an otherwise complete pathway or complex, which may result from misannotation of the genome or misrepresentation of the KEGG pathway. When combined with additional experimental data on protein-protein interactions, such as by yeast two-hybrid systems, the reconstruction possibly uncovers unknown partners for a particular pathway or complex. Thus, the reconstruction is tightly coupled with the annotation of individual genes, which is maintained in the GENES database in KEGG. We are also trying to expand our literature surrey to include in the GENES database most up-to-date information about gene functions.

  • PDF

SOP (Search of Omics Pathway): A Web-based Tool for Visualization of KEGG Pathway Diagrams of Omics Data

  • Kim, Jun-Sub;Yeom, Hye-Jung;Kim, Seung-Jun;Kim, Ji-Hoon;Park, Hye-Won;Oh, Moon-Ju;Hwang, Seung-Yong
    • Molecular & Cellular Toxicology
    • /
    • 제3권3호
    • /
    • pp.208-213
    • /
    • 2007
  • With the help of a development and popularization of microarray technology that enable to us to simultaneously investigate the expression pattern of thousands of genes, the toxicogenomics experimenters can interpret the genome-scale interaction between genes exposed in toxicant or toxicant-related environment. The ultimate and primary goal of toxicogenomics identifies functional context among the group of genes that are differentially or similarly coexpressed under the specific toxic substance. On the other side, public reference databases with transcriptom, proteom, and biological pathway information are needed for the analysis of these complex omics data. However, due to the heterogeneous and independent nature of these databases, it is hard to individually analyze a large omics annotations and their pathway information. Fortunately, several web sites of the public database provide information linked to other. Nevertheless it involves not only approriate information but also unnecessary information to users. Therefore, the systematically integrated database that is suitable to a demand of experimenters is needed. For these reasons, we propose SOP (Search of Omics Pathway) database system which is constructed as the integrated biological database converting heterogeneous feature of public databases into combined feature. In addition, SOP offers user-friendly web interfaces which enable users to submit gene queries for biological interpretation of gene lists derived from omics experiments. Outputs of SOP web interface are supported as the omics annotation table and the visualized pathway maps of KEGG PATHWAY database. We believe that SOP will appear as a helpful tool to perform biological interpretation of genes or proteins traced to omics experiments, lead to new discoveries from their pathway analysis, and design new hypothesis for a next toxicogenomics experiments.

KUGI: A Database and Search System for Korean Unigene and Pathway Information

  • Yang, Jin-Ok;Hahn, Yoon-Soo;Kim, Nam-Soon;Yu, Ung-Sik;Woo, Hyun-Goo;Chu, In-Sun;Kim, Yong-Sung;Yoo, Hyang-Sook;Kim, Sang-Soo
    • 한국생물정보학회:학술대회논문집
    • /
    • 한국생물정보시스템생물학회 2005년도 BIOINFO 2005
    • /
    • pp.407-411
    • /
    • 2005
  • KUGI (Korean UniGene Information) database contains the annotation information of the cDNA sequences obtained from the disease samples prevalent in Korean. A total of about 157,000 5'-EST high throughput sequences collected from cDNA libraries of stomach, liver, and some cancer tissues or established cell lines from Korean patients were clustered to about 35,000 contigs. From each cluster a representative clone having the longest high quality sequence or the start codon was selected. We stored the sequences of the representative clones and the clustered contigs in the KUGI database together with their information analyzed by running Blast against RefSeq, human mRNA, and UniGene databases from NCBI. We provide a web-based search engine fur the KUGI database using two types of user interfaces: attribute-based search and similarity search of the sequences. For attribute-based search, we use DBMS technology while we use BLAST that supports various similarity search options. The search system allows not only multiple queries, but also various query types. The results are as follows: 1) information of clones and libraries, 2) accession keys, location on genome, gene ontology, and pathways to public databases, 3) links to external programs, and 4) sequence information of contig and 5'-end of clones. We believe that the KUGI database and search system may provide very useful information that can be used in the study for elucidating the causes of the disease that are prevalent in Korean.

  • PDF

닭 특이 대사 경로 재확립 (Reconstruction of Metabolic Pathway for the Chicken Genome)

  • 김운수;이세영;박혜선;백운기;이준헌;서성원
    • 한국가금학회지
    • /
    • 제37권3호
    • /
    • pp.275-282
    • /
    • 2010
  • 닭의 대사 생리에 대한 연구는 산업적 가치 및 생물학, 의학적으로도 매우 중요하다. 닭의 유전체 염기서열 분석 결과는 2004년에 처음 발표되었고, 이러한 유전체 정보를 바탕으로 유전형과 표현형의 상관관계를 분석하는 연구가 필요하다. 따라서 본 연구는 닭 유전체 정보를 바탕으로 대사 경로를 재확립하고, 닭 특이 대사 경로 유전체 데이터베이스를 구축하였다. 이를 위해 Perl 언어를 기반으로 개발된 자동 파이프라인(pipeline)을 이용하여 여러 생물정보 데이터베이스에 산재해 있는 닭 유전체에 관한 정보를 통합한 닭 특이 통합 데이터베이스를 구축하였다. 또한, 구축된 닭 특이 통합 데이터베이스를기반으로PathoLogic 알고리즘을구현한Pathway Tools 소프트웨어를 이용하여 닭 특이 대사 경로를 재확립하였다. 결과적으로, 닭 유전체 Gallus_gallus-2.1에서 2,709개의 효소, 71개의 운반체(transporter)와 1,698개의 효소 반응, 8개의 운반 반응(transport reaction)이 도출되었다. 이를 통해 총 212개의 대사 경로가 재확립되었고, 1,360개의 화합물(compound)이 닭 특이 대사 데이터베이스에 포함되었다. 다른 종(사람, 생쥐, 소)과의 비교 분석을 통해 중요한 대사 경로가 닭 유전체에 보존되어 있음을 보였다. 또한, 닭 유전체의 assembly와 annotation의 질을 높이는 노력과 닭 및 조류에서 유전자 기능 및 대사 경로에 대한 연구가 필요한 것으로 나타났다. 결론적으로, 본 연구에서 재확립된 닭의 대사 경로 및 데이터베이스는 닭 및 조류의 대사 연구뿐만 아니라 포유동물 및 미생물과의 비교 생물학적 접근을 통한 의학 및 생물학적 연구에 활용될 것으로 기대된다.

In Silico Identification of 6-Phosphogluconolactonase Genes that are Frequently Missing from Completely Sequenced Bacterial Genomes

  • Jeong, Hae-Young;F. Kim, Ji-Hyun;Park, Hong-Seog
    • Genomics & Informatics
    • /
    • 제4권4호
    • /
    • pp.182-187
    • /
    • 2006
  • 6-Phosphogluconolactonase (6PGL) is one of the key enzymes in the ubiquitous pathways of central carbon metabolism, but bacterial 6PGL had been long known as a missing enzyme even after complete bacterial genome sequence information became available. Although recent experimental characterization suggests that there are two types of 6PGLs (DevB and YbhE), their phylogenetic distribution is severely biased. Here we present that proteins in COG group previously described as 3-oarboxymuconate cyclase (COG2706) are actually the YbhE-type 6PGLs, which are widely distributed in Proteobacteria and Fimicutes. This case exemplifies how erroneous functional description of a member in the reference database commonly used in transitive genome annotation cause systematic problem in the prediction of genes even with universal cellular functions.

Genome analysis of Yucatan miniature pigs to assess their potential as biomedical model animals

  • Kwon, Dae-Jin;Lee, Yeong-Sup;Shin, Donghyun;Won, Kyeong-Hye;Song, Ki-Duk
    • Asian-Australasian Journal of Animal Sciences
    • /
    • 제32권2호
    • /
    • pp.290-296
    • /
    • 2019
  • Objective: Pigs share many physiological, anatomical and genomic similarities with humans, which make them suitable models for biomedical researches. Understanding the genetic status of Yucatan miniature pigs (YMPs) and their association with human diseases will help to assess their potential as biomedical model animals. This study was performed to identify non-synonymous single nucleotide polymorphisms (nsSNPs) in selective sweep regions of the genome of YMPs and present the genetic nsSNP distributions that are potentially associated with disease occurrence in humans. Methods: nsSNPs in whole genome resequencing data from 12 YMPs were identified and annotated to predict their possible effects on protein function. Sorting intolerant from tolerant (SIFT) and polymorphism phenotyping v2 analyses were used, and gene ontology (GO) network and Kyoto encyclopedia of genes and genomes (KEGG) pathway analyses were performed. Results: The results showed that 8,462 genes, encompassing 72,067 nsSNPs were identified, and 118 nsSNPs in 46 genes were predicted as deleterious. GO network analysis classified 13 genes into 5 GO terms (p<0.05) that were associated with kidney development and metabolic processes. Seven genes encompassing nsSNPs were classified into the term associated with Alzheimer's disease by referencing the genetic association database. The KEGG pathway analysis identified only one significantly enriched pathway (p<0.05), hsa04080: Neuroactive ligand-receptor interaction, among the transcripts. Conclusion: The number of deleterious nsSNPs in YMPs was identified and then these variants-containing genes in YMPs data were adopted as the putative human diseases-related genes. The results revealed that many genes encompassing nsSNPs in YMPs were related to the various human genes which are potentially associated with kidney development and metabolic processes as well as human disease occurrence.