• Title/Summary/Keyword: Protein Ontology

Search Result 132, Processing Time 0.019 seconds

Protein Ontology: Semantic Data Integration in Proteomics

  • Sidhu, Amandeep S.;Dillon, Tharam S.;Chang, Elizabeth;Sidhu, Baldev S.
    • Proceedings of the Korean Society for Bioinformatics Conference
    • /
    • 2005.09a
    • /
    • pp.388-391
    • /
    • 2005
  • The Protein Structural and Functional Conservation need a common language for data definition. With the help of common language provided by Protein Ontology the high level of sequence and functional conservation can be extended to all organisms with the likelihood that proteins that carry out core biological processes will again be probable orthologues. The structural and functional conservation in these proteins presents both opportunities and challenges. The main opportunity lies in the possibility of automated transfer of protein data annotations from experimentally traceable model organisms to a less traceable organism based on protein sequence similarity. Such information can be used to improve human health or agriculture. The challenge lies in using a common language to transfer protein data annotations among different species of organisms. First step in achieving this huge challenge is producing a structured, precisely defined common vocabulary using Protein Ontology. The Protein Ontology described in this paper covers the sequence, structure and biological roles of Protein Complexes in any organism.

  • PDF

Comparison of External Information Performance Predicting Subcellular Localization of Proteins (단백질의 세포내 위치를 예측하기 위한 외부정보의 성능 비교)

  • Chi, Sang-Mun
    • Journal of KIISE:Software and Applications
    • /
    • v.37 no.11
    • /
    • pp.803-811
    • /
    • 2010
  • Since protein subcellular location and biological function are highly correlated, the prediction of protein subcellular localization can provide information about the function of a protein. In order to enhance the prediction performance, external information other than amino acids sequence information is actively exploited in many researches. This paper compares the prediction capabilities resided in amino acid sequence similarity, protein profile, gene ontology, motif, and textual information. In the experiments using PLOC dataset which has proteins less than 80% sequence similarity, sequence similarity information and gene ontology are effective information, achieving a classification accuracy of 94.8%. In the experiments using BaCelLo IDS dataset with low sequence similarity less than 30%, using gene ontology gives the best prediction accuracies, 93.2% for animals and 86.6% for fungi.

Protein Interaction Network Visualization System Combined with Gene Ontology (유전자 온톨로지와 연계한 단백질 상호작용 네트워크 시각화 시스템)

  • Choi, Yun-Kyu;Kim, Seok;Yi, Gwan-Su;Park, Jin-Ah
    • Journal of KIISE:Computer Systems and Theory
    • /
    • v.36 no.2
    • /
    • pp.60-67
    • /
    • 2009
  • Analyzing protein-protein interactions(PPI) is an important task in bioinformatics as it can help in new drugs' discovery process. However, due to vast amount of PPI data and their complexity, efficient visualization of the data is still remained as a challenging problem. We have developed efficient and effective visualization system that integrates Gene Ontology(GO) and PPI network to provide better insights to scientists. To provide efficient data visualization, we have employed dynamic interactive graph drawing methods and context-based browsing strategy. In addition, quick and flexible cross-reference system between GO and PPI; LCA(Least Common Ancestor) finding for GO; and etc are supported as special features. In terms of interface, our visualization system provides two separate graphical windows side-by-side for GO graphs and PPI network, and also provides cross-reference functions between them.

Transcriptomic analysis of 'Campbell Early' and 'Muscat Bailey A' grapevine shoots exposed to freezing cold stress (영하의 저온에 노출된 'Campbell Early'와 'Muscat Bailey A' 포도나무 신초의 전사체 비교)

  • Kim, Seon Ae;Yun, Hae Keun
    • Journal of Plant Biotechnology
    • /
    • v.43 no.2
    • /
    • pp.204-212
    • /
    • 2016
  • To understand the responses of grapevines in response to cold stress causing the limited growth and development, differentially expressed genes (DEGs) were screened through transcriptome analysis of shoots from 2 grapevine cultivars ('Campbell Early' and 'Muscat Baily A') kept at -$2^{\circ}C$ for 4 days. In gene ontology analysis of DEGs from 'Campbell Early', there were 17,424 clones related with biological process, 28,954 with cellular component, and 6,972 with molecular function genes in response to freezing temperature. The major induced genes included dehydrin xero 1, K-box region and MADS-box transcription factor family protein, and MYB domain protein 36, and inhibited genes included light-harvesting chlorophyll B-binding protein 3, FASCICLIN-like arabinoogalactan 9, and pectin methylesterase 61 in 'Campbell Early' grapevines. In gene ontology analysis of DEGs from 'Muscat Baily A', there were 1,157 clones related with biological process, 1,350 with cellular component, and 431 with molecular function gene. The major induced genes of 'Muscat Baily A' included NB-ARC domain-containing disease resistance protein, fatty acid hydrozylase superfamily, and isopentenyltransferase 3, and inhibited genes included binding, IAP-like protein 1, and pentatricopeptide repeat superfamily protein. All major DEGs were shown to be expressed differentially by freezing temperature in real time-PCR analysis. Protein domain analysis using InterPro Scan revealed that ubiquitin-protein ligase was redundant in both tested grapevines. Transcriptome profile of shoots exposed to cold can provide new insights into the molecular basis of tolerance to low-temperature in grapevines, and can be used as resources for development new grapevines tolerant to coldness.

Improved Algorithms for the Identification of Yeast Proteins and Significant Transcription Factor and Motif Analysis

  • Lee Seung-Won;Hong Seong-Eui;Lee Kyoo-Yeol;Choi Do-Il;Chung Hae-Young;Hur Cheol-Goo
    • Genomics & Informatics
    • /
    • v.4 no.2
    • /
    • pp.87-93
    • /
    • 2006
  • With the rapid development of MS technologiesy, the demands for a more sophisticated MS interpretation algorithm haves grown as well. We have developed a new protein fingerprinting method using a binomial distribution, (fBIND). With the fBIND, we improved the performance accuracy of protein fingerprinting up to the maximum 49% (more than MOWSE) and 2% than(at a previous binomial distribution approach studied by of Wool et al.) as compared to the established algorithms. Moreover, we also suggest a the statistical approach to define the significance of transcription factors and motifs in the identified proteins based on the Gene Ontology (GO). Abbreviations: fBIND, fingerprinting using binomial distribution; GO, Gene Ontology; MS, Mass Spectrometry; PMF, peptide mass fingerprinting; nr, nonredundant; SGD, Saccharomyces Genome Database

Prediction of hub genes of Alzheimer's disease using a protein interaction network and functional enrichment analysis

  • Wee, Jia Jin;Kumar, Suresh
    • Genomics & Informatics
    • /
    • v.18 no.4
    • /
    • pp.39.1-39.8
    • /
    • 2020
  • Alzheimer's disease (AD) is a chronic, progressive brain disorder that slowly destroys affected individuals' memory and reasoning faculties, and consequently, their ability to perform the simplest tasks. This study investigated the hub genes of AD. Proteins interact with other proteins and non-protein molecules, and these interactions play an important role in understanding protein function. Computational methods are useful for understanding biological problems, in particular, network analyses of protein-protein interactions. Through a protein network analysis, we identified the following top 10 hub genes associated with AD: PTGER3, C3AR1, NPY, ADCY2, CXCL12, CCR5, MTNR1A, CNR2, GRM2, and CXCL8. Through gene enrichment, it was identified that most gene functions could be classified as integral to the plasma membrane, G-protein coupled receptor activity, and cell communication under gene ontology, as well as involvement in signal transduction pathways. Based on the convergent functional genomics ranking, the prioritized genes were NPY, CXCL12, CCR5, and CNR2.

Web-Based Computational System for Protein-Protein Interaction Inference

  • Kim, Ki-Bong
    • Journal of Information Processing Systems
    • /
    • v.8 no.3
    • /
    • pp.459-470
    • /
    • 2012
  • Recently, high-throughput technologies such as the two-hybrid system, protein chip, Mass Spectrometry, and the phage display have furnished a lot of data on protein-protein interactions (PPIs), but the data has not been accurate so far and the quantity has also been limited. In this respect, computational techniques for the prediction and validation of PPIs have been developed. However, existing computational methods do not take into account the fact that a PPI is actually originated from the interactions of domains that each protein contains. So, in this work, the information on domain modules of individual proteins has been employed in order to find out the protein interaction relationship. The system developed here, WASPI (Web-based Assistant System for Protein-protein interaction Inference), has been implemented to provide many functional insights into the protein interactions and their domains. To achieve those objectives, several preprocessing steps have been taken. First, the domain module information of interacting proteins was extracted by taking advantage of the InterPro database, which includes protein families, domains, and functional sites. The InterProScan program was used in this preprocess. Second, the homology comparison with the GO (Gene Ontology) and COG (Clusters of Orthologous Groups) with an E-value of $10^{-5}$, $10^{-3}$ respectively, was employed to obtain the information on the function and annotation of each interacting protein of a secondary PPI database in the WASPI. The BLAST program was utilized for the homology comparison.

GSnet: An Integrated Tool for Gene Set Analysis and Visualization

  • Choi, Yoon-Jeong;Woo, Hyun-Goo;Yu, Ung-Sik
    • Genomics & Informatics
    • /
    • v.5 no.3
    • /
    • pp.133-136
    • /
    • 2007
  • The Gene Set network viewer (GSnet) visualizes the functional enrichment of a given gene set with a protein interaction network and is implemented as a plug-in for the Cytoscape platform. The functional enrichment of a given gene set is calculated using a hypergeometric test based on the Gene Ontology annotation. The protein interaction network is estimated using public data. Set operations allow a complex protein interaction network to be decomposed into a functionally-enriched module of interest. GSnet provides a new framework for gene set analysis by integrating a priori knowledge of a biological network with functional enrichment analysis.

Identification of prognosis-specific network and prediction for estrogen receptor-negative breast cancer using microarray data and PPI data (마이크로어레이 데이터와 PPI 데이터를 이용한 에스트로겐 수용체 음성 유방암 환자의 예후 특이 네트워크 식별 및 예후 예측)

  • Hwang, Youhyeon;Oh, Min;Yoon, Youngmi
    • Journal of the Korea Society of Computer and Information
    • /
    • v.20 no.2
    • /
    • pp.137-147
    • /
    • 2015
  • This study proposes an algorithm for predicting breast cancer prognosis based on genetic network. We identify prognosis-specific network using gene expression data and PPI(protein-protein interaction) data. To acquire the network, we calculate Pearson's correlation coefficient(PCC) between genes in all PPI pairs using gene expression data. We develop a prediction model for breast cancer patients with estrogen-receptor-negative using the network as a classifier. We compare classification performance of our algorithm with existing algorithms on independent data and shows our algorithm is improved. In addition, we make an functionality analysis on the genes in the prognosis-specific network using GO(Gene Ontology) enrichment validation.

A network pharmacology approach to explore the potential role of Panax ginseng on exercise performance

  • Kim, Jisu;Lee, Kang Pa;Kim, Myoung-Ryu;Kim, Bom Sahn;Moon, Byung Seok;Shin, Chul Ho;Baek, Suji;Hong, Bok Sil
    • Korean Journal of Exercise Nutrition
    • /
    • v.25 no.3
    • /
    • pp.28-35
    • /
    • 2021
  • [Purpose] As Panax ginseng C. A. Meyer (ginseng) exhibits various physiological activities and is associated with exercise, we investigated the potential active components of ginseng and related target genes through network pharmacological analysis. Additionally, we analyzed the association between ginseng-related genes, such as the G-protein-coupled receptors (GPCRs), and improved exercise capacity. [Methods] Active compounds in ginseng and the related target genes were searched in the Traditional Chinese Medicine Database and Analysis Platform (TCMSP). Gene ontology functional analysis was performed to identify biological processes related to the collected genes, and a compound-target network was visualized using Cytoscape 3.7.2. [Results] A total of 21 ginseng active compounds were detected, and 110 targets regulated by 17 active substances were identified. We found that the active compound protein was involved in the biological process of adrenergic receptor activity in 80%, G-protein-coupled neurotransmitter in 10%, and leucocyte adhesion to arteries in 10%. Additionally, the biological response centered on adrenergic receptor activity showed a close relationship with G protein through the beta-1 adrenergic receptor gene reactivity. [Conclusion] According to bioavailability analysis, ginseng comprises 21 active compounds. Furthermore, we investigated the ginseng-stimulated gene activation using ontology analysis. GPCR, a gene upregulated by ginseng, is positively correlated to exercise. Therefore, if a study on this factor is conducted, it will provide useful basic data for improving exercise performance and health.