Deciphering FEATURE for Novel Protein Data Analysis and Functional Annotation

단백질 구조 및 기능 분석을 위한 FEATURE 시스템 개선

  • 유승학 (고려대학교 전기전자전파공학부) ;
  • 윤성로 (고려대학교 전기전자전파공학부)
  • Published : 2009.09.30

Abstract

FEATURE is a computational method to recognize functional and structural sites for automatic protein function prediction. By profiling physicochemical properties around residues, FEATURE can characterize and predict functional and structural sites in 3D protein structures in a high-throughput manner. Despite its effectiveness, it has been challenging to apply FEATURE to novel protein data due to limited customization support. To address this problem, we thoroughly analyze the internal modules of FEATURE and propose a methodology to customize FEATURE so that it can be used for new protein data for automatic functional annotations.

FEATURE는 단백질 내에서 특정 기능이나 구조를 가지고 있는 site의 미세환경분포를 이용하여 다른 단백질 내에서 이와 유사한 미세환경을 가지고 있는 부분을 찾아 그 분분이 site일 확률을 수치적으로 제시해 줌으로써 사용자로 하여금 site의 존재 유무와 그 위치를 판단하는데 기준을 제공해주는 유용한 툴이다. 하지만 기존의 FEATURE에서 사용된 데이터 이외의 새로운 단백질 구조 데이터를 FEATURE에 적용하기 위해서는 FEATURE 내부의 module을 입력 데이터 구조에 맞게 수정해야 한다. 그러나 FEATURE 내부의 module 구조를 수정하는 방식이 직관적이지 않기 때문에 많은 연구자들이 FEATURE를 원활하게 사용하지 못하였다. 따라서 본 논문에서는 FEATURE의 내부 구조를 분석하고 FEATURE를 새로운 단백질 데이터에 적용하기 위한 방법을 제시한다.

Keywords

References

  1. Sungroh Yoon, Jessica C. Ebert, Eui-Young Chung, Giovanni De Micheli and Russ B. Altman " Clustering protein environments for function prediction: finding PROSITE motifs in 3D" BMC(BioMedCentral) Bioinformatics 8(Suppl 4):S10, 2007.
  2. Liping Wei and Russ B. Altman, "Recognizingcomplex, asymmetric functional sites in proteinstructures using a bayesian scoring function,"Journal of Bioinformatics and Computational BiologyVol. 1, pp. 119-138, 2003. https://doi.org/10.1142/S0219720003000150
  3. Steven C. Bagley, Liping Wei, Carol Cheon, andRuss B. Altman "Characterizing oriented proteinstructural sites using biochemical properties" ProcInt Conf Intell Syst Mol Biol. 3, pp. 12-20, 1995.
  4. Wallace, A.C., N.Borkakoti, and J.M. Thornton,"TESS: a geometric hashing algorithm for deriving3D coordinate templates for searching structuraldatabases. Application to enzyme active sites"Protein Sci. 6, 11(1997), pp. 2308-2323 , 1997. https://doi.org/10.1002/pro.5560061104
  5. Wallace, A.C., R.A. Laskowski, and J.M.Thornton, "Derivation of 3D coordinate templatesfor searching structural databases: application toSer-His-Asp catalytic triads in the serineproteinases and lipases" Protein Sci. 5, 6(1996), pp.1001-1013, 1996. https://doi.org/10.1002/pro.5560050603
  6. Fetrow, J.S. and J. Skolnick, "Method forprediction of protein function from sequence usingthe sequence-to structure-to function paradigmwith application to glutaredoxins/thioredoxins andT1 ribonucleases" J Mol Biol. 281, 5(1998) pp. 949-968, 1998. https://doi.org/10.1006/jmbi.1998.1993
  7. Fetrow, J.S., A. Godzik, and J. Skolnick,"Functional analysis of the Escherichia coli genomeusing the sequence- to structure-to-funtionparadigm: identification of proteins exhibiting theglutaredoxin/thioredoxin disulfide oxidoreductaseactivity" J Mol Biol. 282, 4(1998), pp. 703-711, 1998. https://doi.org/10.1006/jmbi.1998.2061
  8. Inbal Haplperin, Dariya S Glazer, Shirely Wu andRuss B Altman "The FEATURE framework forprotein function annotation: modelling new functions,improving performance, and extending to novelapplications" BMC Genomics 9(Suppl 2):52, 2008. https://doi.org/10.1186/1471-2164-9-52
  9. M.P. Liang, D.L. Brutlag, R.B. Altman,"Automated construction of structural motifs forpredicting functional sites on protein structures," ThePac Symp Biocomput. pp. 204-215, 2003.
  10. Liping Wei, Russ B. Altman, Jeffrey T. Chang "Using the radial distributions of physical features tocompare amino acid environments and align aminoacid sequences" Pac Symp Biocomput. pp. 465-76,1997.
  11. Liping Wei and Russ B. Altman "Recognizingprotein binding sites using statistical descriptions oftheir 3D environments" Pac Symp Biocomput. pp.497-508, 1998.