DOI QR코드

DOI QR Code

Implementation of Prototype for a Protein Motif Prediction and Update

단백질 모티프 예측 및 갱신 프로토 타입 구현

  • Published : 2004.08.01

Abstract

Motif databases are used in the function and structure prediction of proteins. The frequency of use about these databases increases continuously because of protein sequence data growth. Recently, many researches about motif resource integration are proceeding. However, existing motif databases were developed independently, thus these databases have a heterogeneous search result problem. Database intnegration for this problem resolution has a periodic update problem, a complex query process problem, a duplicate database entry handling problem and BML support problem. Therefore, in this paper, we suppose a database resource integration method for these problem resolution, describe periodically integrated database update method and XML transformation. finally, we estimate the implementation of our prototype and a case database.

모티프 데이터베이스는 새롭게 등장하는 원시 단백질 서열의 기능 및 구조 예측에 사용된다. 이러한 모티프 데이터베이스들은 원시 단백질 서열의 빠른 성장과 더불어 급속한 이용 증가 추세를 보이고 있으며, 최근에 이르러 모티프 자원 통합에 관한 연구가 진행되고 있다. 그러나 이러한 모티프 데이터베이스들은 각기 개별적인 메소드로 개발되었기 때문에 각기 다른 형식의 검색 결과를 제공한다. 이러한 문제 해결을 위한 데이터베이스 통합에서는 데이터베이스 자동 갱신 문제, 복잡한 질의 처리 문제, 중복된 데이터베이스 엔트리 핸들링 문제, XML 지원 문제 등을 지니고 있다. 이 논문에서는 기존 문제점들을 해결하기 위하여 데이터베이스 자원 통합 방법론을 제안하였고, 통합된 데이터베이스의 주기적 갱신 방안과 XML로의 변환에 관하여 기술하였다. 아울러 구축된 통합 데이터베이스와 사례 데이터베이스를 비교 평가하였다.

Keywords

References

  1. 김성진, 이상호, '객체-관계형 데이터베이스 시스템을 위한 새로운 성능 평가 방법론', 정보처리논문지, 제7권 7호, 2000
  2. 이범주, 최은선, 류근호, '모티프 자원 통합을 이용한 단백질 모티프 예측 시스템 구현', 정보처리학회논문지D, 제10-D권 제4호, 2003 https://doi.org/10.3745/KIPSTD.2003.10D.4.679
  3. R. Apweiler, T. K. Attwood, A. Bairoch, A. Bateman, E. Birney, M. Biswas, P. Bucher, L. Cerutti, F. Corpet, M. D. R. Croning, R. Durbin, L. Falquet, W. Fleischmann, J. Gouzy, H. Hermjakob, N. Hulo, L. Jonassen, D. Kahn, A. Kanapin, Y. Karavidopoulou, R. Lopez, B. Marx, N. J. Mulder, T. M. Oinn, M. Pagni, F. Servant, C. J. A. Sigrist and E. M. Zdobnov, 'The InterPro database, an integrated documentation resource for protein families, domains and functional sites,' Nuleic Acids Research, Vol.29, No.1, pp.37-40, 2001 https://doi.org/10.1093/nar/29.1.37
  4. M. R. Wilkins, K. L. Williams, R. D. Appel, D. F. Hochstrasser, 'Proteome Research : New Frontiers in Functional Genomics,' Springer-Verlag Berlin Heidelberg, pp.109-175, 1997
  5. Minoru Kanehisa, 'Post-Genome Informatics,' Oxford university press, pp.35-47, 2000
  6. David W. Mount, 'Bioinformatics : Sequence and Genome Analysis,' Cold Spring Harbor Laboratory Press, pp.45-48, 2001
  7. Kevin A. T. Silverstein, Alan Kilian, John L. Freeman, James E. Johnson, Ihab A. Awad, Ernest F. Retzel, 'PANA L : an integrated resource for Protein sequence ANALysis,' Bioinformatics, Vol.16, pp.1157-1158, 2000 https://doi.org/10.1093/bioinformatics/16.12.1157
  8. T. K. Attwood, M. E. Beck, D. R. Flower, P. Scordis, N. Selley, 'The PRINTS protein fingerprint database in its fifty year,' Nuleic Acids Research, Vol.26, No.1, pp.304-308, 1998 https://doi.org/10.1093/nar/26.1.304
  9. Alex Bateman, Evan Birney, Lorenzo Berruti, Richard Durbin, Laurence Etwiller, Sean R. Eddy, Sam Griffiths-Jones, Kevin L. Howe, Mhairi Marshall, Erik L. L. Sonnhammer, 'The Pfam Protein Families Database,' Nuleic Acids Research, Vol.30, No.1, pp.276-280, 2002 https://doi.org/10.1093/nar/30.1.276
  10. Jorja G. Henikoff, Steven Henikoff, Shmuel Pietrokovski, 'New features of the Block Database servers,' Nuleic Acids Research, Vol.27, No.1, pp.226-228, 1999 https://doi.org/10.1093/nar/27.1.226
  11. T. K. Attwood, H. Aviison, M. E. Beck, M. Bewley, A. J. Bleasby, F. Brewster, P. Cooper, K. Degtyarenko, A. J. Geddes, D. R. Flower, M. P. Kelly, S. Lott, K. M. Measures, D. J. Parry-Smith, D. N. Perkins, P. Scordis, D. Scott, C. Worledge, 'The PRINTS Database of Protein Fingerprints : A Novel Information Resource for Computational Molecular Biology,' J. Chem. Inf. Comput. Sci. 37, pp.417-424, 1997 https://doi.org/10.1021/ci960468e
  12. Laurent Falquet, Marco Pagni, Philipp Bucher, Nicolas Hulo, Christian J. A. Sigrist, Kay Hofmann, Amos Bairoch, 'The PROSITE database, its status in 2002,' Nucleic Acids Research, Vol.30, pp.235-238, 2002 https://doi.org/10.1093/nar/30.1.235
  13. Helen M. Berman, John Westbrook, Zukang Feng, Gary Gilliland, T. N. Bhat, Helge Weissig, Ilya N. Shindyalov, Philip E. Bourne, 'The Proten Data Bank,' Nucleic Acids Research, Vol.18, pp.235-242
  14. Etzold T., Ulyanov A., Argos P., 'SRS : information retrieval system for molecular biology data banks,' Methods Enzymol, pp.114-128, 1996
  15. Ramez Elmasri, Shamkant B. Navathe, 'Fundamentals of Database Systems,' Addison-Wesley, Reading, Massachusetts, 2000
  16. Philip Scordis, Darren R. Flower, Teresa K. Attwood, 'FingerPRINTScan : intellegent searching of the PRINTS motif database,' Bioinformatics, Vol.15, No.10, pp.799-806, 1999 https://doi.org/10.1093/bioinformatics/15.10.799
  17. T. K. Attwood, M. J. Blythe, D. R. Flower, A. Gaulton, J. E. Mabey, N.Maudling, L. McGregor, A. L. Mitchell, G. Moulton, K. Paine, P. Scordis, 'PRINTS and PRINTS-S shed light on protein ancestry,' Nucleic Acids Research, Vol.30, No.1, pp.239-241, 2002 https://doi.org/10.1093/nar/30.1.239
  18. Philipp Bucher, Kevin Karplus, Nicolas Moeri, Kay Hofmann, 'A Flexible Motif Search Technique Based on Generalized Profiles,' Comput. Chem., Vol.20, pp.3-24, 1996 https://doi.org/10.1016/S0097-8485(96)80003-9
  19. Doug Brutlag, 'Protein Structure & Motifs,' Biochemistry 201, Molecular Biology, 2000
  20. Gynthia Gibas, Per Jambeck, 'Developing Bioinformatics Computer Skills,' O'REILLY, pp.290-295, 2001
  21. Attwood, 'The Babel of Bioinformatics,' Science 290, pp.471-473, 2000 https://doi.org/10.1126/science.290.5491.471
  22. Florence Corpet, Florence Servant, Jerome Gouzy and Daniel Kahn, 'ProDom and ProDom-CG : tools for protein domain analysis and whole genome comparisons,' Nucleic Acids Research, Vol.28, No.1 pp.267-269, 2000 https://doi.org/10.1093/nar/28.1.267
  23. Barbara Eckman, Julia Rice, Bill Swope, 'Heterogeneous Data and Algorithm Integration in Bioinformatics,' ISMB, 10th International Conference Tutorial, 2002
  24. Steve Muench, 'Building Oracle XML Applications,' O'Reilly & Associates, Inc., pp.8-22, 2000
  25. Steven Holzner, 'Inside XML,' New Riders, pp.33-42, 2000
  26. Bill Brogden, Charis Minnick, 'JAVA Developer's Guide to E-Commerce with XML and JSP,' SYBEX Inc., 2001
  27. 최명종, 유재우, 최재영, '자바 개발자를 위한 XML', 흥룡과학출판사, 2003