Protein-Protein Interaction Prediction using Interaction Significance Matrix

Jang, Woo-Hyuk;Jung, Suk-Hoon;Jung, Hwie-Sung;Hyun, Bo-Ra;Han, Dong-Soo;

Journal of KIISE:Software and Applications (한국정보과학회논문지:소프트웨어및응용)

Volume 36 Issue 10
/
Pages.851-860
/
2009
/
1229-6848(pISSN)

Korean Institute of Information Scientists and Engineers (한국정보과학회)

Protein-Protein Interaction Prediction using Interaction Significance Matrix

상호작용 중요도 행렬을 이용한 단백질-단백질 상호작용 예측

장우혁 (KAIST 정보통신공학과) ;
정석훈 (KAIST 정보통신공학과) ;
정휘성 (KAIST 정보통신공학과) ;
현보라 (KAIST 정보통신공학과) ;
한동수 (KAIST 전산학과)

Published : 2009.10.15

PDF KSCI

Download PDF

⟨ Previous Next ⟩

Abstract

Recently, among the computational methods of protein-protein interaction prediction, vast amounts of domain based methods originated from domain-domain relation consideration have been developed. However, it is true that multi domains collaboration is avowedly ignored because of computational complexity. In this paper, we implemented a protein interaction prediction system based the Interaction Significance matrix, which quantified an influence of domain combination pair on a protein interaction. Unlike conventional domain combination methods, IS matrix contains weighted domain combinations and domain combination pair power, which mean possibilities of domain collaboration and being the main body on a protein interaction. About 63% of sensitivity and 94% of specificity were measured when we use interaction data from DIP, IntAct and Pfam-A as a domain database. In addition, prediction accuracy gradually increased by growth of learning set size, The prediction software and learning data are currently available on the web site.

최근 계산을 통한 단백질 상호작용 예측 기법 중, 단백질 쌍이 포함하고 있는 도메인들 사이의 관계에 중점을 둔 도메인 정보 기반 예측 기법들이 다양하게 제안되고 있다. 하지만, 다수의 도메인 쌍들이 상호작용에 기여하는 정도를 정밀하게 반영하는 계산 기법은 드문 실정이다. 본 논문에서는 단백질 상호작용에 있어 도메인 조합 쌍의 상호작용 영향력을 수치화하여 반영한 상호작용 중요도 행렬을 고안하고 이를 기반으로 한 단백질 상호작용 예측 시스템을 구현한다. 일반적인 도메인 조합 기법과 달리, 상호작용 중요도 행렬에서는 상호작용을 위한 도메인간의 협업 확률이 고려된 Weighted 도메인 조합과, 다수의 Weighted 도메인 조합 중 실제 상호작용 주체가 될 확률을 도메인 조합 쌍의 힘(Domain Combination Pair Power, DCPPW)으로 수치화한다. DIP과 IntAct에서 얻어온 S. cerevisiae의 단백질 상호작용 데이터와 Pfam-A 도메인 정보를 사용한 정확도 검증 결과, 평균 63%의 민감도와 94%의 특이도를 확인하였으며, 학습집단의 증가에 따른 안정적인 예측 정확도 향상을 보였다. 본 논문에서 구현한 예측 시스템과 학습 데이터는 웹(http://code.google.com/p/prespi)을 통하여 내려 받을 수 있다.

Keywords

References

Marcotte, E., Pellegrini, M., Ng, H., Rice, D., Yeates, T., Eisenberg, D., 'Detecting protein function and protein-protein interactions from genome sequences,' Science, 285, pp.751-753, 1999
Szilagyi A, Grimm V, Arakaki A, Skolnick J, 'Prediction of physical protein-protein interactions,' Phys Biol., 2, S1-S16, 2005 https://doi.org/10.1088/1478-3975/2/2/S01
Sprinzak, E., Margalit, H., 'Correlated sequencesignatures as markers of protein-protein interaction,' J. Mol. Biol., 311, pp.681-692, 2001 https://doi.org/10.1006/jmbi.2001.4920
Deng, M., Mehta, S., Sun, F., Chen, T., 'Inferring domain-domain interactions from protein-protein interactions,' Genome Res., 12, pp.1540-1548, 2002 https://doi.org/10.1101/gr.153002
Chen, L., Wu, L., Y. W., Zhang, X., 'Inferring protein interactions from experimental data by association probabilistic method,' Proteins, 62, pp.833-837, 2006 https://doi.org/10.1002/prot.20783
Liu, Y., Liu, N., Zhao, H., 'Inferring proteinprotein interactions through high-throughput interaction data from diverse organisms,' Bioinformatics, 21, pp.3279-3285, 2005 https://doi.org/10.1093/bioinformatics/bti492
Dohkan, S., Koike, A., Takagi, T., 'Support vector machines for predicting protein-protein interactions,' Genome Inform, 14, pp.502-503, 2003
Riley, R., Lee, C., Sabatti, C., Eisenberg, D., 'Inferring protein domain interactions from databases of interacting proteins,' Genome Biology, 6,R89, 2005 https://doi.org/10.1186/gb-2005-6-10-r89
Moza, B., Buonpane, R., Zhu, P., Herfst, C., Rahman, A., McCormick, J., Kranz, D., Sundberg, E., 'Long-range cooperative binding effects in a T cell receptor variable domain,' Proc Natl Acad Sci, 103, pp.9867-9872, 2006 https://doi.org/10.1073/pnas.0600220103
S.H. Jung, H.Y. Hur, D. Kim, D.S. Han, 'Identification of Conserved Domain Combinations in S. cerevisiae Proteins,' Bioinformatics and Bioengineering, pp.14-20, 2007
J. Brodie and I. J. McEwan, 'Intra-domain communication between the nterminal and DNAbinding domains of the androgen receptor: modulation of androgen response element DNA binding,' Journal of Molecular Endocrinology, 34, pp.603-615, 2005 https://doi.org/10.1677/jme.1.01723
N. B. E. Ronne and K. Dano, 'Domain interplay in the urokinase receptor,' J. Biol. Chem., 217(37), 22, pp.885-22 894, 1996 https://doi.org/10.1074/jbc.271.37.22885
Han, D., Kim, H., Jang, W., Lee, S., Jung, S., 'PreSPI: a domain combination based prediction system for protein-protein interaction,' Nucl Acids Res, 32, pp.6312-6320, 2004 https://doi.org/10.1093/nar/gkh972
R.S. Wang, Y. Wang, L.Y. Wu, X.S. Zhang, L. Chen, 'Analysis on multi-domain cooperation for predicting protein-protein interactions,' BMC Bioinformatics, 8, pp.2-20, 2007 https://doi.org/10.1186/1471-2105-8-2

Journal of KIISE:Software and Applications (한국정보과학회논문지:소프트웨어및응용)

Protein-Protein Interaction Prediction using Interaction Significance Matrix

상호작용 중요도 행렬을 이용한 단백질-단백질 상호작용 예측

Abstract

Keywords

References

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)