TY - GEN
T1 - STRIKE
T2 - A protein-protein interaction classification approach
AU - Zaki, Nazar
AU - El-Hajj, Wassim
AU - Kamel, Hesham M.
AU - Sibai, Fadi
PY - 2011
Y1 - 2011
N2 - Protein-protein interaction has proven to be a valuable biological knowledge and an initial point for understanding how the cell internally works. In this chapter, we introduce a novel approach termed STRIKE which uses String Kernel to predict protein-protein interaction. STRIKE classifies protein pairs into "interacting" and "non-interacting" sets based solely on amino acid sequence information. The classification is performed by applying the string kernel approach, which has been shown to achieve good performance on text categorization and protein sequence classification. Two proteins are classified as "interacting" if they contain similar substrings of amino acids. Strings' similarity would allow one to infer homology which could lead to a very similar structural relationship. To evaluate the performance of STRIKE, we apply it to classify into "interacting" and "non-interacting" protein pairs. The dataset of the protein pairs are generated from the yeast protein interaction literature. The dataset is supported by different lines of experimental evidence. STRIKE was able to achieve reasonable improvement over the existing protein-protein interaction prediction methods.
AB - Protein-protein interaction has proven to be a valuable biological knowledge and an initial point for understanding how the cell internally works. In this chapter, we introduce a novel approach termed STRIKE which uses String Kernel to predict protein-protein interaction. STRIKE classifies protein pairs into "interacting" and "non-interacting" sets based solely on amino acid sequence information. The classification is performed by applying the string kernel approach, which has been shown to achieve good performance on text categorization and protein sequence classification. Two proteins are classified as "interacting" if they contain similar substrings of amino acids. Strings' similarity would allow one to infer homology which could lead to a very similar structural relationship. To evaluate the performance of STRIKE, we apply it to classify into "interacting" and "non-interacting" protein pairs. The dataset of the protein pairs are generated from the yeast protein interaction literature. The dataset is supported by different lines of experimental evidence. STRIKE was able to achieve reasonable improvement over the existing protein-protein interaction prediction methods.
KW - Amino acid sequencing
KW - Biological data mining and knowledge discovery
KW - Pattern classification and recognition
KW - Protein sequence analysis
KW - Protein-protein interaction
UR - http://www.scopus.com/inward/record.url?scp=79958001927&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79958001927&partnerID=8YFLogxK
U2 - 10.1007/978-1-4419-7046-6_26
DO - 10.1007/978-1-4419-7046-6_26
M3 - Conference contribution
C2 - 21431566
AN - SCOPUS:79958001927
SN - 9781441970459
T3 - Advances in Experimental Medicine and Biology
SP - 263
EP - 270
BT - Software Tools and Algorithms for Biological Systems
A2 - Arabnia, Hamid
A2 - Tran, Quoc-Nam
ER -