Protein-Protein Interactions Efficiently Modeled by Residue Cluster Classes
- PMID: 32640745
- PMCID: PMC7370293
- DOI: 10.3390/ijms21134787
Protein-Protein Interactions Efficiently Modeled by Residue Cluster Classes
Abstract
Predicting protein-protein interactions (PPI) represents an important challenge in structural bioinformatics. Current computational methods display different degrees of accuracy when predicting these interactions. Different factors were proposed to help improve these predictions, including choosing the proper descriptors of proteins to represent these interactions, among others. In the current work, we provide a representative protein structure that is amenable to PPI classification using machine learning approaches, referred to as residue cluster classes. Through sampling and optimization, we identified the best algorithm-parameter pair to classify PPI from more than 360 different training sets. We tested these classifiers against PPI datasets that were not included in the training set but shared sequence similarity with proteins in the training set to reproduce the situation of most proteins sharing sequence similarity with others. We identified a model with almost no PPI error (96-99% of correctly classified instances) and showed that residue cluster classes of protein pairs displayed a distinct pattern between positive and negative protein interactions. Our results indicated that residue cluster classes are structural features relevant to model PPI and provide a novel tool to mathematically model the protein structure/function relationship.
Keywords: machine learning; protein–protein interaction; residue cluster class.
Conflict of interest statement
The authors declare no conflict of interest. The funders had no role in the design of the study, in the collection, analyses, or interpretation of data, in the writing of the manuscript, or in the decision to publish the results.
Figures









Similar articles
-
Prediction of Protein-Protein Interaction via co-occurring Aligned Pattern Clusters.Methods. 2016 Nov 1;110:26-34. doi: 10.1016/j.ymeth.2016.07.018. Epub 2016 Jul 27. Methods. 2016. PMID: 27476008
-
Protein-protein interaction site predictions with three-dimensional probability distributions of interacting atoms on protein surfaces.PLoS One. 2012;7(6):e37706. doi: 10.1371/journal.pone.0037706. Epub 2012 Jun 6. PLoS One. 2012. PMID: 22701576 Free PMC article.
-
False positive reduction in protein-protein interaction predictions using gene ontology annotations.BMC Bioinformatics. 2007 Jul 23;8:262. doi: 10.1186/1471-2105-8-262. BMC Bioinformatics. 2007. PMID: 17645798 Free PMC article.
-
Recent advances in predicting and modeling protein-protein interactions.Trends Biochem Sci. 2023 Jun;48(6):527-538. doi: 10.1016/j.tibs.2023.03.003. Epub 2023 Apr 14. Trends Biochem Sci. 2023. PMID: 37061423 Review.
-
Recent progresses in the application of machine learning approach for predicting protein functional class independent of sequence similarity.Proteomics. 2006 Jul;6(14):4023-37. doi: 10.1002/pmic.200500938. Proteomics. 2006. PMID: 16791826 Review.
References
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources