Machine Learnable Fold Space Representation based on Residue Cluster Classes
- PMID: 26366526
- DOI: 10.1016/j.compbiolchem.2015.07.010
Abstract
Motivation: Protein fold space is a conceptual framework in which all possible protein folds exist and in which ideas about protein structure, function and evolution can be analyzed. Classification of protein folds in this space is commonly achieved with similarity indexes and/or machine learning approaches, each with its own limitations.
Results: We propose a method for constructing a compact vector space model of protein fold space by representing each protein structure by its residues' local contacts. We developed an efficient method to statistically test for the separability of points in a space and showed that our protein fold space representation is learnable by any machine-learning algorithm.
Availability: An API is freely available at https://code.google.com/p/pyrcc/.
Copyright © 2015 Elsevier Ltd. All rights reserved.