ProMEX: a mass spectral reference database for proteins and protein phosphorylation sites
- PMID: 17587460
- PMCID: PMC1920535
- DOI: 10.1186/1471-2105-8-216
Abstract
Background: In the last decade, techniques were established for the large-scale, genome-wide analysis of proteins, RNA, and metabolites, and database solutions have been developed to manage the resulting data sets. The Golm Metabolome Database for metabolite data (GMD) is one such effort to make these data broadly available and to interconnect the different molecular levels of a biological system [1]. As interpreting new data in the light of existing data becomes increasingly important, these initiatives are an essential part of current and future systems biology.
Results: A mass spectral library consisting of experimentally derived tryptic peptide product ion spectra was generated using liquid chromatography coupled to ion trap mass spectrometry (LC-IT-MS). Protein samples derived from Arabidopsis thaliana, Chlamydomonas reinhardtii, Medicago truncatula, and Sinorhizobium meliloti were analysed. The database currently holds 4,557 manually validated spectra associated with 4,226 unique peptides from 1,367 proteins. It serves as a continuously growing reference data set and can be used for protein identification and quantification in uncharacterized biological samples. For peptide identification, several algorithms were implemented based on a recently published study on peptide mass fingerprinting [2] and tested for false positive and false negative rates. An algorithm that considers the intensity distribution when computing match correlation scores yielded the best results. As a proof of concept, an LC-IT-MS analysis of a tryptic leaf protein digest was converted to mzData format and searched against the mass spectral library. The utility of the library was also tested for the identification of phosphorylated tryptic peptides: in vivo phosphorylation sites of Arabidopsis thaliana proteins were included, and identification performance improved compared to genome-based search algorithms. Protein identification by ProMEX is linked to other levels of biological organization such as metabolite, pathway, and transcript data. The database is further connected to annotation and classification services via BioMoby.
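The idea of scoring a query spectrum against library spectra while taking peak intensities into account can be illustrated with a minimal sketch. The abstract does not specify the scoring function ProMEX uses, so the example below substitutes a generic normalized dot product (cosine similarity) over binned m/z values. The function names, the 1.0 m/z bin width, and the toy peak lists are all hypothetical and chosen only for illustration.

```python
import math
from collections import defaultdict


def bin_spectrum(peaks, bin_width=1.0):
    """Sum peak intensities into fixed-width m/z bins.

    `peaks` is an iterable of (m/z, intensity) pairs.
    The bin width is an assumption, not a ProMEX parameter.
    """
    binned = defaultdict(float)
    for mz, intensity in peaks:
        binned[int(mz / bin_width)] += intensity
    return binned


def cosine_score(query, reference, bin_width=1.0):
    """Normalized dot product of two binned spectra, in the range 0..1."""
    q = bin_spectrum(query, bin_width)
    r = bin_spectrum(reference, bin_width)
    shared_bins = q.keys() & r.keys()
    dot = sum(q[b] * r[b] for b in shared_bins)
    norm_q = math.sqrt(sum(v * v for v in q.values()))
    norm_r = math.sqrt(sum(v * v for v in r.values()))
    if norm_q == 0.0 or norm_r == 0.0:
        return 0.0  # an empty spectrum cannot match anything
    return dot / (norm_q * norm_r)


def best_match(query, library, bin_width=1.0):
    """Return (score, name) of the highest-scoring library entry."""
    scored = [
        (cosine_score(query, peaks, bin_width), name)
        for name, peaks in library.items()
    ]
    return max(scored) if scored else (0.0, None)


# Toy library with made-up peak lists (not real ProMEX spectra).
library = {
    "PEPTIDE_A": [(175.1, 50.0), (304.2, 100.0), (433.2, 30.0)],
    "PEPTIDE_B": [(147.1, 80.0), (262.1, 40.0), (375.2, 100.0)],
}
query = [(175.2, 45.0), (304.1, 110.0), (433.3, 25.0)]

score, name = best_match(query, library)
print(name, round(score, 3))
```

Because the score weights each shared bin by the product of the two intensities, a match on a few dominant fragment ions counts for more than a match on many low-intensity peaks, which is the general motivation for intensity-aware scoring. A production search would additionally filter candidates by precursor mass and estimate false positive rates.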
Conclusion: The ProMEX protein/peptide database is a mass spectral reference library capable of matching spectra from unknown samples for protein identification. The database allows text searches based on metadata such as experimental information about the samples, mass spectrometric instrument parameters, or unique protein identifiers such as AGI codes. ProMEX integrates proteomics data with other levels of molecular organization, including metabolite, pathway, and transcript information, and may thus become a useful resource for plant systems biology studies. The ProMEX mass spectral library is available at http://promex.mpimp-golm.mpg.de/.
References
- Wienkoop S, Larrainzar E, Niemann M, Gonzalez E, Lehmann U, Weckwerth W. Stable isotope-free quantitative shotgun proteomics combined with sample pattern recognition for rapid diagnostics - a case study in Medicago truncatula nodules. Journal of Separation Science. 2006;29:2793–2801. doi: 10.1002/jssc.200600290.
- Wienkoop S, Glinski M, Tanaka N, Tolstikov V, Fiehn O, Weckwerth W. Linking protein fractionation with multidimensional monolithic RP peptide chromatography/mass spectrometry enhances protein identification from complex mixtures even in the presence of abundant proteins. Rapid Communications in Mass Spectrometry. 2004;18:643–650. doi: 10.1002/rcm.1376.