Testing the significance of microorganism identification by mass spectrometry and proteome database search

Fernando J. Pineda, Jeffrey S. Lin, Catherine Fenselau, Plamen A. Demirev

Research output: Contribution to journalArticle

Abstract

We derive and validate a simple statistical model that predicts the distribution of false matches between peaks in matrix-assisted laser desorption/ionization mass spectrometry data and proteins in proteome databases. The model allows us to calculate the significance of previously reported microorganism identification results. In particular, for Δm = ±1.5 Da, we find that the computed significance levels are sufficient to demonstrate the ability to identify microorganisms, provided the number of candidate microorganisms is limited to roughly three Escherichia coli-like or roughly 10 Bacillus subtilis-like microorganisms (in the sense of having roughly the same number of proteins per unit-mass interval). We conclude that, given the cluttered and incomplete nature of the data, it is likely that neither simple ranking nor simple hypothesis testing will be sufficient for truly robust microorganism identification over a large number of candidate microorganisms.

Original languageEnglish (US)
Pages (from-to)3739-3744
Number of pages6
JournalAnalytical chemistry
Volume72
Issue number16
DOIs
StatePublished - Aug 15 2000

ASJC Scopus subject areas

  • Analytical Chemistry

Fingerprint Dive into the research topics of 'Testing the significance of microorganism identification by mass spectrometry and proteome database search'. Together they form a unique fingerprint.

  • Cite this