Joint Bounding of Peaks Across Samples Improves Differential Analysis in Mass Spectrometry-Based Metabolomics

Research output: Contribution to journalArticlepeer-review


As mass spectrometry-based metabolomics becomes more widely used in biomedical research, it is important to revisit existing data analysis paradigms. Existing data preprocessing efforts have largely focused on methods which start by extracting features separately from each sample, followed by a subsequent attempt to group features across samples to facilitate comparisons. We show that this preprocessing approach leads to unnecessary variability in peak quantifications that adversely impacts downstream analysis. We present a new method, bakedpi, for the preprocessing of both centroid and profile mode metabolomics data that relies on an intensity-weighted bivariate kernel density estimation on a pooling of all samples to detect peaks. This new method reduces this unnecessary quantification variability and increases power in downstream differential analysis. (Figure Presented).

Original languageEnglish (US)
Pages (from-to)3517-3523
Number of pages7
JournalAnalytical Chemistry
Issue number6
StatePublished - Mar 21 2017

ASJC Scopus subject areas

  • Analytical Chemistry

Cite this