An adaptive multivariate two-sample test with application to microbiome differential abundance analysis

Kalins Banerjee, Ni Zhao, Arun Srinivasan, Lingzhou Xue, Steven D. Hicks, Frank A. Middleton, Rongling Wu, Xiang Zhan

Research output: Contribution to journalArticle

Abstract

Differential abundance analysis is a crucial task in many microbiome studies, where the central goal is to identify microbiome taxa associated with certain biological or clinical conditions. There are two different modes of microbiome differential abundance analysis: the individual-based univariate differential abundance analysis and the group-based multivariate differential abundance analysis. The univariate analysis identifies differentially abundant microbiome taxa subject to multiple correction under certain statistical error measurements such as false discovery rate, which is typically complicated by the high-dimensionality of taxa and complex correlation structure among taxa. The multivariate analysis evaluates the overall shift in the abundance of microbiome composition between two conditions, which provides useful preliminary differential information for the necessity of followup validation studies. In this paper, we present a novel Adaptive multivariate two-sample test for Microbiome Differential Analysis (AMDA) to examine whether the composition of a taxa-set are different between two conditions. Our simulation studies and real data applications demonstrated that the AMDA test was often more powerful than several competing methods while preserving the correct type I error rate. A free implementation of our AMDA method in R software is available at https://github.com/xyz5074/AMDA.

Original languageEnglish (US)
Article number350
JournalFrontiers in Genetics
Volume10
Issue numberAPR
DOIs
StatePublished - Jan 1 2019

Fingerprint

Microbiota
Validation Studies
Software
Multivariate Analysis

Keywords

  • Adaptive microbiome differential analysis (AMDA)
  • Maximum mean discrepancy (MMD)
  • Multivariate two-sample test
  • Permutation
  • Subset testing
  • Taxa-set

ASJC Scopus subject areas

  • Molecular Medicine
  • Genetics
  • Genetics(clinical)

Cite this

An adaptive multivariate two-sample test with application to microbiome differential abundance analysis. / Banerjee, Kalins; Zhao, Ni; Srinivasan, Arun; Xue, Lingzhou; Hicks, Steven D.; Middleton, Frank A.; Wu, Rongling; Zhan, Xiang.

In: Frontiers in Genetics, Vol. 10, No. APR, 350, 01.01.2019.

Research output: Contribution to journalArticle

Banerjee, Kalins ; Zhao, Ni ; Srinivasan, Arun ; Xue, Lingzhou ; Hicks, Steven D. ; Middleton, Frank A. ; Wu, Rongling ; Zhan, Xiang. / An adaptive multivariate two-sample test with application to microbiome differential abundance analysis. In: Frontiers in Genetics. 2019 ; Vol. 10, No. APR.
@article{0192d8d4165849afb2d82477bf22789a,
title = "An adaptive multivariate two-sample test with application to microbiome differential abundance analysis",
abstract = "Differential abundance analysis is a crucial task in many microbiome studies, where the central goal is to identify microbiome taxa associated with certain biological or clinical conditions. There are two different modes of microbiome differential abundance analysis: the individual-based univariate differential abundance analysis and the group-based multivariate differential abundance analysis. The univariate analysis identifies differentially abundant microbiome taxa subject to multiple correction under certain statistical error measurements such as false discovery rate, which is typically complicated by the high-dimensionality of taxa and complex correlation structure among taxa. The multivariate analysis evaluates the overall shift in the abundance of microbiome composition between two conditions, which provides useful preliminary differential information for the necessity of followup validation studies. In this paper, we present a novel Adaptive multivariate two-sample test for Microbiome Differential Analysis (AMDA) to examine whether the composition of a taxa-set are different between two conditions. Our simulation studies and real data applications demonstrated that the AMDA test was often more powerful than several competing methods while preserving the correct type I error rate. A free implementation of our AMDA method in R software is available at https://github.com/xyz5074/AMDA.",
keywords = "Adaptive microbiome differential analysis (AMDA), Maximum mean discrepancy (MMD), Multivariate two-sample test, Permutation, Subset testing, Taxa-set",
author = "Kalins Banerjee and Ni Zhao and Arun Srinivasan and Lingzhou Xue and Hicks, {Steven D.} and Middleton, {Frank A.} and Rongling Wu and Xiang Zhan",
year = "2019",
month = "1",
day = "1",
doi = "10.3389/fgene.2019.00350",
language = "English (US)",
volume = "10",
journal = "Frontiers in Genetics",
issn = "1664-8021",
publisher = "Frontiers Media S. A.",
number = "APR",

}

TY - JOUR

T1 - An adaptive multivariate two-sample test with application to microbiome differential abundance analysis

AU - Banerjee, Kalins

AU - Zhao, Ni

AU - Srinivasan, Arun

AU - Xue, Lingzhou

AU - Hicks, Steven D.

AU - Middleton, Frank A.

AU - Wu, Rongling

AU - Zhan, Xiang

PY - 2019/1/1

Y1 - 2019/1/1

N2 - Differential abundance analysis is a crucial task in many microbiome studies, where the central goal is to identify microbiome taxa associated with certain biological or clinical conditions. There are two different modes of microbiome differential abundance analysis: the individual-based univariate differential abundance analysis and the group-based multivariate differential abundance analysis. The univariate analysis identifies differentially abundant microbiome taxa subject to multiple correction under certain statistical error measurements such as false discovery rate, which is typically complicated by the high-dimensionality of taxa and complex correlation structure among taxa. The multivariate analysis evaluates the overall shift in the abundance of microbiome composition between two conditions, which provides useful preliminary differential information for the necessity of followup validation studies. In this paper, we present a novel Adaptive multivariate two-sample test for Microbiome Differential Analysis (AMDA) to examine whether the composition of a taxa-set are different between two conditions. Our simulation studies and real data applications demonstrated that the AMDA test was often more powerful than several competing methods while preserving the correct type I error rate. A free implementation of our AMDA method in R software is available at https://github.com/xyz5074/AMDA.

AB - Differential abundance analysis is a crucial task in many microbiome studies, where the central goal is to identify microbiome taxa associated with certain biological or clinical conditions. There are two different modes of microbiome differential abundance analysis: the individual-based univariate differential abundance analysis and the group-based multivariate differential abundance analysis. The univariate analysis identifies differentially abundant microbiome taxa subject to multiple correction under certain statistical error measurements such as false discovery rate, which is typically complicated by the high-dimensionality of taxa and complex correlation structure among taxa. The multivariate analysis evaluates the overall shift in the abundance of microbiome composition between two conditions, which provides useful preliminary differential information for the necessity of followup validation studies. In this paper, we present a novel Adaptive multivariate two-sample test for Microbiome Differential Analysis (AMDA) to examine whether the composition of a taxa-set are different between two conditions. Our simulation studies and real data applications demonstrated that the AMDA test was often more powerful than several competing methods while preserving the correct type I error rate. A free implementation of our AMDA method in R software is available at https://github.com/xyz5074/AMDA.

KW - Adaptive microbiome differential analysis (AMDA)

KW - Maximum mean discrepancy (MMD)

KW - Multivariate two-sample test

KW - Permutation

KW - Subset testing

KW - Taxa-set

UR - http://www.scopus.com/inward/record.url?scp=85067862085&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85067862085&partnerID=8YFLogxK

U2 - 10.3389/fgene.2019.00350

DO - 10.3389/fgene.2019.00350

M3 - Article

C2 - 31068967

AN - SCOPUS:85067862085

VL - 10

JO - Frontiers in Genetics

JF - Frontiers in Genetics

SN - 1664-8021

IS - APR

M1 - 350

ER -