A direct approach to estimating false discovery rates conditional on covariates

Simina M. Boca, Jeffrey T. Leek

Research output: Contribution to journalArticle

Abstract

Modern scientific studies from many diverse areas of research abound with multiple hypothesis testing concerns. The false discovery rate (FDR) is one of the most commonly used approaches for measuring and controlling error rates when performing multiple tests. Adaptive FDRs rely on an estimate of the proportion of null hypotheses among all the hypotheses being tested. This proportion is typically estimated once for each collection of hypotheses. Here, we propose a regression framework to estimate the proportion of null hypotheses conditional on observed covariates. This may then be used as a multiplication factor with the Benjamini–Hochberg adjusted p-values, leading to a plug-in FDR estimator. We apply our method to a genome-wise association meta-analysis for body mass index. In our framework, we are able to use the sample sizes for the individual genomic loci and the minor allele frequencies as covariates. We further evaluate our approach via a number of simulation scenarios. We provide an implementation of this novel method for estimating the proportion of null hypotheses in a regression framework as part of the Bioconductor package swfdr.

Original languageEnglish (US)
Article numbere6035
JournalPeerJ
Volume2018
Issue number12
DOIs
StatePublished - 2018

Keywords

  • Adaptive FDR
  • FDR regression
  • False discovery rates

ASJC Scopus subject areas

  • Neuroscience(all)
  • Biochemistry, Genetics and Molecular Biology(all)
  • Agricultural and Biological Sciences(all)

Fingerprint Dive into the research topics of 'A direct approach to estimating false discovery rates conditional on covariates'. Together they form a unique fingerprint.

  • Cite this