Differential expression analysis of RNA-seq data at single-base resolution

Alyssa C. Frazee, Sarven Sabunciyan, Kasper D. Hansen, Rafael A. Irizarry, Jeffrey T. Leek

Research output: Contribution to journalArticlepeer-review

30 Scopus citations

Abstract

RNA-sequencing (RNA-seq) is a flexible technology for measuring genome-wide expression that is rapidly replacing microarrays as costs become comparable. Current differential expression analysis methods for RNA-seq data fall into two broad classes: (1) methods that quantify expression within the boundaries of genes previously published in databases and (2) methods that attempt to reconstruct full length RNA transcripts. The first class cannot discover differential expression outside of previously known genes. While the second approach does possess discovery capabilities, statistical analysis of differential expression is complicated by the ambiguity and variability incurred while assembling transcripts and estimating their abundances. Here, we propose a novel method that first identifies differentially expressed regions (DERs) of interest by assessing differential expression at each base of the genome. The method then segments the genome into regions comprised of bases showing similar differential expression signal, and then assigns a measure of statistical significance to each region. Optionally, DERs can be annotated using a reference database of genomic features. We compare our approach with leading competitors from both current classes of differential expression methods and highlight the strengths and weaknesses of each. A software implementation of our method is available on github.

Original languageEnglish (US)
Pages (from-to)413-426
Number of pages14
JournalBiostatistics
Volume15
Issue number3
DOIs
StatePublished - Jul 2014

Keywords

  • Bioinformatics
  • Differential expression
  • False discovery rate
  • Genomics
  • RNA sequencing

ASJC Scopus subject areas

  • Statistics and Probability
  • Statistics, Probability and Uncertainty

Fingerprint

Dive into the research topics of 'Differential expression analysis of RNA-seq data at single-base resolution'. Together they form a unique fingerprint.

Cite this