CLASS2: Accurate and efficient splice variant annotation from RNA-seq reads

Research output: Contribution to journalArticlepeer-review

24 Scopus citations

Abstract

Next generation sequencing of cellular RNA is making it possible to characterize genes and alternative splicing in unprecedented detail. However, designing bioinformatics tools to accurately capture splicing variation has proven difficult. Current programs can find major isoforms of a gene but miss lower abundance variants, or are sensitive but imprecise. CLASS2 is a novel open source tool for accurate genome-guided transcriptome assembly from RNA-seq reads based on the model of splice graph. An extension of our program CLASS, CLASS2 jointly optimizes read patterns and the number of supporting reads to score and prioritize transcripts, implemented in a novel, scalable and efficient dynamic programming algorithm. When compared against reference programs, CLASS2 had the best overall accuracy and could detect up to twice as many splicing events with precision similar to the best reference program. Notably, it was the only tool to produce consistently reliable transcript models for a wide range of applications and sequencing strategies, including ribosomal RNA-depleted samples. Lightweight and multi-threaded, CLASS2 requires <3GB RAM and can analyze a 350 million read set within hours, and can be widely applied to transcriptomics studies ranging from clinical RNA sequencing, to alternative splicing analyses, and to the annotation of new genomes.

Original languageEnglish (US)
Article numbere98
JournalNucleic acids research
Volume44
Issue number10
DOIs
StatePublished - Jun 2 2016

ASJC Scopus subject areas

  • Genetics

Fingerprint

Dive into the research topics of 'CLASS2: Accurate and efficient splice variant annotation from RNA-seq reads'. Together they form a unique fingerprint.

Cite this