SQuIRE reveals locus-specific regulation of interspersed repeat expression

Wan R. Yang, Daniel Ardeljan, Clarissa N. Pacyna, Lindsay M. Payer, Kathleen Burns

Research output: Contribution to journalArticle

Abstract

Transposable elements (TEs) are interspersed repeat sequences that make up much of the human genome. Their expression has been implicated in development and disease. However, TE-derived RNA-seq reads are difficult to quantify. Past approaches have excluded these reads or aggregated RNA expression to subfamilies shared by similar TE copies, sacrificing quantitative accuracy or the genomic context necessary to understand the basis of TE transcription. As a result, the effects of TEs on gene expression and associated phenotypes are not well understood. Here, we present Software for Quantifying Interspersed Repeat Expression (SQuIRE), the first RNA-seq analysis pipeline that provides a quantitative and locus-specific picture of TE expression (https://github.com/wyang17/SQuIRE). SQuIRE is an accurate and user-friendly tool that can be used for a variety of species. We applied SQuIRE to RNA-seq from normal mouse tissues and a Drosophila model of amyotrophic lateral sclerosis. In both model organisms, we recapitulated previously reported TE subfamily expression levels and revealed locus-specific TE expression. We also identified differences in TE transcription patterns relating to transcript type, gene expression and RNA splicing that would be lost with other approaches using subfamily-level analyses. Altogether, our findings illustrate the importance of studying TE transcription with locus-level resolution.

Original languageEnglish (US)
Pages (from-to)e27
JournalNucleic acids research
Volume47
Issue number5
DOIs
StatePublished - Mar 18 2019

Fingerprint

DNA Transposable Elements
Software
RNA
RNA Splicing
Gene Expression
Amyotrophic Lateral Sclerosis
Human Genome
Drosophila
Phenotype

ASJC Scopus subject areas

  • Genetics

Cite this

SQuIRE reveals locus-specific regulation of interspersed repeat expression. / Yang, Wan R.; Ardeljan, Daniel; Pacyna, Clarissa N.; Payer, Lindsay M.; Burns, Kathleen.

In: Nucleic acids research, Vol. 47, No. 5, 18.03.2019, p. e27.

Research output: Contribution to journalArticle

Yang, Wan R. ; Ardeljan, Daniel ; Pacyna, Clarissa N. ; Payer, Lindsay M. ; Burns, Kathleen. / SQuIRE reveals locus-specific regulation of interspersed repeat expression. In: Nucleic acids research. 2019 ; Vol. 47, No. 5. pp. e27.
@article{ef4d37a23f9045c2a0dccc26900d22b6,
title = "SQuIRE reveals locus-specific regulation of interspersed repeat expression",
abstract = "Transposable elements (TEs) are interspersed repeat sequences that make up much of the human genome. Their expression has been implicated in development and disease. However, TE-derived RNA-seq reads are difficult to quantify. Past approaches have excluded these reads or aggregated RNA expression to subfamilies shared by similar TE copies, sacrificing quantitative accuracy or the genomic context necessary to understand the basis of TE transcription. As a result, the effects of TEs on gene expression and associated phenotypes are not well understood. Here, we present Software for Quantifying Interspersed Repeat Expression (SQuIRE), the first RNA-seq analysis pipeline that provides a quantitative and locus-specific picture of TE expression (https://github.com/wyang17/SQuIRE). SQuIRE is an accurate and user-friendly tool that can be used for a variety of species. We applied SQuIRE to RNA-seq from normal mouse tissues and a Drosophila model of amyotrophic lateral sclerosis. In both model organisms, we recapitulated previously reported TE subfamily expression levels and revealed locus-specific TE expression. We also identified differences in TE transcription patterns relating to transcript type, gene expression and RNA splicing that would be lost with other approaches using subfamily-level analyses. Altogether, our findings illustrate the importance of studying TE transcription with locus-level resolution.",
author = "Yang, {Wan R.} and Daniel Ardeljan and Pacyna, {Clarissa N.} and Payer, {Lindsay M.} and Kathleen Burns",
year = "2019",
month = "3",
day = "18",
doi = "10.1093/nar/gky1301",
language = "English (US)",
volume = "47",
pages = "e27",
journal = "Nucleic Acids Research",
issn = "1362-4962",
publisher = "Oxford University Press",
number = "5",

}

TY - JOUR

T1 - SQuIRE reveals locus-specific regulation of interspersed repeat expression

AU - Yang, Wan R.

AU - Ardeljan, Daniel

AU - Pacyna, Clarissa N.

AU - Payer, Lindsay M.

AU - Burns, Kathleen

PY - 2019/3/18

Y1 - 2019/3/18

N2 - Transposable elements (TEs) are interspersed repeat sequences that make up much of the human genome. Their expression has been implicated in development and disease. However, TE-derived RNA-seq reads are difficult to quantify. Past approaches have excluded these reads or aggregated RNA expression to subfamilies shared by similar TE copies, sacrificing quantitative accuracy or the genomic context necessary to understand the basis of TE transcription. As a result, the effects of TEs on gene expression and associated phenotypes are not well understood. Here, we present Software for Quantifying Interspersed Repeat Expression (SQuIRE), the first RNA-seq analysis pipeline that provides a quantitative and locus-specific picture of TE expression (https://github.com/wyang17/SQuIRE). SQuIRE is an accurate and user-friendly tool that can be used for a variety of species. We applied SQuIRE to RNA-seq from normal mouse tissues and a Drosophila model of amyotrophic lateral sclerosis. In both model organisms, we recapitulated previously reported TE subfamily expression levels and revealed locus-specific TE expression. We also identified differences in TE transcription patterns relating to transcript type, gene expression and RNA splicing that would be lost with other approaches using subfamily-level analyses. Altogether, our findings illustrate the importance of studying TE transcription with locus-level resolution.

AB - Transposable elements (TEs) are interspersed repeat sequences that make up much of the human genome. Their expression has been implicated in development and disease. However, TE-derived RNA-seq reads are difficult to quantify. Past approaches have excluded these reads or aggregated RNA expression to subfamilies shared by similar TE copies, sacrificing quantitative accuracy or the genomic context necessary to understand the basis of TE transcription. As a result, the effects of TEs on gene expression and associated phenotypes are not well understood. Here, we present Software for Quantifying Interspersed Repeat Expression (SQuIRE), the first RNA-seq analysis pipeline that provides a quantitative and locus-specific picture of TE expression (https://github.com/wyang17/SQuIRE). SQuIRE is an accurate and user-friendly tool that can be used for a variety of species. We applied SQuIRE to RNA-seq from normal mouse tissues and a Drosophila model of amyotrophic lateral sclerosis. In both model organisms, we recapitulated previously reported TE subfamily expression levels and revealed locus-specific TE expression. We also identified differences in TE transcription patterns relating to transcript type, gene expression and RNA splicing that would be lost with other approaches using subfamily-level analyses. Altogether, our findings illustrate the importance of studying TE transcription with locus-level resolution.

UR - http://www.scopus.com/inward/record.url?scp=85062842878&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85062842878&partnerID=8YFLogxK

U2 - 10.1093/nar/gky1301

DO - 10.1093/nar/gky1301

M3 - Article

VL - 47

SP - e27

JO - Nucleic Acids Research

JF - Nucleic Acids Research

SN - 1362-4962

IS - 5

ER -