TY - JOUR
T1 - Unification of miRNA and isomiR research
T2 - The mirGFF3 format and the mirtop API
AU - Desvignes, Thomas
AU - Loher, Phillipe
AU - Eilbeck, Karen
AU - Ma, Jeffery
AU - Urgese, Gianvito
AU - Fromm, Bastian
AU - Sydes, Jason
AU - Aparicio-Puerta, Ernesto
AU - Barrera, Victor
AU - Espín, Roderic
AU - Londin, Eric
AU - Telonis, Aristeidis G.
AU - Ficarra, Elisa
AU - Friedländer, Marc R.
AU - Postlethwait, John H.
AU - Rigoutsos, Isidore
AU - Hackenberg, Michael
AU - Vlachos, Ioannis S.
AU - Halushka, Marc K.
AU - Pantano, Lorena
N1 - Publisher Copyright:
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
Copyright:
Copyright 2020 Elsevier B.V., All rights reserved.
PY - 2018/12/25
Y1 - 2018/12/25
N2 - Background MicroRNAs (miRNAs) are small RNA molecules (∼22 nucleotide long) involved in post-transcriptional gene regulation. Advances in high-throughput sequencing technologies led to the discovery of isomiRs, which are miRNA sequence variants. While many miRNA-seq analysis tools exist, a lack of consensus on miRNA/isomiR analyses exists, and the resulting diversity of output formats hinders accurate comparisons between tools and precludes data sharing and the development of common downstream analysis methods. Findings To overcome this situation, we present here a community-based project, miRTOP (miRNA Transcriptomic Open Project) working towards the optimization of miRNA analyses. The aim of miRTOP is to promote the development of downstream analysis tools that are compatible with any existing detection and quantification tool. Based on the existing GFF3 format, we first created a new standard format, mirGFF3, for the output of miRNA/isomiR detection and quantification results from small RNA-seq data. Additionally, we developed a command line Python tool, ‘mirtop’, to manage the mirGFF3 format. Currently, mirtop can convert into mirGFF3 the outputs of commonly used pipelines, such as seqbuster, miRge2.0, isomiR-SEA, sRNAbench, and Prost!, as well as BAM files. Its open architecture enables any tool or pipeline to output results in mirGFF3. Conclusions Collectively a comprehensive isomiR categorization system, along with the accompanying mirGFF3 and mirtop API provide a complete solution for the standardization of miRNA and isomiR analysis, enabling data sharing, reporting, comparative analyses, and benchmarking, while promoting the development of common miRNA methods focusing on downstream steps to miRNA detection, annotation, and quantification.
AB - Background MicroRNAs (miRNAs) are small RNA molecules (∼22 nucleotide long) involved in post-transcriptional gene regulation. Advances in high-throughput sequencing technologies led to the discovery of isomiRs, which are miRNA sequence variants. While many miRNA-seq analysis tools exist, a lack of consensus on miRNA/isomiR analyses exists, and the resulting diversity of output formats hinders accurate comparisons between tools and precludes data sharing and the development of common downstream analysis methods. Findings To overcome this situation, we present here a community-based project, miRTOP (miRNA Transcriptomic Open Project) working towards the optimization of miRNA analyses. The aim of miRTOP is to promote the development of downstream analysis tools that are compatible with any existing detection and quantification tool. Based on the existing GFF3 format, we first created a new standard format, mirGFF3, for the output of miRNA/isomiR detection and quantification results from small RNA-seq data. Additionally, we developed a command line Python tool, ‘mirtop’, to manage the mirGFF3 format. Currently, mirtop can convert into mirGFF3 the outputs of commonly used pipelines, such as seqbuster, miRge2.0, isomiR-SEA, sRNAbench, and Prost!, as well as BAM files. Its open architecture enables any tool or pipeline to output results in mirGFF3. Conclusions Collectively a comprehensive isomiR categorization system, along with the accompanying mirGFF3 and mirtop API provide a complete solution for the standardization of miRNA and isomiR analysis, enabling data sharing, reporting, comparative analyses, and benchmarking, while promoting the development of common miRNA methods focusing on downstream steps to miRNA detection, annotation, and quantification.
KW - file format
KW - isomiR
KW - microRNA
KW - reproducibility
KW - small RNA-seq
KW - standardization
UR - http://www.scopus.com/inward/record.url?scp=85093544757&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85093544757&partnerID=8YFLogxK
U2 - 10.1101/505222
DO - 10.1101/505222
M3 - Article
AN - SCOPUS:85093544757
JO - Advances in Water Resources
JF - Advances in Water Resources
SN - 0309-1708
ER -