Back to Search Start Over

Systematic assessment of long-read RNA-seq methods for transcript identification and quantification

Authors :
Pardo-Palacios, Francisco J.
Wang, Dingjie
Reese, Fairlie
Diekhans, Mark
Carbonell-Sala, Sílvia
Williams, Brian
Loveland, Jane E.
Adams, Matthew S.
Balderrama-Gutierrez, Gabriela
Behera, Amit K.
María, Maite De
Gonzalez, Jose M.
Hunt, Toby
Lagarde, Julien
Li, Haoran
Liang, Cindy E.
Prjibelski, Andrey D.
Sheynkman, Leon
Amador, David Moraga
Barnes, If
Berry, Andrew
Çelik, Muhammed Hasan
Garcia-Reyero, Natàlia
Goetz, Stefan
Kondratova, Liudmyla
Martinez-Tomas, Jorge
Menor, Carlos
Mudge, Jonathan M.
Paniagua, Alejandro
Suner, Marie-Marthe
Takahashi, Hazuki
Tang, Alison D.
Youngworth, Ingrid Ashley
Carninci, Piero
Denslow, Nancy
Guigó, Roderic
Hunter, Margaret E.
Tilgner, Hagen U.
Wold, Barbara J.
Vollmers, Christopher
Frankish, Adam
Au, Kin Fai
Sheynkman, Gloria M.
Conesa, Ana
Mortazavi, Ali
Brooks, Angela N.
Publication Year :
2022
Publisher :
figshare, 2022.

Abstract

With increased usage of long-read sequencing technologies to perform transcriptome analyses, there becomes a greater need to evaluate different methodologies including library preparation, sequencing platform, and computational analysis tools. Here, we report the study design of a community effort called the Long-read RNA-Seq Genome Annotation Assessment Project (LRGASP) Consortium, whose goals are characterizing the strengths and remaining challenges in using long-read approaches to identify and quantify the transcriptomes of both model and non-model organisms. The LRGASP organizers have generated cDNA and direct RNA datasets in human, mouse, and manatee samples using different protocols followed by sequencing on Illumina, Pacific Biosciences, and Oxford Nanopore Technologies platforms. Participants will use the provided data to submit predictions for three challenges: transcript isoform detection with a high-quality genome, transcript isoform quantification, and de novo transcript isoform identification. Evaluators from different institutions will determine which pipelines have the highest accuracy for a variety of metrics using benchmarks that include spike-in synthetic transcripts, simulated data, and a set of undisclosed, manually curated transcripts by GENCODE. We also describe plans for experimental validation of predictions that are platform-specific and computational tool-specific. We believe that a community effort to evaluate long-read RNA-seq methods will help move the field toward a better consensus on the best approaches to use for transcriptome analyses. Items: The LRGASP Registered Report Supplementary File Supplementary Figures Supplementary Table 1

Details

Database :
OpenAIRE
Accession number :
edsair.doi.dedup.....8ea12619c931df09a3d6510aec9227b0
Full Text :
https://doi.org/10.6084/m9.figshare.19642383.v1