Covering all your bases: incorporating intron signal from RNA-seq data.

Authors :
Lee S
Zhang AY
Su S
Ng AP
Holik AZ
Asselin-Labat ML
Ritchie ME
Law CW
Source :
NAR genomics and bioinformatics [NAR Genom Bioinform] 2020 Sep 22; Vol. 2 (3), pp. lqaa073. Date of Electronic Publication: 2020 Sep 22 (Print Publication: 2020).
Publication Year :
2020

Abstract

RNA-seq datasets can contain millions of intron reads per library that are typically removed from downstream analysis. Only reads overlapping annotated exons are considered informative, since mature mRNA is assumed to be the major component sequenced, especially for poly(A) RNA libraries. In this study, we show that intron reads are informative and, through exploratory data analysis of read coverage, that intron signal is representative of both pre-mRNAs and intron retention. We demonstrate how intron reads can be utilized in differential expression analysis using our index method, where a unique set of differentially expressed genes can be detected using intron counts. In exploring read coverage, we also developed the superintronic software, which quickly and robustly calculates user-defined summary statistics for exonic and intronic regions. Across multiple datasets, superintronic enabled us to identify several genes with distinctly retained introns that had coverage levels similar to those of neighbouring exons. The work and ideas presented in this paper are the first of their kind to consider multiple biological sources for intron reads through exploratory data analysis, minimizing bias in discovery and interpretation of results. Our findings open up possibilities for further methods development for intron reads and RNA-seq data in general.
(© The Author(s) 2019. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics.)

Details

Language :
English
ISSN :
2631-9268
Volume :
2
Issue :
3
Database :
MEDLINE
Journal :
NAR genomics and bioinformatics
Publication Type :
Academic Journal
Accession number :
33575621
Full Text :
https://doi.org/10.1093/nargab/lqaa073