Back to Search Start Over

JUM is a computational method for comprehensive annotation-free analysis of alternative pre-mRNA splicing patterns

Authors :
Qingqing Wang
Donald C. Rio
Source :
Proceedings of the National Academy of Sciences of the United States of America, vol 115, iss 35
Publication Year :
2018
Publisher :
Proceedings of the National Academy of Sciences, 2018.

Abstract

Alternative pre-mRNA splicing (AS) greatly diversifies metazoan transcriptomes and proteomes and is crucial for gene regulation. Current computational analysis methods of AS from Illumina RNA-sequencing data rely on preannotated libraries of known spliced transcripts, which hinders AS analysis with poorly annotated genomes and can further mask unknown AS patterns. To address this critical bioinformatics problem, we developed a method called the junction usage model (JUM) that uses a bottom-up approach to identify, analyze, and quantitate global AS profiles without any prior transcriptome annotations. JUM accurately reports global AS changes in terms of the five conventional AS patterns and an additional “composite” category composed of inseparable combinations of conventional patterns. JUM stringently classifies the difficult and disease-relevant pattern of intron retention (IR), reducing the false positive rate of IR detection commonly seen in other annotation-based methods to near-negligible rates. When analyzing AS in RNA samples derived from Drosophila heads, human tumors, and human cell lines bearing cancer-associated splicing factor mutations, JUM consistently identified approximately twice the number of novel AS events missed by other methods. Computational simulations showed JUM exhibits a 1.2 to 4.8 times higher true positive rate at a fixed cutoff of 5% false discovery rate. In summary, JUM provides a framework and improved method that removes the necessity for transcriptome annotations and enables the detection, analysis, and quantification of AS patterns in complex metazoan transcriptomes with superior accuracy.

Details

ISSN :
10916490 and 00278424
Volume :
115
Database :
OpenAIRE
Journal :
Proceedings of the National Academy of Sciences
Accession number :
edsair.doi.dedup.....0c3f1e8b82b92c1ffb1ae3a211096d82
Full Text :
https://doi.org/10.1073/pnas.1806018115