Back to Search
Start Over
JUM is a computational method for comprehensive annotation-free analysis of alternative pre-mRNA splicing patterns
- Source :
- Proceedings of the National Academy of Sciences of the United States of America, vol 115, iss 35
- Publication Year :
- 2018
- Publisher :
- Proceedings of the National Academy of Sciences, 2018.
-
Abstract
- Alternative pre-mRNA splicing (AS) greatly diversifies metazoan transcriptomes and proteomes and is crucial for gene regulation. Current computational analysis methods of AS from Illumina RNA-sequencing data rely on preannotated libraries of known spliced transcripts, which hinders AS analysis with poorly annotated genomes and can further mask unknown AS patterns. To address this critical bioinformatics problem, we developed a method called the junction usage model (JUM) that uses a bottom-up approach to identify, analyze, and quantitate global AS profiles without any prior transcriptome annotations. JUM accurately reports global AS changes in terms of the five conventional AS patterns and an additional “composite” category composed of inseparable combinations of conventional patterns. JUM stringently classifies the difficult and disease-relevant pattern of intron retention (IR), reducing the false positive rate of IR detection commonly seen in other annotation-based methods to near-negligible rates. When analyzing AS in RNA samples derived from Drosophila heads, human tumors, and human cell lines bearing cancer-associated splicing factor mutations, JUM consistently identified approximately twice the number of novel AS events missed by other methods. Computational simulations showed JUM exhibits a 1.2 to 4.8 times higher true positive rate at a fixed cutoff of 5% false discovery rate. In summary, JUM provides a framework and improved method that removes the necessity for transcriptome annotations and enables the detection, analysis, and quantification of AS patterns in complex metazoan transcriptomes with superior accuracy.
- Subjects :
- 0301 basic medicine
False discovery rate
RNA Splicing
RNA-Seq
Computational biology
Biology
Genome
03 medical and health sciences
Splicing factor
Annotation
Genetic
Models
Neoplasms
RNA Precursors
Genetics
Animals
Humans
Drosophila Proteins
annotation-free
Computer Simulation
RNA, Neoplasm
Multidisciplinary
Models, Genetic
Human Genome
Intron
Molecular Sequence Annotation
alternative pre-mRNA splicing
Neoplasm Proteins
Drosophila melanogaster
030104 developmental biology
PNAS Plus
Networking and Information Technology R&D (NITRD)
Mutation
RNA splicing
RNA
Neoplasm
RNA Splicing Factors
False positive rate
RNA-seq
K562 Cells
Biotechnology
Subjects
Details
- ISSN :
- 10916490 and 00278424
- Volume :
- 115
- Database :
- OpenAIRE
- Journal :
- Proceedings of the National Academy of Sciences
- Accession number :
- edsair.doi.dedup.....0c3f1e8b82b92c1ffb1ae3a211096d82
- Full Text :
- https://doi.org/10.1073/pnas.1806018115