Back to Search Start Over

Nucleotide-level distance metrics to quantify alternative splicing implemented in TranD.

Authors :
Nanni A
Titus-McQuillan J
Bankole KS
Pardo-Palacios F
Signor S
Vlaho S
Moskalenko O
Morse AM
Rogers RL
Conesa A
McIntyre LM
Source :
Nucleic acids research [Nucleic Acids Res] 2024 Mar 21; Vol. 52 (5), pp. e28.
Publication Year :
2024

Abstract

Advances in affordable transcriptome sequencing combined with better exon and gene prediction has motivated many to compare transcription across the tree of life. We develop a mathematical framework to calculate complexity and compare transcript models. Structural features, i.e. intron retention (IR), donor/acceptor site variation, alternative exon cassettes, alternative 5'/3' UTRs, are compared and the distance between transcript models is calculated with nucleotide level precision. All metrics are implemented in a PyPi package, TranD and output can be used to summarize splicing patterns for a transcriptome (1GTF) and between transcriptomes (2GTF). TranD output enables quantitative comparisons between: annotations augmented by empirical RNA-seq data and the original transcript models; transcript model prediction tools for longread RNA-seq (e.g. FLAIR versus Isoseq3); alternate annotations for a species (e.g. RefSeq vs Ensembl); and between closely related species. In C. elegans, Z. mays, D. melanogaster, D. simulans and H. sapiens, alternative exons were observed more frequently in combination with an alternative donor/acceptor than alone. Transcript models in RefSeq and Ensembl are linked and both have unique transcript models with empirical support. D. melanogaster and D. simulans, share many transcript models and long-read RNAseq data suggests that both species are under-annotated. We recommend combined references.<br /> (© The Author(s) 2024. Published by Oxford University Press on behalf of Nucleic Acids Research.)

Details

Language :
English
ISSN :
1362-4962
Volume :
52
Issue :
5
Database :
MEDLINE
Journal :
Nucleic acids research
Publication Type :
Academic Journal
Accession number :
38340337
Full Text :
https://doi.org/10.1093/nar/gkae056