TAPAS: tool for alternative polyadenylation site analysis
- Source :
- Bioinformatics. 34:2521-2529
- Publication Year :
- 2018
- Publisher :
- Oxford University Press (OUP), 2018.
Abstract
- Motivation: The length of the 3′ untranslated region (3′ UTR) of an mRNA is essential for many biological activities such as mRNA stability, sub-cellular localization, protein translation, protein binding and translation efficiency. Moreover, correlation between diseases and the shortening (or lengthening) of 3′ UTRs has been reported in the literature. This length is largely determined by the polyadenylation cleavage site in the mRNA. As alternative polyadenylation (APA) sites are common in mammalian genes, several tools have been published recently for detecting APA sites from RNA-Seq data or performing shortening/lengthening analysis. These tools consider either up to only two APA sites in a gene or only APA sites that occur in the last exon of a gene, although a gene may generally have more than two APA sites and an APA site may sometimes occur before the last exon. Furthermore, the tools are unable to integrate the analysis of shortening/lengthening events with APA site detection.
  Results: We propose a new tool, called TAPAS, for detecting novel APA sites from RNA-Seq data. It can deal with more than two APA sites in a gene as well as APA sites that occur before the last exon. The tool is based on an existing method for finding change points in time series data, but some filtration techniques are also adopted to remove change points that are likely false APA sites. It is then extended to identify APA sites that are expressed differently between two biological samples and genes that contain 3′ UTRs with shortening/lengthening events. Our extensive experiments on simulated and real RNA-Seq data demonstrate that TAPAS outperforms the existing tools for APA site detection or shortening/lengthening analysis significantly.
  Availability and implementation: https://github.com/arefeen/TAPAS
  Supplementary information: Supplementary data are available at Bioinformatics online.
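The abstract describes APA site detection as a change-point problem on a sequence of values. The sketch below illustrates that general idea only and is not the TAPAS implementation: it assumes the sequence is per-base read coverage along a 3′ UTR (an assumption, not stated in this record), and it finds a single split that minimizes the within-segment squared error. The function name `best_change_point` and the toy coverage values are hypothetical.

```python
from typing import List, Tuple


def best_change_point(coverage: List[float]) -> Tuple[int, float]:
    """Return (k, cost) for the single split coverage[:k] / coverage[k:]
    that minimizes the summed squared deviation from each segment's mean.

    Illustrative only: a generic one-change-point search, not the method
    used by TAPAS.
    """
    n = len(coverage)
    if n < 2:
        raise ValueError("need at least two positions to place a split")

    # Prefix sums of x and x^2 let us score any segment in O(1).
    prefix = [0.0]
    prefix_sq = [0.0]
    for x in coverage:
        prefix.append(prefix[-1] + x)
        prefix_sq.append(prefix_sq[-1] + x * x)

    def segment_cost(i: int, j: int) -> float:
        # Sum of squared deviations from the mean of coverage[i:j].
        total = prefix[j] - prefix[i]
        total_sq = prefix_sq[j] - prefix_sq[i]
        return total_sq - total * total / (j - i)

    best_k, best_cost = 1, float("inf")
    for k in range(1, n):  # both segments must be non-empty
        cost = segment_cost(0, k) + segment_cost(k, n)
        if cost < best_cost:
            best_k, best_cost = k, cost
    return best_k, best_cost


# Hypothetical toy signal: coverage drops sharply after position 5,
# as might happen downstream of a cleavage site.
toy_coverage = [30, 31, 29, 30, 30, 10, 11, 9, 10, 10]
split, cost = best_change_point(toy_coverage)
print(split, cost)
```

A real pipeline would need to handle multiple change points per gene and filter out splits caused by noise, which is what the abstract's "filtration techniques" refers to; neither is attempted here.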
- Subjects :
- 0301 basic medicine
Statistics and Probability
Untranslated region
Polyadenylation
education
Computational biology
Biology
Biochemistry
03 medical and health sciences
Exon
Polyadenylation site
0302 clinical medicine
mental disorders
Animals
Humans
RNA, Messenger
Protein translation
3' Untranslated Regions
Molecular Biology
Gene
Supplementary data
Messenger RNA
Sequence Analysis, RNA
Eukaryota
Original Papers
Computer Science Applications
Computational Mathematics
030104 developmental biology
Computational Theory and Mathematics
030220 oncology & carcinogenesis
Software
psychological phenomena and processes
Details
- ISSN :
- 1367-4811 and 1367-4803
- Volume :
- 34
- Database :
- OpenAIRE
- Journal :
- Bioinformatics
- Accession number :
- edsair.doi.dedup.....9e1295ab25f7b013a90c8c9d0cd53ed6