Back to Search
Start Over
KISSPLICE: de-novo calling alternative splicing events from RNA-seq data
- Source :
- BMC Bioinformatics, BMC Bioinformatics, BioMed Central, 2012, 13 (Suppl 6), pp.S5. ⟨10.1186/1471-2105-13-S6-S5⟩, BMC Bioinformatics, 2012, 13 (Suppl 6), pp.S5. ⟨10.1186/1471-2105-13-S6-S5⟩, BMC Bioinformatics (13), . (2012)
- Publication Year :
- 2012
- Publisher :
- HAL CCSD, 2012.
-
Abstract
- Background In this paper, we address the problem of identifying and quantifying polymorphisms in RNA-seq data when no reference genome is available, without assembling the full transcripts. Based on the fundamental idea that each polymorphism corresponds to a recognisable pattern in a De Bruijn graph constructed from the RNA-seq reads, we propose a general model for all polymorphisms in such graphs. We then introduce an exact algorithm, called KIS SPLICE, to extract alternative splicing events. Results We show that KIS SPLICE enables to identify more correct events than general purpose transcriptome assemblers. Additionally, on a 71 M reads dataset from human brain and liver tissues, KIS SPLICE identified 3497 alternative splicing events, out of which 56% are not present in the annotations, which confirms recent estimates showing that the complexity of alternative splicing has been largely underestimated so far. Conclusions We propose new models and algorithms for the detection of polymorphism in RNA-seq data. This opens the way to a new kind of studies on large HTS RNA-seq datasets, where the focus is not the global reconstruction of full-length transcripts, but local assembly of polymorphic regions. KIS SPLICE is available for download at http://alcovna.genouest.org/kissplice/.
- Subjects :
- Sequence analysis
0206 medical engineering
RNA-Seq
02 engineering and technology
Computational biology
Biology
Biochemistry
Genome
Polymorphism, Single Nucleotide
De Bruijn graph
GRAPHS
Transcriptome
03 medical and health sciences
symbols.namesake
Structural Biology
[SDV.BBM.GTP]Life Sciences [q-bio]/Biochemistry, Molecular Biology/Genomics [q-bio.GN]
Humans
natural sciences
Molecular Biology
030304 developmental biology
Genetics
0303 health sciences
Models, Statistical
Sequence Analysis, RNA
Applied Mathematics
Alternative splicing
Reference Standards
[SDV.BIBS]Life Sciences [q-bio]/Quantitative Methods [q-bio.QM]
GENOME
Computer Science Applications
Alternative Splicing
Exact algorithm
Proceedings
Tandem Repeat Sequences
symbols
DNA microarray
[INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM]
020602 bioinformatics
Algorithms
Subjects
Details
- Language :
- English
- ISSN :
- 14712105
- Database :
- OpenAIRE
- Journal :
- BMC Bioinformatics, BMC Bioinformatics, BioMed Central, 2012, 13 (Suppl 6), pp.S5. ⟨10.1186/1471-2105-13-S6-S5⟩, BMC Bioinformatics, 2012, 13 (Suppl 6), pp.S5. ⟨10.1186/1471-2105-13-S6-S5⟩, BMC Bioinformatics (13), . (2012)
- Accession number :
- edsair.doi.dedup.....c7ff16a9440072b19efa2bb3e1caa299
- Full Text :
- https://doi.org/10.1186/1471-2105-13-S6-S5⟩