Back to Search Start Over

Validation of Splicing Events in Transcriptome Sequencing Data.

Authors :
Kaisers W
Ptok J
Schwender H
Schaal H
Source :
International journal of molecular sciences [Int J Mol Sci] 2017 May 23; Vol. 18 (6). Date of Electronic Publication: 2017 May 23.
Publication Year :
2017

Abstract

Genomic alignments of sequenced cellular messenger RNA contain gapped alignments which are interpreted as consequence of intron removal. The resulting gap-sites, genomic locations of alignment gaps, are landmarks representing potential splice-sites. As alignment algorithms report gap-sites with a considerable false discovery rate, validations are required. We describe two quality scores, gap quality score ( gqs ) and weighted gap information score ( wgis ), developed for validation of putative splicing events: While gqs solely relies on alignment data wgis additionally considers information from the genomic sequence. FASTQ files obtained from 54 human dermal fibroblast samples were aligned against the human genome (GRCh38) using TopHat and STAR aligner. Statistical properties of gap-sites validated by gqs and wgis were evaluated by their sequence similarity to known exon-intron borders. Within the 54 samples, TopHat identifies 1,000,380 and STAR reports 6,487,577 gap-sites. Due to the lack of strand information, however, the percentage of identified GT-AG gap-sites is rather low. While gap-sites from TopHat contain ≈89% GT-AG, gap-sites from STAR only contain ≈42% GT-AG dinucleotide pairs in merged data from 54 fibroblast samples. Validation with gqs yields 156,251 gap-sites from TopHat alignments and 166,294 from STAR alignments. Validation with wgis yields 770,327 gap-sites from TopHat alignments and 1,065,596 from STAR alignments. Both alignment algorithms, TopHat and STAR, report gap-sites with considerable false discovery rate, which can drastically be reduced by validation with gqs and wgis .<br />Competing Interests: The authors declare no conflict of interest.

Details

Language :
English
ISSN :
1422-0067
Volume :
18
Issue :
6
Database :
MEDLINE
Journal :
International journal of molecular sciences
Publication Type :
Academic Journal
Accession number :
28545234
Full Text :
https://doi.org/10.3390/ijms18061110