Back to Search
Start Over
Shedding light on dark genes: enhanced targeted resequencing by optimizing the combination of enrichment technology and DNA fragment length
- Source :
- Scientific Reports, Scientific Reports, Vol 10, Iss 1, Pp 1-11 (2020)
- Publication Year :
- 2020
- Publisher :
- Springer Science and Business Media LLC, 2020.
-
Abstract
- The exome contains many obscure regions difficult to explore with current short-read sequencing methods. Repetitious genomic regions prevent the unique alignment of reads, which is essential for the identification of clinically-relevant genetic variants. Long-read technologies attempt to resolve multiple-mapping regions, but they still produce many sequencing errors. Thus, a new approach is required to enlighten the obscure regions of the genome and rescue variants that would be otherwise neglected. This work aims to improve the alignment of multiple-mapping reads through the extension of the standard DNA fragment size. As Illumina can sequence fragments up to 550 bp, we tested different DNA fragment lengths using four major commercial WES platforms and found that longer DNA fragments achieved a higher genotypability. This metric, which indicates base calling calculated by combining depth of coverage with the confidence of read alignment, increased from hundreds to thousands of genes, including several associated with clinical phenotypes. While depth of coverage has been considered crucial for the assessment of WES performance, we demonstrated that genotypability has a greater impact in revealing obscure regions, with ~1% increase in variant calling in respect to shorter DNA fragments. Results confirmed that this approach enlightened many regions previously not explored.
- Subjects :
- 0301 basic medicine
Genotype
Sequence analysis
lcsh:Medicine
Genomics
Computational biology
Biology
Genome
Article
DNA sequencing
03 medical and health sciences
0302 clinical medicine
Genetics
Humans
Exome
lcsh:Science
Gene
Multidisciplinary
Genome, Human
lcsh:R
High-Throughput Nucleotide Sequencing
DNA
Sequence Analysis, DNA
Targeted resequencing
030104 developmental biology
030220 oncology & carcinogenesis
Next-generation sequencing
lcsh:Q
Base calling
Human genome
Sequence Analysis
Algorithms
Human
Subjects
Details
- ISSN :
- 20452322
- Volume :
- 10
- Database :
- OpenAIRE
- Journal :
- Scientific Reports
- Accession number :
- edsair.doi.dedup.....72b5c3dd8b17cd5b13aaba8ed796de35