1. Chromosomal Junction Detection from Whole-Genome Sequencing on Formalin-Fixed, Paraffin-Embedded Tumors
- Author
-
Andrew L. Feldman, Stephen J. Murphy, Robert A. Sikkink, John C. Cheville, Farhad Kosari, Janet Schaefer Kline, George Vasmatzis, Benjamin R. Kipp, Vishnu Serla, Faye R. Harris, Sarah H. Johnson, Jin Jen, Yean Lee, Anurag Sharma, Eric D. Wieben, James B. Smadbeck, Giannoula Karagouga, Bruce W. Eckloff, Jesse S. Voss, and Marie Christine Aubry
- Subjects
Male ,0301 basic medicine ,endocrine system ,Tissue Fixation ,DNA Copy Number Variations ,Formalin fixed paraffin embedded ,Computational biology ,Biology ,Genome ,Translocation, Genetic ,Pathology and Forensic Medicine ,Fixatives ,03 medical and health sciences ,chemistry.chemical_compound ,0302 clinical medicine ,Formaldehyde ,Neoplasms ,medicine ,Humans ,Allele ,Gene ,Whole genome sequencing ,Paraffin Embedding ,Whole Genome Sequencing ,medicine.diagnostic_test ,Genome, Human ,DNA, Neoplasm ,Genomics ,030104 developmental biology ,chemistry ,030220 oncology & carcinogenesis ,Fresh frozen ,Molecular Medicine ,Female ,Algorithms ,DNA ,Fluorescence in situ hybridization - Abstract
DNA junctions (DNAJs) frequently impact clinically relevant genes in tumors and are important for diagnostic and therapeutic purposes. Although routinely screened through fluorescence in situ hybridization assays, such testing only allows the interrogation of single-gene regions or known fusion partners. Comprehensive assessment of DNAJs present across the entire genome can only be determined from whole-genome sequencing. Structural variance analysis from whole-genome paired-end sequencing data is, however, frequently restricted to copy number changes without DNAJ detection. Through optimized whole-genome sequencing and specialized bioinformatics algorithms, complete structural variance analysis is reported, including DNAJs, from formalin-fixed DNA. Selective library assembly from larger fragments (500 bp) and economical sequencing depths (300 to 400 million reads) provide representative genomic coverage profiles and increased allelic coverage to levels compatible with DNAJ calling (40× to 60×). Although consistently fragmented, more recently formalin-fixed, specimens (2 years' storage) revealed consistent populations of larger DNA fragments. Optimized bioinformatics efficiently detected90% of DNAJs in two prostate tumors (approximately 60% tumor) previously analyzed by mate-pair sequencing on fresh frozen tissue, with evidence of at least one spanning-read in 99% of DNAJs. Rigorous masking with data from unrelated formalin-fixed tissue progressively eliminated many false-positive DNAJs, without loss of true positives, resulting in low numbers of false-positive passing current filters. This methodology enables more comprehensive clinical genomics testing on formalin-fixed clinical specimens.
- Published
- 2021