1. GenomeWarp: an alignment-based variant coordinate transformation
- Author
Cory Y. McLean, Yeongwoo Hwang, Ryan Poplin, and Mark A. DePristo
- Subjects
Statistics and Probability, Source code, Computer science, Genomics, Biochemistry, Genome, 03 medical and health sciences, 0302 clinical medicine, Humans, Molecular Biology, 030304 developmental biology, 0303 health sciences, Genome, Human, Genome Analysis, Applications Notes, Computer Science Applications, Computational Mathematics, Transformation (function), Computational Theory and Mathematics, Human genome, Data mining, Software, 030217 neurology & neurosurgery, Reference genome
- Abstract
Summary: Reference genomes are refined to reflect error corrections and other improvements. While this process improves novel data generation and analysis, incorporating data analyzed on an older reference genome assembly requires transforming the coordinates and representations of the data to the new assembly. Multiple tools exist to perform this transformation for coordinate-only data types, but none supports accurate transformation of genome-wide short variation. Here we present GenomeWarp, a tool for efficiently transforming variants between genome assemblies. GenomeWarp transforms regions and short variants in a conservative manner to minimize false positive and negative variants in the target genome, and converts over 99% of regions and short variants from a representative human genome.

Availability and implementation: GenomeWarp is written in Java. All source code and the user manual are freely available at https://github.com/verilylifesciences/genomewarp.

Supplementary information: Supplementary data are available at Bioinformatics online.
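As a rough illustration of what a conservative coordinate transformation can look like, the Python sketch below maps an interval from an old assembly to a new one only when it lies entirely inside a single gap-free aligned block, and drops it otherwise. This is not GenomeWarp's actual algorithm or API. The `AlignedBlock` type, the `lift_interval` function, and the example coordinates are all hypothetical, and the sketch ignores strand changes and the variant-representation handling that the abstract says the tool performs.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class AlignedBlock:
    """Hypothetical gap-free alignment block (0-based, half-open coordinates)."""
    src_chrom: str
    src_start: int
    src_end: int
    dst_chrom: str
    dst_start: int


def lift_interval(
    blocks: List[AlignedBlock], chrom: str, start: int, end: int
) -> Optional[Tuple[str, int, int]]:
    """Map [start, end) to the target assembly, or return None.

    Conservative rule (an assumption of this sketch): the interval is
    converted only if one block contains it completely. Intervals that
    span a block boundary or fall in unaligned sequence are dropped.
    """
    for block in blocks:
        contained = (
            block.src_chrom == chrom
            and block.src_start <= start
            and end <= block.src_end
        )
        if contained:
            offset = block.dst_start - block.src_start
            return (block.dst_chrom, start + offset, end + offset)
    return None


# Made-up blocks for illustration only.
blocks = [
    AlignedBlock("chr1", 1000, 2000, "chr1", 1500),
    AlignedBlock("chr1", 2000, 3000, "chr1", 2600),
]

inside = lift_interval(blocks, "chr1", 1100, 1200)    # fully inside block 1
spanning = lift_interval(blocks, "chr1", 1900, 2100)  # crosses a boundary
```

Dropping intervals that cross a block boundary trades some conversion yield for safety: nothing is reported in the target assembly unless its placement is unambiguous under the alignment.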
- Published
2019