1. OMBlast: alignment tool for optical mapping using a seed-and-extend approach
- Author
-
Pui-Yan Kwok, Alden King-Yung Leung, Tsz-Piu Kwok, Ting-Fung Chan, Kevin Y. Yip, Raymond Wan, Ming Xiao, and Hancock, John
- Subjects
0301 basic medicine ,Computer science ,computer.software_genre ,Biochemistry ,Genome ,Mathematical Sciences ,chemistry.chemical_compound ,0302 clinical medicine ,Software ,Computer graphics (images) ,Optical Restriction Mapping ,computer.programming_language ,Genomics ,Biological Sciences ,Original Papers ,Computer Science Applications ,Computational Mathematics ,Computational Theory and Mathematics ,Data mining ,Sequence Analysis ,Statistics and Probability ,Java ,Bioinformatics ,Sequence analysis ,Bioengineering ,Sequence alignment ,Saccharomyces cerevisiae ,Set (abstract data type) ,03 medical and health sciences ,Information and Computing Sciences ,Optical mapping ,Escherichia coli ,Animals ,Humans ,Caenorhabditis elegans ,Molecular Biology ,Gene ,business.industry ,SIGNAL (programming language) ,Sequence Analysis, DNA ,DNA ,Genome Analysis ,Range (mathematics) ,030104 developmental biology ,chemistry ,Generic health relevance ,business ,Sequence Alignment ,computer ,030217 neurology & neurosurgery - Abstract
Motivation Optical mapping is a technique for capturing fluorescent signal patterns of long DNA molecules (in the range of 0.1โ1 Mbp). Recently, it has been complementing the widely used short-read sequencing technology by assisting with scaffolding and detecting large and complex structural variations (SVs). Here, we introduce a fast, robust and accurate tool called OMBlast for aligning optical maps, the set of signal locations on the molecules generated from optical mapping. Our method is based on the seed-and-extend approach from sequence alignment, with modifications specific to optical mapping. Results Experiments with both synthetic and our real data demonstrate that OMBlast has higher accuracy and faster mapping speed than existing alignment methods. Our tool also shows significant improvement when aligning data with SVs. Availability and Implementation OMBlast is implemented for Java 1.7 and is released under a GPL license. OMBlast can be downloaded from https://github.com/aldenleung/OMBlast and run directly on machines equipped with a Java virtual machine. Supplementary information Supplementary data are available at Bioinformatics online
- Published
- 2016
- Full Text
- View/download PDF