1. KEC: unique sequence search by K-mer exclusion
- Author
-
Pavel Beran, Stephen P. Cohen, Dagmar Stehlíková, and Vladislav Čurn
- Subjects
Statistics and Probability ,Computational Mathematics ,Computational Theory and Mathematics ,k-mer ,Computer science ,Computational biology ,Molecular Biology ,Biochemistry ,Sequence search ,Computer Science Applications - Abstract
Summary Searching for amino acid or nucleic acid sequences unique to one organism may be challenging depending on size of the available datasets. K-mer elimination by cross-reference (KEC) allows users to quickly and easily find unique sequences by providing target and non-target sequences. Due to its speed, it can be used for datasets of genomic size and can be run on desktop or laptop computers with modest specifications. Availability and implementation KEC is freely available for non-commercial purposes. Source code and executable binary files compiled for Linux, Mac and Windows can be downloaded from https://github.com/berybox/KEC. Supplementary information Supplementary data are available at Bioinformatics online.
- Published
- 2021
- Full Text
- View/download PDF