Back to Search Start Over

Kmasker plants - a tool for assessing complex sequence space in plant species

Authors :
Martin Grosse
Jochen Kumlehn
Thomas Schmutzer
Sindy Chamas
Goetz Hensel
Thomas Münch
Sebastian Beier
Krishna Mohan Pathi
Iris Koeppel
Christian Hertig
Uwe Scholz
Chris Ulpinnis
Markus Schwalbe
Sophia Gerasimova
Nagaveni Budhagatapalli
Robert Eric Hoffie
Stefan Hiekel
Source :
The Plant journal : for cell and molecular biologyReferences. 102(3)
Publication Year :
2019

Abstract

Many plant genomes display high levels of repetitive sequences. The assembly of these complex genomes using short high-throughput sequence reads is still a challenging task. Underestimation or disregard of repeat complexity in these datasets can easily misguide downstream analysis. Detection of repetitive regions by k-mer counting methods has proved to be reliable. Easy-to-use applications utilizing k-mer counting are in high demand, especially in the domain of plants. We present Kmasker plants, a tool that uses k-mer count information as an assistant throughout the analytical workflow of genome data that is provided as a command-line and web-based solution. Beside its core competence to screen and mask repetitive sequences, we have integrated features that enable comparative studies between different cultivars or closely related species and methods that estimate target specificity of guide RNAs for application of site-directed mutagenesis using Cas9 endonuclease. In addition, we have set up a web service for Kmasker plants that maintains pre-computed indices for 10 of the economically most important cultivated plants. Source code for Kmasker plants has been made publically available at https://github.com/tschmutzer/kmasker. The web service is accessible at https://kmasker.ipk-gatersleben.de.

Details

ISSN :
1365313X
Volume :
102
Issue :
3
Database :
OpenAIRE
Journal :
The Plant journal : for cell and molecular biologyReferences
Accession number :
edsair.doi.dedup.....abb859fc21335aabf0638fe7206de243