PlasEval: a framework for comparing and evaluating plasmid detection tools

Authors :
Aniket Mane
Haley Sanderson
Aaron P. White
Rahat Zaheer
Robert Beiko
Cédric Chauve
Source :
BMC Bioinformatics, Vol 25, Iss 1, Pp 1-26 (2024)
Publication Year :
2024
Publisher :
BMC, 2024.

Abstract

Background: Plasmids play a major role in the transfer of antimicrobial resistance (AMR) genes among bacteria via horizontal gene transfer. The identification of plasmids in short-read assemblies is a challenging problem and a very active research area. Plasmid binning aims at detecting, in a draft genome assembly, groups (bins) of contigs likely to originate from the same plasmid. Several methods for plasmid binning have been developed recently, such as PlasBin-flow, HyAsP, gplas, MOB-suite, and plasmidSPAdes. This motivates the problem of evaluating the performance of plasmid binning methods, either against a given ground truth or against one another.

Results: We describe PlasEval, a novel method aimed at comparing the results of plasmid binning tools. PlasEval computes a dissimilarity measure between two sets of plasmid bins, which can originate either from two plasmid binning tools or from a plasmid binning tool and a ground truth set of plasmid bins. The PlasEval dissimilarity accounts for the contig content of plasmid bins and the length of contigs, and is repeat-aware. Moreover, the dissimilarity score computed by PlasEval is broken down into several parts, which makes it possible to understand qualitative differences between the compared sets of plasmid bins. We illustrate the use of PlasEval by benchmarking four recently developed plasmid binning tools (PlasBin-flow, HyAsP, gplas, and MOB-recon) on a data set of 53 E. coli bacterial genomes.

Conclusion: Analysis of the results of plasmid binning methods using PlasEval shows that their behaviour varies significantly. PlasEval can be used to decide which specific plasmid binning method should be used for a specific dataset. The disagreement between different methods also suggests that the problem of plasmid binning on short-read contigs requires further research. We believe that PlasEval can prove to be an effective tool in this regard. PlasEval is publicly available at https://github.com/acme92/PlasEval
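
To make the idea of a length-weighted dissimilarity between two sets of plasmid bins concrete, the following Python sketch may help. It is an illustration only and is not the PlasEval algorithm: the function name `toy_dissimilarity`, the representation of bins as sets of contig IDs, and the best-overlap scoring are all assumptions made here, and the sketch is not repeat-aware (each contig is assumed to appear at most once per bin).

```python
from typing import Dict, List, Set


def toy_dissimilarity(
    bins_a: List[Set[str]],
    bins_b: List[Set[str]],
    lengths: Dict[str, int],
) -> float:
    """Toy length-weighted dissimilarity in [0, 1] between two binnings.

    Hypothetical illustration, NOT the measure computed by PlasEval.
    For every bin on one side, take the largest shared contig length with
    any single bin on the other side, and report the fraction of total
    bin length that is left unexplained by those best overlaps.
    """

    def total_length(contigs: Set[str]) -> int:
        # Sum of contig lengths (in bp) for a set of contig IDs.
        return sum(lengths[c] for c in contigs)

    def best_overlap(src: List[Set[str]], dst: List[Set[str]]) -> int:
        # For each source bin, the best shared length with one target bin.
        return sum(
            max((total_length(s & d) for d in dst), default=0) for s in src
        )

    total = sum(total_length(b) for b in bins_a) + sum(
        total_length(b) for b in bins_b
    )
    if total == 0:
        return 0.0  # two empty binnings are treated as identical
    shared = best_overlap(bins_a, bins_b) + best_overlap(bins_b, bins_a)
    return 1.0 - shared / total


if __name__ == "__main__":
    contig_lengths = {"c1": 1000, "c2": 500, "c3": 2000}
    predicted = [{"c1", "c2"}, {"c3"}]
    reference = [{"c1"}, {"c2"}, {"c3"}]
    print(toy_dissimilarity(predicted, reference, contig_lengths))
```

In this toy score, splitting or merging bins is penalised in proportion to the length of the contigs that end up separated from their best-matching bin, so a misplaced long contig costs more than a misplaced short one.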

Details

Language :
English
ISSN :
1471-2105
Volume :
25
Issue :
1
Database :
Directory of Open Access Journals
Journal :
BMC Bioinformatics
Publication Type :
Academic Journal
Accession number :
edsdoj.59f0e9ae82ea498c88a24f1450bf00e9
Document Type :
article
Full Text :
https://doi.org/10.1186/s12859-024-05941-0