Back to Search
Start Over
ARGs-OAP v2.0 with an expanded SARG database and Hidden Markov Models for enhancement characterization and quantification of antibiotic resistance genes in environmental metagenomes
- Source :
- Bioinformatics. 34:2263-2270
- Publication Year :
- 2018
- Publisher :
- Oxford University Press (OUP), 2018.
-
Abstract
- Motivation Much global attention has been paid to antibiotic resistance in monitoring its emergence, accumulation and dissemination. For rapid characterization and quantification of antibiotic resistance genes (ARGs) in metagenomic datasets, an online analysis pipeline, ARGs-OAP has been developed consisting of a database termed Structured Antibiotic Resistance Genes (the SARG) with a hierarchical structure (ARGs type-subtype-reference sequence). Results The new release of the database, termed SARG version 2.0, contains sequences not only from CARD and ARDB databases, but also carefully selected and curated sequences from the latest protein collection of the NCBI-NR database, to keep up to date with the increasing number of ARG deposited sequences. SARG v2.0 has tripled the sequences of the first version and demonstrated improved coverage of ARGs detection in metagenomes from various environmental samples. In addition to annotation of high-throughput raw reads using a similarity search strategy, ARGs-OAP v2.0 now provides model-based identification of assembled sequences using SARGfam, a high-quality profile Hidden Markov Model (HMM), containing profiles of ARG subtypes. Additionally, ARGs-OAP v2.0 improves cell number quantification by using the average coverage of essential single copy marker genes, as an option in addition to the previous method based on the 16S rRNA gene. Availability and implementation ARGs-OAP can be accessed through http://smile.hku.hk/SARGs. The database could be downloaded from the same site. Source codes for this study can be downloaded from https://github.com/xiaole99/ARGs-OAP-v2.0. Supplementary information Supplementary data are available at Bioinformatics online.
- Subjects :
- 0301 basic medicine
Statistics and Probability
Databases, Factual
Sequence analysis
Computer science
Bacterial genome size
Drug resistance
010501 environmental sciences
Bacterial Physiological Phenomena
computer.software_genre
01 natural sciences
Biochemistry
Genome
03 medical and health sciences
Antibiotic resistance
Genome, Archaeal
Hidden Markov model
Molecular Biology
Gene
0105 earth and related environmental sciences
Bacteria
Database
Drug Resistance, Microbial
Sequence Analysis, DNA
16S ribosomal RNA
Archaea
Computer Science Applications
Computational Mathematics
Identification (information)
030104 developmental biology
Computational Theory and Mathematics
Metagenomics
Metagenome
Microbial genetics
computer
Genome, Bacterial
Software
Antibiotic resistance genes
Subjects
Details
- ISSN :
- 13674811 and 13674803
- Volume :
- 34
- Database :
- OpenAIRE
- Journal :
- Bioinformatics
- Accession number :
- edsair.doi.dedup.....0abf18858a492525efb8768da5e4e473