Back to Search
Start Over
Functional group and diversity analysis of BIOFACQUIM: A Mexican natural product database
- Source :
- F1000Research
- Publication Year :
- 2019
- Publisher :
- F1000 Research Ltd, 2019.
-
Abstract
- Background: Natural product databases are important in drug discovery and other research areas. Their structural contents and functional group analysis are relevant to increase their knowledge in terms of chemical diversity and chemical space coverage. BIOFACQUIM is an emerging database of natural products characterized and isolated in Mexico. Herein, we discuss the results of a first systematic functional group analysis and global diversity of an updated version of BIOFACQUIM. Methods: BIOFACQUIM was augmented through a literature search and data curation. A structural content analysis of the dataset was done. This involved a functional group analysis with a novel algorithm to identify automatically all functional groups in a molecule and an assessment of the global diversity using consensus diversity plots. To this end, BIOFACQUIM was compared to two major and large databases: ChEMBL 25, and a herein assembled collection of natural products with 169,839 unique compounds. Results: The structural content analysis showed that 16.1% of compounds, 11.3% of scaffolds, and 6.3% of functional groups present in the current version of BIOFACQUIM have not been reported in the other large reference datasets. It also gave a diversity increase in terms of scaffolds and molecular fingerprints regarding the previous version of the dataset, as well as a higher similarity to the assembled collection of natural products than to ChEMBL 25, in terms of diversity and frequent functional groups. Conclusions: A total of 148 natural products were added to BIOFACQUIM, which meant a diversity increase in terms of scaffolds and fingerprints. Regardless of its relatively small size, there are a significant number of compounds, scaffolds, and functional groups that are not present in the reference datasets, showing that curated databases of natural products, such as BIOFACQUIM, can serve as a starting point to increase the biologically relevant chemical space.
- Subjects :
- 0301 basic medicine
Databases, Factual
Computer science
natural products
In silico
computer.software_genre
01 natural sciences
General Biochemistry, Genetics and Molecular Biology
diversity
chemistry.chemical_compound
03 medical and health sciences
functional groups
Drug Discovery
General Pharmacology, Toxicology and Pharmaceutics
Functional group (ecology)
Mexico
030304 developmental biology
Biological Products
0303 health sciences
Natural product
Data curation
Database
General Immunology and Microbiology
Drug discovery
010405 organic chemistry
compound databases
Articles
Consensus Diversity Plot
data mining
General Medicine
chEMBL
Chemical space
0104 chemical sciences
030104 developmental biology
chemistry
in silico
computer
Diversity (business)
Research Article
Subjects
Details
- ISSN :
- 20461402
- Volume :
- 8
- Database :
- OpenAIRE
- Journal :
- F1000Research
- Accession number :
- edsair.doi.dedup.....8de37bf20357ac717216c598cefa9cf0