Benefiting from big data in natural products: importance of preserving foundational skills and prioritizing data quality
- Source :
- Natural Product Reports, 38(11), 1947-1953 (2021)
- Publication Year :
- 2021
- Publisher :
- Royal Society of Chemistry (RSC), 2021.
Abstract
- Systematic, large-scale studies at the genomic, metabolomic, and functional level have transformed the natural product sciences. Improvements in technology and reduction in cost for obtaining spectroscopic, chromatographic, and genomic data, coupled with the creation of readily accessible curated and functionally annotated data sets, have altered the practices of virtually all natural product research laboratories. Gone are the days when natural products researchers were expected to devote themselves exclusively to the isolation, purification, and structure elucidation of small molecules. We now also engage with big data in taxonomic, genomic, proteomic, and/or metabolomic collections, and use these data to generate and test hypotheses. While the oft-stated aim for the use of large-scale -omics data in the natural products sciences is to achieve a rapid increase in the rate of discovery of new drugs, this has not yet come to pass. At the same time, new technologies have provided unexpected opportunities for natural products chemists to ask and answer new and different questions. With this viewpoint, we discuss the evolution of big data as a part of natural products research and provide a few examples of how discoveries have been enabled by access to big data. We also draw attention to some of the limitations in our existing engagement with large datasets and consider what would be necessary to overcome them.
- Big data is changing how we do natural products research and creating exciting new possibilities. Continued attention to enhancing data quality, increasing access, and preserving foundational skills is needed.
- Subjects :
- Big Data
Structure (mathematical logic)
Biological Products
Bioinformatics
Computer science
Emerging technologies
business.industry
Spectrum Analysis
Genomic data
Organic Chemistry
Genomics
Biochemistry
Data science
Natural (archaeology)
Data Accuracy
Omics data
Chemistry
Data quality
Bioinformatica
Drug Discovery
Life Science
Isolation (database systems)
business
- ISSN :
- 1460-4752 and 0265-0568
- Volume :
- 38
- Database :
- OpenAIRE
- Journal :
- Natural Product Reports
- Accession number :
- edsair.doi.dedup.....ab3702030ab2ce8bc29c6f47c23f602d