Back to Search
Start Over
A streamlined workflow for conversion, peer review, and publication of genomics metadata as omics data papers
- Source :
- GigaScience
- Publication Year :
- 2021
-
Abstract
- Background Data papers have emerged as a powerful instrument for open data publishing, obtaining credit, and establishing priority for datasets generated in scientific experiments. Academic publishing improves data and metadata quality through peer review and increases the impact of datasets by enhancing their visibility, accessibility, and reusability. Objective We aimed to establish a new type of article structure and template for omics studies: the omics data paper. To improve data interoperability and further incentivize researchers to publish well-described datasets, we created a prototype workflow for streamlined import of genomics metadata from the European Nucleotide Archive directly into a data paper manuscript. Methods An omics data paper template was designed by defining key article sections that encourage the description of omics datasets and methodologies. A metadata import workflow, based on REpresentational State Transfer services and Xpath, was prototyped to extract information from the European Nucleotide Archive, ArrayExpress, and BioSamples databases. Findings The template and workflow for automatic import of standard-compliant metadata into an omics data paper manuscript provide a mechanism for enhancing existing metadata through publishing. Conclusion The omics data paper structure and workflow for import of genomics metadata will help to bring genomic and other omics datasets into the spotlight. Promoting enhanced metadata descriptions and enforcing manuscript peer review and data auditing of the underlying datasets brings additional quality to datasets. We hope that streamlined metadata reuse for scholarly publishing encourages authors to create enhanced metadata descriptions in the form of data papers to improve both the quality of their metadata and its findability and accessibility.
- Subjects :
- 0106 biological sciences
Representational state transfer
European Nucleotide Archive
Databases, Factual
computer.internet_protocol
Computer science
workflow
AcademicSubjects/SCI02254
Health Informatics
01 natural sciences
data paper
03 medical and health sciences
Technical Note
030304 developmental biology
XPath
Structure (mathematical logic)
0303 health sciences
Metadata
FAIR principles
Findability
Genomics
Data science
Computer Science Applications
omics
Open data
Workflow
data
MIxS
standards
AcademicSubjects/SCI00960
MINSEQE
computer
010606 plant biology & botany
Subjects
Details
- ISSN :
- 2047217X
- Volume :
- 10
- Issue :
- 5
- Database :
- OpenAIRE
- Journal :
- GigaScience
- Accession number :
- edsair.doi.dedup.....5d889050f74302ef7dc9dc9a02fe3957