Back to Search
Start Over
The Data Use Ontology to streamline responsible access to human biomedical datasets
- Source :
- Cell Genomics
- Publication Year :
- 2021
-
Abstract
- Summary Human biomedical datasets that are critical for research and clinical studies to benefit human health also often contain sensitive or potentially identifying information of individual participants. Thus, care must be taken when they are processed and made available to comply with ethical and regulatory frameworks and informed consent data conditions. To enable and streamline data access for these biomedical datasets, the Global Alliance for Genomics and Health (GA4GH) Data Use and Researcher Identities (DURI) work stream developed and approved the Data Use Ontology (DUO) standard. DUO is a hierarchical vocabulary of human and machine-readable data use terms that consistently and unambiguously represents a dataset’s allowable data uses. DUO has been implemented by major international stakeholders such as the Broad and Sanger Institutes and is currently used in annotation of over 200,000 datasets worldwide. Using DUO in data management and access facilitates researchers’ discovery and access of relevant datasets. DUO annotations increase the FAIRness of datasets and support data linkages using common data use profiles when integrating the data for secondary analyses. DUO is implemented in the Web Ontology Language (OWL) and, to increase community awareness and engagement, hosted in an open, centralized GitHub repository. DUO, together with the GA4GH Passport standard, offers a new, efficient, and streamlined data authorization and access framework that has enabled increased sharing of biomedical datasets worldwide.<br />Graphical abstract<br />Highlights Biomedical advances depend on the efficient and compliant re-use of sensitive human data The Data Use Ontology standardizes terms and definitions for consented data uses The Data Use Ontology facilitates discovery of, request for, and access to datasets Over 200,000 datasets worldwide have been annotated using the Data Use Ontology<br />The GA4GH Data Use Ontology (DUO) provides unambiguous, machine-readable standard language for consent forms and the data sharing policies they represent. Lawson et al. describe the DUO standard and implementations throughout the data access workflow to expedite data access while maintaining or improving compliant processes.
- Subjects :
- secondary data use
Vocabulary
Technology
Computer science
media_common.quotation_subject
Data management
data restrictions
Ontology (information science)
Biochemistry, biophysics & molecular biology [F05] [Life sciences]
Ontologia
Biochemistry, Genetics and Molecular Biology (miscellaneous)
03 medical and health sciences
Annotation
0302 clinical medicine
data access
Conjunts de dades
Genetics
ontology
Biochimie, biophysique & biologie moléculaire [F05] [Sciences du vivant]
automated data access
030304 developmental biology
media_common
FAIR
Computer science [C05] [Engineering, computing & technology]
0303 health sciences
business.industry
Authorization
GA4GH
standard
Ontology language
Sciences informatiques [C05] [Ingénierie, informatique & technologie]
controlled access
Data science
3. Good health
Data access
Community awareness
consent
business
030217 neurology & neurosurgery
Subjects
Details
- ISSN :
- 2666979X
- Database :
- OpenAIRE
- Journal :
- Cell Genomics
- Accession number :
- edsair.doi.dedup.....df22314d877b3036a3aa912462062c75
- Full Text :
- https://doi.org/10.1016/j.xgen.2021.100028