The Data Use Ontology to streamline responsible access to human biomedical datasets

Authors :
Heidi J. Sofia
Kathy Reinold
Petr Holub
Gregory A. Rushton
Sarion R. Bowers
Melissa A. Konopko
Anthony J. Brookes
Chihiro Hata
Jaime M. Guidry Auvil
Giselle Kerry
Stephanie O. M. Dyke
Rebecca R. Boyles
Tony Burdett
Mallory A. Freeberg
Fabian Prasser
Soichi Ogishima
Jordi Rambla
Aina Jene
Matthew Brush
Mélanie Courtot
Ilia Tulchinsky
Esther van Enckevort
Minae Kawashima
Moran N. Cabili
Jonathan Lawson
Laura A.D. Paglione
Helen Parkinson
Tommi Nyrönen
Adrian Thorogood
Satoshi Nagaie
Craig Voisin
Anthony A. Philippakis
Pinar Alper
Haoyuan Li
Susheel Varma
John Dylan Spalding
Hayley L. Clissold
Gary I. Saunders
Natsuko Yamamoto
Nicola Mulder
Morris A. Swertz
Ravi N. Pandya
Melissa Haendel
Mikael Linden
Mizuki Morita
Vivian Ota Wang
Jean Muller
Chisato Yamasaki
Lyndon Zass
Francis Jeanson
Irene Kyomugisha
Stacey Donnelly
Tiffany Boughtwood
Laura Lyman Rodriguez
Jamal Nasir
Andrea Saltzman
Shuichi Kawashima
Source :
Cell Genomics
Publication Year :
2021

Abstract

Summary
Human biomedical datasets that are critical for research and clinical studies to benefit human health also often contain sensitive or potentially identifying information of individual participants. Thus, care must be taken when they are processed and made available to comply with ethical and regulatory frameworks and informed consent data conditions. To enable and streamline data access for these biomedical datasets, the Global Alliance for Genomics and Health (GA4GH) Data Use and Researcher Identities (DURI) work stream developed and approved the Data Use Ontology (DUO) standard. DUO is a hierarchical vocabulary of human and machine-readable data use terms that consistently and unambiguously represents a dataset’s allowable data uses. DUO has been implemented by major international stakeholders such as the Broad and Sanger Institutes and is currently used in annotation of over 200,000 datasets worldwide. Using DUO in data management and access facilitates researchers’ discovery and access of relevant datasets. DUO annotations increase the FAIRness of datasets and support data linkages using common data use profiles when integrating the data for secondary analyses. DUO is implemented in the Web Ontology Language (OWL) and, to increase community awareness and engagement, hosted in an open, centralized GitHub repository. DUO, together with the GA4GH Passport standard, offers a new, efficient, and streamlined data authorization and access framework that has enabled increased sharing of biomedical datasets worldwide.

Highlights
Biomedical advances depend on the efficient and compliant re-use of sensitive human data
The Data Use Ontology standardizes terms and definitions for consented data uses
The Data Use Ontology facilitates discovery of, request for, and access to datasets
Over 200,000 datasets worldwide have been annotated using the Data Use Ontology

The GA4GH Data Use Ontology (DUO) provides unambiguous, machine-readable standard language for consent forms and the data sharing policies they represent. Lawson et al. describe the DUO standard and implementations throughout the data access workflow to expedite data access while maintaining or improving compliant processes.

Details

ISSN :
2666-979X
Database :
OpenAIRE
Journal :
Cell Genomics
Accession number :
edsair.doi.dedup.....df22314d877b3036a3aa912462062c75
Full Text :
https://doi.org/10.1016/j.xgen.2021.100028