Back to Search Start Over

Towards an Interoperable Ecosystem of Research Cohort andReal-world Data Catalogues Enabling Multi-center Studies

Authors :
Morris Swertz
Esther van Enckevort
José Luis Oliveira
Isabel Fortier
Julie Bergeron
Nicolas H. Thurin
Eleanor Hyde
Alexander Kellmann
Romin Pahoueshnja
Miriam Sturkenboom
Marianne Cunnington
Anne-Marie Nybo Andersen
Yannick Marcon
Gonçalo Gonçalves
Rosa Gini
Groningen Institute for Gastro Intestinal Genetics and Immunology (3GI)
Source :
Yearbook of medical informatics, Yearbook of medical informatics, 31(1), 262-272, Swertz, M, van Enckevort, E, Oliveira, J L, Fortier, I, Bergeron, J, Thurin, N H, Hyde, E, Kellmann, A, Pahoueshnja, R, Sturkenboom, M, Cunnington, M, Nybo Andersen, A M, Marcon, Y, Gonçalves, G & Gini, R 2022, ' Towards an Interoperable Ecosystem of Research Cohort and Real-world Data Catalogues Enabling Multi-center Studies ', Yearbook of medical informatics, vol. 31, no. 1, pp. 262-272 . https://doi.org/10.1055/s-0042-1742522
Publication Year :
2022

Abstract

Objectives: Existing individual-level human data cover large populations on many dimensions such as lifestyle, demography, laboratory measures, clinical parameters, etc. Recent years have seen large investments in data catalogues to FAIRify data descriptions to capitalise on this great promise, i.e. make catalogue contents more Findable, Accessible, Interoperable and Reusable. However, their valuable diversity also created heterogeneity, which poses challenges to optimally exploit their richness.Methods: In this opinion review, we analyse catalogues for human subject research ranging from cohort studies to surveillance, administrative and healthcare records.Results: We observe that while these catalogues are heterogeneous, have various scopes, and use different terminologies, still the underlying concepts seem potentially harmonizable. We propose a unified framework to enable catalogue data sharing, with catalogues of multi-center cohorts nested as a special case in catalogues of real-world data sources. Moreover, we list recommendations to create an integrated community of metadata catalogues and an open catalogue ecosystem to sustain these efforts and maximise impact.Conclusions: We propose to embrace the autonomy of motivated catalogue teams and invest in their collaboration via minimal standardisation efforts such as clear data licensing, persistent identifiers for linking same records between catalogues, minimal metadata ‘common data elements’ using shared ontologies, symmetric architectures for data sharing (push/pull) with clear provenance tracks to process updates and acknowledge original contributors. And most importantly, we encourage the creation of environments for collaboration and resource sharing between catalogue developers, building on international networks such as OpenAIRE and research data alliance, as well as domain specific ESFRIs such as BBMRI and ELIXIR. OBJECTIVES: Existing individual-level human data cover large populations on many dimensions such as lifestyle, demography, laboratory measures, clinical parameters, etc. Recent years have seen large investments in data catalogues to FAIRify data descriptions to capitalise on this great promise, i.e. make catalogue contents more Findable, Accessible, Interoperable and Reusable. However, their valuable diversity also created heterogeneity, which poses challenges to optimally exploit their richness. METHODS: In this opinion review, we analyse catalogues for human subject research ranging from cohort studies to surveillance, administrative and healthcare records. RESULTS: We observe that while these catalogues are heterogeneous, have various scopes, and use different terminologies, still the underlying concepts seem potentially harmonizable. We propose a unified framework to enable catalogue data sharing, with catalogues of multi-center cohorts nested as a special case in catalogues of real-world data sources. Moreover, we list recommendations to create an integrated community of metadata catalogues and an open catalogue ecosystem to sustain these efforts and maximise impact. CONCLUSIONS: We propose to embrace the autonomy of motivated catalogue teams and invest in their collaboration via minimal standardisation efforts such as clear data licensing, persistent identifiers for linking same records between catalogues, minimal metadata 'common data elements' using shared ontologies, symmetric architectures for data sharing (push/pull) with clear provenance tracks to process updates and acknowledge original contributors. And most importantly, we encourage the creation of environments for collaboration and resource sharing between catalogue developers, building on international networks such as OpenAIRE and research data alliance, as well as domain specific ESFRIs such as BBMRI and ELIXIR.

Details

ISSN :
23640502
Database :
OpenAIRE
Journal :
Yearbook of medical informatics
Accession number :
edsair.doi.dedup.....c5176ebccf473b9c3a8f4e949b17654a
Full Text :
https://doi.org/10.1055/s-0042-1742522