Back to Search Start Over

RADx Data Hub: A Cloud Repository for FAIR, Harmonized COVID-19 Data

Authors :
Martinez-Romero, Marcos
Horridge, Matthew
Mistry, Nilesh
Weyhmiller, Aubrie
Yu, Jimmy K.
Fujimoto, Alissa
Henry, Aria
O'Connor, Martin J.
Sier, Ashley
Suber, Stephanie
Akdogan, Mete U.
Cao, Yan
Valliappan, Somu
Mieczkowska, Joanna O.
team, the RADx Data Hub
Krishnamurthy, Ashok
Keller, Michael A.
Musen, Mark A.
Publication Year :
2025

Abstract

The COVID-19 pandemic highlighted the urgent need for robust systems to enable rapid data collection, integration, and analysis for public health responses. Existing approaches often relied on disparate, non-interoperable systems, creating bottlenecks in comprehensive analyses and timely decision-making. To address these challenges, the U.S. National Institutes of Health (NIH) launched the Rapid Acceleration of Diagnostics (RADx) initiative in 2020, with the RADx Data Hub, a centralized repository for de-identified and curated COVID-19 data, as its cornerstone. The RADx Data Hub hosts diverse study data, including clinical data, testing results, smart sensor outputs, self-reported symptoms, and information on social determinants of health. Built on cloud infrastructure, the RADx Data Hub integrates metadata standards, interoperable formats, and ontology-based tools to adhere to the FAIR (Findable, Accessible, Interoperable, Reusable) principles for data sharing. Initially developed for COVID-19 research, its architecture and processes are adaptable to other scientific disciplines. This paper provides an overview of the data hosted by the RADx Data Hub and describes the platform's capabilities and architecture.

Subjects

Subjects :
Computer Science - Databases

Details

Database :
arXiv
Publication Type :
Report
Accession number :
edsarx.2502.00265
Document Type :
Working Paper