Back to Search Start Over

RADCURE: An open‐source head and neck cancer CT dataset for clinical radiation therapy insights.

Authors :
Welch, Mattea L.
Kim, Sejin
Hope, Andrew J.
Huang, Shao Hui
Lu, Zhibin
Marsilla, Joseph
Kazmierski, Michal
Rey‐McIntyre, Katrina
Patel, Tirth
O'Sullivan, Brian
Waldron, John
Bratman, Scott
Haibe‐Kains, Benjamin
Tadic, Tony
Source :
Medical Physics. Apr2024, Vol. 51 Issue 4, p3101-3109. 9p.
Publication Year :
2024

Abstract

Purpose: This manuscript presents RADCURE, one of the most extensive head and neck cancer (HNC) imaging datasets accessible to the public. Initially collected for clinical radiation therapy (RT) treatment planning, this dataset has been retrospectively reconstructed for use in imaging research. Acquisition and Validation Methods: RADCURE encompasses data from 3346 patients, featuring computed tomography (CT) RT simulation images with corresponding target and organ‐at‐risk contours. These CT scans were collected using systems from three different manufacturers. Standard clinical imaging protocols were followed, and contours were manually generated and reviewed at weekly RT quality assurance rounds. RADCURE imaging and structure set data was extracted from our institution's radiation treatment planning and oncology information systems using a custom‐built data mining and processing system. Furthermore, images were linked to our clinical anthology of outcomes data for each patient and includes demographic, clinical and treatment information based on the 7th edition TNM staging system (Tumor‐Node‐Metastasis Classification System of Malignant Tumors). The median patient age is 63, with the final dataset including 80% males. Half of the cohort is diagnosed with oropharyngeal cancer, while laryngeal, nasopharyngeal, and hypopharyngeal cancers account for 25%, 12%, and 5% of cases, respectively. The median duration of follow‐up is five years, with 60% of the cohort surviving until the last follow‐up point. Data Format and Usage Notes: The dataset provides images and contours in DICOM CT and RT‐STRUCT formats, respectively. We have standardized the nomenclature for individual contours—such as the gross primary tumor, gross nodal volumes, and 19 organs‐at‐risk—to enhance the RT‐STRUCT files' utility. Accompanying demographic, clinical, and treatment data are supplied in a comma‐separated values (CSV) file format. This comprehensive dataset is publicly accessible via The Cancer Imaging Archive. Potential Applications: RADCURE's amalgamation of imaging, clinical, demographic, and treatment data renders it an invaluable resource for a broad spectrum of radiomics image analysis research endeavors. Researchers can utilize this dataset to advance routine clinical procedures using machine learning or artificial intelligence, to identify new non‐invasive biomarkers, or to forge prognostic models. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
00942405
Volume :
51
Issue :
4
Database :
Academic Search Index
Journal :
Medical Physics
Publication Type :
Academic Journal
Accession number :
176451127
Full Text :
https://doi.org/10.1002/mp.16972