1. Standardization in Quantitative Imaging: A Multicenter Comparison of Radiomic Features from Different Software Packages on Digital Reference Objects and Patient Data Sets
- Author
-
McNitt-Gray, M, Napel, S, Jaggi, A, Mattonen, SA, Hadjiiski, L, Muzi, M, Goldgof, D, Balagurunathan, Y, Pierce, LA, Kinahan, PE, Jones, EF, Nguyen, A, Virkud, A, Chan, HP, Emaminejad, N, Wahi-Anwar, M, Daly, M, Abdalah, M, Yang, H, Lu, L, Lv, W, Rahmim, A, Gastounioti, A, Pati, S, Bakas, S, Kontos, D, Zhao, B, Kalpathy-Cramer, J, and Farahani, K
- Subjects
Information and Computing Sciences ,Biomedical and Clinical Sciences ,Software Engineering ,Biomedical Imaging ,Bioengineering ,Humans ,Image Processing ,Computer-Assisted ,Neoplasms ,Positron Emission Tomography Computed Tomography ,Radiometry ,Reference Standards ,Software ,Radiomics ,Quantitative Imaging ,Standardization ,Multi-center ,Feature Definitions - Abstract
Radiomic features are being increasingly studied for clinical applications. We aimed to assess the agreement among radiomic features when computed by several groups by using different software packages under very tightly controlled conditions, which included standardized feature definitions and common image data sets. Ten sites (9 from the NCI's Quantitative Imaging Network] positron emission tomography-computed tomography working group plus one site from outside that group) participated in this project. Nine common quantitative imaging features were selected for comparison including features that describe morphology, intensity, shape, and texture. The common image data sets were: three 3D digital reference objects (DROs) and 10 patient image scans from the Lung Image Database Consortium data set using a specific lesion in each scan. Each object (DRO or lesion) was accompanied by an already-defined volume of interest, from which the features were calculated. Feature values for each object (DRO or lesion) were reported. The coefficient of variation (CV), expressed as a percentage, was calculated across software packages for each feature on each object. Thirteen sets of results were obtained for the DROs and patient data sets. Five of the 9 features showed excellent agreement with CV < 1%; 1 feature had moderate agreement (CV < 10%), and 3 features had larger variations (CV ≥ 10%) even after attempts at harmonization of feature calculations. This work highlights the value of feature definition standardization as well as the need to further clarify definitions for some features.
- Published
- 2020