1. Comparison of robust to standardized CT radiomics models to predict overall survival for non-small cell lung cancer patients
- Author
-
Jan Unkelbach, Stephanie Tanadini-Lang, Diem Vuong, Carol Oliveira, Hubert S. Gabryś, Matthias Guckenberger, Sven Hillinger, Robert Foerster, Florian Amstutz, Sandra Thierstein, Solange Peters, Miklos Pless, Marta Bogowicz, A. Xyrafas, and S. Denzler
- Subjects
Lung Neoplasms ,business.industry ,Intraclass correlation ,Contrast (statistics) ,General Medicine ,Logistic regression ,medicine.disease ,030218 nuclear medicine & medical imaging ,3. Good health ,03 medical and health sciences ,0302 clinical medicine ,Radiomics ,Feature (computer vision) ,Robustness (computer science) ,030220 oncology & carcinogenesis ,Carcinoma, Non-Small-Cell Lung ,medicine ,Overall survival ,Humans ,Prospective Studies ,Lung cancer ,business ,Nuclear medicine ,Tomography, X-Ray Computed ,Retrospective Studies - Abstract
BACKGROUND Radiomics is a promising tool for the identification of new prognostic biomarkers. Radiomic features can be affected by different scanning protocols, often present in retrospective and prospective clinical data. We compared a computed tomography (CT) radiomics model based on a large but highly heterogeneous multicentric image dataset with robust feature pre-selection to a model based on a smaller but standardized image dataset without pre-selection. MATERIALS AND METHODS Primary tumor radiomics was extracted from pre-treatment CTs of IIIA/N2/IIIB NSCLC patients from a prospective Swiss multicentric randomized trial (npatient = 124, ninstitution = 14, SAKK 16/00) and a validation dataset (npatient = 31, ninstitution = 1). Four robustness studies investigating inter-observer delineation variation, motion, convolution kernel, and contrast were conducted to identify robust features using an intraclass correlation coefficient threshold >0.9. Two 12-months overall survival (OS) logistic regression models were trained: (a) on the entire multicentric heterogeneous dataset but with robust feature pre-selection (MCR) and (b) on a smaller standardized subset using all features (STD). Both models were validated on the validation dataset acquired with similar reconstruction parameters as the STD dataset. The model performances were compared using the DeLong test. RESULTS In total, 113 stable features were identified (nshape = 8, nintensity = 0, ntexture = 7, nwavelet = 98). The convolution kernel had the strongest influence on the feature robustness (
- Published
- 2019