1. Reproducible radiomics through automated machine learning validated on twelve clinical applications
- Author
-
Starmans, Martijn P. A., van der Voort, Sebastian R., Phil, Thomas, Timbergen, Milea J. M., Vos, Melissa, Padmos, Guillaume A., Kessels, Wouter, Hanff, David, Grunhagen, Dirk J., Verhoef, Cornelis, Sleijfer, Stefan, Bent, Martin J. van den, Smits, Marion, Dwarkasing, Roy S., Els, Christopher J., Fiduzi, Federico, van Leenders, Geert J. L. H., Blazevic, Anela, Hofland, Johannes, Brabander, Tessa, van Gils, Renza A. H., Franssen, Gaston J. H., Feelders, Richard A., de Herder, Wouter W., Buisman, Florian E., Willemssen, Francois E. J. A., Koerkamp, Bas Groot, Angus, Lindsay, van der Veldt, Astrid A. M., Rajicic, Ana, Odink, Arlette E., Deen, Mitchell, T., Jose M. Castillo, Veenland, Jifke, Schoots, Ivo, Renckens, Michel, Doukas, Michail, de Man, Rob A., IJzermans, Jan N. M., Miclea, Razvan L., Vermeulen, Peter B., Bron, Esther E., Thomeer, Maarten G., Visser, Jacob J., Niessen, Wiro J., and Klein, Stefan
- Subjects
Electrical Engineering and Systems Science - Image and Video Processing ,Computer Science - Computer Vision and Pattern Recognition - Abstract
Radiomics uses quantitative medical imaging features to predict clinical outcomes. Currently, in a new clinical application, finding the optimal radiomics method out of the wide range of available options has to be done manually through a heuristic trial-and-error process. In this study we propose a framework for automatically optimizing the construction of radiomics workflows per application. To this end, we formulate radiomics as a modular workflow and include a large collection of common algorithms for each component. To optimize the workflow per application, we employ automated machine learning using a random search and ensembling. We evaluate our method in twelve different clinical applications, resulting in the following area under the curves: 1) liposarcoma (0.83); 2) desmoid-type fibromatosis (0.82); 3) primary liver tumors (0.80); 4) gastrointestinal stromal tumors (0.77); 5) colorectal liver metastases (0.61); 6) melanoma metastases (0.45); 7) hepatocellular carcinoma (0.75); 8) mesenteric fibrosis (0.80); 9) prostate cancer (0.72); 10) glioma (0.71); 11) Alzheimer's disease (0.87); and 12) head and neck cancer (0.84). We show that our framework has a competitive performance compared human experts, outperforms a radiomics baseline, and performs similar or superior to Bayesian optimization and more advanced ensemble approaches. Concluding, our method fully automatically optimizes the construction of radiomics workflows, thereby streamlining the search for radiomics biomarkers in new applications. To facilitate reproducibility and future research, we publicly release six datasets, the software implementation of our framework, and the code to reproduce this study., Comment: 33 pages, 4 figures, 4 tables, 2 supplementary figures, 3 supplementary table, submitted to Medical Image Analysis; revision
- Published
- 2021