1. Imbalance-Aware Self-supervised Learning for 3D Radiomic Representations
- Author
-
de Buijne, Marleen, Cotin, Stéphane, Speidel, Stefanie, Zhen, Yefeng, Essert, Caroline, de Buijne, M ( Marleen ), Cotin, S ( Stéphane ), Speidel, S ( Stefanie ), Zhen, Y ( Yefeng ), Essert, C ( Caroline ), Li, Hongwei, Xue, Fei-Fei, Chaitanya, Krishna; https://orcid.org/0000-0002-9036-6967, Luo, Shengda, Ezhov, Ivan, Wiestler, Benedikt; https://orcid.org/0000-0002-2963-7772, Zhang, Jianguo, Menze, Bjoern; https://orcid.org/0000-0003-4136-5690, de Buijne, Marleen, Cotin, Stéphane, Speidel, Stefanie, Zhen, Yefeng, Essert, Caroline, de Buijne, M ( Marleen ), Cotin, S ( Stéphane ), Speidel, S ( Stefanie ), Zhen, Y ( Yefeng ), Essert, C ( Caroline ), Li, Hongwei, Xue, Fei-Fei, Chaitanya, Krishna; https://orcid.org/0000-0002-9036-6967, Luo, Shengda, Ezhov, Ivan, Wiestler, Benedikt; https://orcid.org/0000-0002-2963-7772, Zhang, Jianguo, and Menze, Bjoern; https://orcid.org/0000-0003-4136-5690
- Abstract
Radiomics can quantify the properties of regions of interest in medical image data. Classically, they account for pre-defined statistics of shape, texture, and other low-level image features. Alternatively, deep learning-based representations are derived from supervised learning but require expensive annotations and often suffer from overfitting and data imbalance issues. In this work, we address the challenge of learning the representation of a 3D medical image for an effective quantification under data imbalance. We propose a self-supervised representation learning framework to learn high-level features of 3D volumes as a complement to existing radiomics features. Specifically, we demonstrate how to learn image representations in a self-supervised fashion using a 3D Siamese network. More importantly, we deal with data imbalance by exploiting two unsupervised strategies: a) sample re-weighting, and b) balancing the composition of training batches. When combining the learned self-supervised feature with traditional radiomics, we show significant improvement in brain tumor classification and lung cancer staging tasks covering MRI and CT imaging modalities. Codes are available in https://github.com/hongweilibran/imbalanced-SSL.
- Published
- 2021