1. Bayesian risk prediction model for colorectal cancer mortality through integration of clinicopathologic and genomic data
- Author
-
Zhao, M. (Melissa), Lau, M. C. (Mai Chan), Haruki, K. (Koichiro), Väyrynen, J. P. (Juha P.), Gurjao, C. (Carino), Väyrynen, S. A. (Sara A.), Costa, A. D. (Andressa Dias), Borowsky, J. (Jennifer), Fujiyoshi, K. (Kenji), Arima, K. (Kota), Hamada, T. (Tsuyoshi), Lennerz, J. K. (Jochen K.), Fuchs, C. S. (Charles S.), Nishihara, R. (Reiko), Chan, A. T. (Andrew T.), Ng, K. (Kimmie), Zhang, X. (Xuehong), Meyerhardt, J. A. (Jeffrey A.), Song, M. (Mingyang), Wang, M. (Molin), Giannakis, M. (Marios), Nowak, J. A. (Jonathan A.), Yu, K.-H. (Kun-Hsing), Ugai, T. (Tomotaka), Ogino, S. (Shuji), Zhao, M. (Melissa), Lau, M. C. (Mai Chan), Haruki, K. (Koichiro), Väyrynen, J. P. (Juha P.), Gurjao, C. (Carino), Väyrynen, S. A. (Sara A.), Costa, A. D. (Andressa Dias), Borowsky, J. (Jennifer), Fujiyoshi, K. (Kenji), Arima, K. (Kota), Hamada, T. (Tsuyoshi), Lennerz, J. K. (Jochen K.), Fuchs, C. S. (Charles S.), Nishihara, R. (Reiko), Chan, A. T. (Andrew T.), Ng, K. (Kimmie), Zhang, X. (Xuehong), Meyerhardt, J. A. (Jeffrey A.), Song, M. (Mingyang), Wang, M. (Molin), Giannakis, M. (Marios), Nowak, J. A. (Jonathan A.), Yu, K.-H. (Kun-Hsing), Ugai, T. (Tomotaka), and Ogino, S. (Shuji)
- Abstract
Routine tumor-node-metastasis (TNM) staging of colorectal cancer is imperfect in predicting survival due to tumor pathobiological heterogeneity and imprecise assessment of tumor spread. We leveraged Bayesian additive regression trees (BART), a statistical learning technique, to comprehensively analyze patient-specific tumor characteristics for the improvement of prognostic prediction. Of 75 clinicopathologic, immune, microbial, and genomic variables in 815 stage II–III patients within two U.S.-wide prospective cohort studies, the BART risk model identified seven stable survival predictors. Risk stratifications (low risk, intermediate risk, and high risk) based on model-predicted survival were statistically significant (hazard ratios 0.19–0.45, vs. higher risk; P < 0.0001) and could be externally validated using The Cancer Genome Atlas (TCGA) data (P = 0.0004). BART demonstrated model flexibility, interpretability, and comparable or superior performance to other machine-learning models. Integrated bioinformatic analyses using BART with tumor-specific factors can robustly stratify colorectal cancer patients into prognostic groups and be readily applied to clinical oncology practice.
- Published
- 2023