Back to Search Start Over

Using random forests to uncover the predictive power of distance-varying cell interactions in tumor microenvironments.

Authors :
VanderDoes, Jeremy
Marceaux, Claire
Yokote, Kenta
Asselin-Labat, Marie-Liesse
Rice, Gregory
Hywood, Jack D.
Source :
PLoS Computational Biology; 6/14/2024, Vol. 20 Issue 6, p1-28, 28p
Publication Year :
2024

Abstract

Tumor microenvironments (TMEs) contain vast amounts of information on patient's cancer through their cellular composition and the spatial distribution of tumor cells and immune cell populations. Exploring variations in TMEs between patient groups, as well as determining the extent to which this information can predict outcomes such as patient survival or treatment success with emerging immunotherapies, is of great interest. Moreover, in the face of a large number of cell interactions to consider, we often wish to identify specific interactions that are useful in making such predictions. We present an approach to achieve these goals based on summarizing spatial relationships in the TME using spatial K functions, and then applying functional data analysis and random forest models to both predict outcomes of interest and identify important spatial relationships. This approach is shown to be effective in simulation experiments at both identifying important spatial interactions while also controlling the false discovery rate. We further used the proposed approach to interrogate two real data sets of Multiplexed Ion Beam Images of TMEs in triple negative breast cancer and lung cancer patients. The methods proposed are publicly available in a companion R package funkycells. Author summary: Spatial data on the tumor microenvironment (TME) are becoming more prevalent. Existing methods to interrogate such data often have several limitations: (1) they can rely on estimating the spatial relationships among cells by examining simple counts of cells within a single radius, (2) they may not come with ways to evaluate the statistical significance of any findings, or (3) they model individual interactions independently of other interactions. Our approach leverages techniques in spatial statistics and uses a benchmark ensemble machine learning method to address each of these deficiencies; it (1) uses K functions to encode the relative densities of cells over all radii up to a user-selected maximum radius, (2) employs permutation and cross-validation to evaluate the statistical significance of any findings on the spatial interactions in the TME, and (3) models multiple interactions simultaneously. Our approach is freely available with an R implementation called funkycells. In the analysis of two real data sets, we have seen that the method performs well, and gives the expected results. We think this will be a robust tool for researchers looking to interrogate TME data. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
1553734X
Volume :
20
Issue :
6
Database :
Complementary Index
Journal :
PLoS Computational Biology
Publication Type :
Academic Journal
Accession number :
177908378
Full Text :
https://doi.org/10.1371/journal.pcbi.1011361