Back to Search
Start Over
Using random forests to uncover the predictive power of distance-varying cell interactions in tumor microenvironments.
- Source :
- PLoS Computational Biology; 6/14/2024, Vol. 20 Issue 6, p1-28, 28p
- Publication Year :
- 2024
-
Abstract
- Tumor microenvironments (TMEs) contain vast amounts of information on patient's cancer through their cellular composition and the spatial distribution of tumor cells and immune cell populations. Exploring variations in TMEs between patient groups, as well as determining the extent to which this information can predict outcomes such as patient survival or treatment success with emerging immunotherapies, is of great interest. Moreover, in the face of a large number of cell interactions to consider, we often wish to identify specific interactions that are useful in making such predictions. We present an approach to achieve these goals based on summarizing spatial relationships in the TME using spatial K functions, and then applying functional data analysis and random forest models to both predict outcomes of interest and identify important spatial relationships. This approach is shown to be effective in simulation experiments at both identifying important spatial interactions while also controlling the false discovery rate. We further used the proposed approach to interrogate two real data sets of Multiplexed Ion Beam Images of TMEs in triple negative breast cancer and lung cancer patients. The methods proposed are publicly available in a companion R package funkycells. Author summary: Spatial data on the tumor microenvironment (TME) are becoming more prevalent. Existing methods to interrogate such data often have several limitations: (1) they can rely on estimating the spatial relationships among cells by examining simple counts of cells within a single radius, (2) they may not come with ways to evaluate the statistical significance of any findings, or (3) they model individual interactions independently of other interactions. Our approach leverages techniques in spatial statistics and uses a benchmark ensemble machine learning method to address each of these deficiencies; it (1) uses K functions to encode the relative densities of cells over all radii up to a user-selected maximum radius, (2) employs permutation and cross-validation to evaluate the statistical significance of any findings on the spatial interactions in the TME, and (3) models multiple interactions simultaneously. Our approach is freely available with an R implementation called funkycells. In the analysis of two real data sets, we have seen that the method performs well, and gives the expected results. We think this will be a robust tool for researchers looking to interrogate TME data. [ABSTRACT FROM AUTHOR]
Details
- Language :
- English
- ISSN :
- 1553734X
- Volume :
- 20
- Issue :
- 6
- Database :
- Complementary Index
- Journal :
- PLoS Computational Biology
- Publication Type :
- Academic Journal
- Accession number :
- 177908378
- Full Text :
- https://doi.org/10.1371/journal.pcbi.1011361