Information-incorporated sparse hierarchical cancer heterogeneity analysis.

Authors :
Han W
Zhang S
Ma S
Ren M
Source :
Statistics in medicine [Stat Med] 2024 May 20; Vol. 43 (11), pp. 2280-2297. Date of Electronic Publication: 2024 Mar 30.
Publication Year :
2024

Abstract

Cancer heterogeneity analysis is essential for precision medicine. Most of the existing heterogeneity analyses only consider a single type of data and ignore the possible sparsity of important features. In cancer clinical practice, it has been suggested that two types of data, pathological imaging and omics data, are commonly collected and can produce hierarchical heterogeneous structures, in which the refined sub-subgroup structure determined by omics features can be nested in the rough subgroup structure determined by the imaging features. Moreover, sparsity pursuit has extraordinary significance and is more challenging for heterogeneity analysis, because the important features may not be the same in different subgroups, which is ignored by the existing heterogeneity analyses. Fortunately, rich information from previous literature (for example, those deposited in PubMed) can be used to assist feature selection in the present study. Advancing from the existing analyses, in this study, we propose a novel sparse hierarchical heterogeneity analysis framework, which can integrate two types of features and incorporate prior knowledge to improve feature selection. The proposed approach has satisfactory statistical properties and competitive numerical performance. A TCGA real data analysis demonstrates the practical value of our approach in analyzing data heterogeneity and sparsity.
(© 2024 John Wiley & Sons Ltd.)

Details

Language :
English
ISSN :
1097-0258
Volume :
43
Issue :
11
Database :
MEDLINE
Journal :
Statistics in medicine
Publication Type :
Academic Journal
Accession number :
38553996
Full Text :
https://doi.org/10.1002/sim.10071