Back to Search
Start Over
Dimension-wise sparse low-rank approximation of a matrix with application to variable selection in high-dimensional integrative analyzes of association.
- Source :
-
Journal of Applied Statistics . Dec2022, Vol. 49 Issue 15, p3889-3907. 19p. 1 Chart, 5 Graphs. - Publication Year :
- 2022
-
Abstract
- Many research proposals involve collecting multiple sources of information from a set of common samples, with the goal of performing an integrative analysis describing the associations between sources. We propose a method that characterizes the dominant modes of co-variation between the variables in two datasets while simultaneously performing variable selection. Our method relies on a sparse, low rank approximation of a matrix containing pairwise measures of association between the two sets of variables. We show that the proposed method shares a close connection with another group of methods for integrative data analysis – sparse canonical correlation analysis (CCA). Under some assumptions, the proposed method and sparse CCA aim to select the same subsets of variables. We show through simulation that the proposed method can achieve better variable selection accuracies than two state-of-the-art sparse CCA algorithms. Empirically, we demonstrate through the analysis of DNA methylation and gene expression data that the proposed method selects variables that have as high or higher canonical correlation than the variables selected by sparse CCA methods, which is a rather surprising finding given that objective function of the proposed method does not actually maximize the canonical correlation. [ABSTRACT FROM AUTHOR]
Details
- Language :
- English
- ISSN :
- 02664763
- Volume :
- 49
- Issue :
- 15
- Database :
- Academic Search Index
- Journal :
- Journal of Applied Statistics
- Publication Type :
- Academic Journal
- Accession number :
- 159934780
- Full Text :
- https://doi.org/10.1080/02664763.2021.1967892