Back to Search Start Over

Dimension-wise sparse low-rank approximation of a matrix with application to variable selection in high-dimensional integrative analyzes of association.

Authors :
Poythress, J. C.
Park, Cheolwoo
Ahn, Jeongyoun
Source :
Journal of Applied Statistics. Dec2022, Vol. 49 Issue 15, p3889-3907. 19p. 1 Chart, 5 Graphs.
Publication Year :
2022

Abstract

Many research proposals involve collecting multiple sources of information from a set of common samples, with the goal of performing an integrative analysis describing the associations between sources. We propose a method that characterizes the dominant modes of co-variation between the variables in two datasets while simultaneously performing variable selection. Our method relies on a sparse, low rank approximation of a matrix containing pairwise measures of association between the two sets of variables. We show that the proposed method shares a close connection with another group of methods for integrative data analysis – sparse canonical correlation analysis (CCA). Under some assumptions, the proposed method and sparse CCA aim to select the same subsets of variables. We show through simulation that the proposed method can achieve better variable selection accuracies than two state-of-the-art sparse CCA algorithms. Empirically, we demonstrate through the analysis of DNA methylation and gene expression data that the proposed method selects variables that have as high or higher canonical correlation than the variables selected by sparse CCA methods, which is a rather surprising finding given that objective function of the proposed method does not actually maximize the canonical correlation. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
02664763
Volume :
49
Issue :
15
Database :
Academic Search Index
Journal :
Journal of Applied Statistics
Publication Type :
Academic Journal
Accession number :
159934780
Full Text :
https://doi.org/10.1080/02664763.2021.1967892