
Sparse support vector machines with L0 approximation for ultra-high dimensional omics data.

Authors :
Liu Z
Elashoff D
Piantadosi S
Source :
Artificial intelligence in medicine [Artif Intell Med] 2019 May; Vol. 96, pp. 134-141. Date of Electronic Publication: 2019 Apr 30.
Publication Year :
2019

Abstract

Omics data usually have ultra-high dimension (p) and small sample size (n). Standard support vector machines (SVMs), which minimize the L2 norm of the primal variables, lead to sparse solutions only for the dual variables. L1-based SVMs, which directly minimize the L1 norm, have been used for feature selection with omics data. However, most current methods solve the primal formulation of the problem directly, which is not computationally scalable: the computational complexity increases with the number of features. In addition, the L1 norm is known to be asymptotically biased and not consistent for feature selection. In this paper, we develop an efficient method for sparse support vector machines with L0 norm approximation. The proposed method approximates L0 minimization by solving a series of L2 optimization problems, which can be formulated with dual variables. It finds the optimal solution for the p primal variables by estimating n dual variables, which is more efficient as long as the sample size is small. The L0 approximation leads to sparsity in both dual and primal variables and can be used for both feature and sample selection. In simulations, the proposed method identifies far fewer features while achieving similar performance. We apply the proposed method to feature selection with metagenomic sequencing and gene expression data. It can identify biologically important genes and taxa efficiently. (Copyright © 2019 Elsevier B.V. All rights reserved.)
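The idea in the abstract of approximating L0 minimization by a series of L2 problems, each solved through n dual variables rather than p primal ones, can be illustrated with a minimal sketch. This is not the authors' implementation: it swaps the SVM hinge loss for a squared-error loss (a least-squares variant) so that each L2 step has a closed form, and the function name `l0_approx_dual_ridge` and the parameters `lam`, `n_iter`, and `tol` are hypothetical. Each step solves a reweighted ridge problem with penalty sum_j w_j^2 / d_j, where d_j is the squared coefficient from the previous step, using the identity w = D X^T (X D X^T + lam I)^(-1) y so that only an n-by-n system is solved.

```python
import numpy as np

def l0_approx_dual_ridge(X, y, lam=1.0, n_iter=50, tol=1e-8):
    """Illustrative L0 approximation via iteratively reweighted L2 problems.

    Assumption: squared-error loss stands in for the SVM hinge loss.
    Each iteration solves an n x n dual system instead of a p x p primal one.
    """
    n, p = X.shape
    w = np.ones(p)  # with unit weights, the first iteration is plain ridge
    for _ in range(n_iter):
        d = w ** 2                    # per-feature weights from the last step
        XD = X * d                    # X @ diag(d), shape (n, p)
        K = XD @ X.T                  # weighted Gram matrix, shape (n, n)
        # n dual variables: (X D X^T + lam I) alpha = y
        alpha = np.linalg.solve(K + lam * np.eye(n), y)
        w_new = XD.T @ alpha          # recover the p primal variables
        converged = np.max(np.abs(w_new - w)) < tol
        w = w_new
        if converged:
            break
    return w
```

With n much smaller than p, the cost per iteration is dominated by forming the n-by-n matrix and solving it, which is why a dual formulation of this kind is attractive for omics-scale p. Coefficients that shrink are squared into ever smaller weights and are driven toward zero, while the surviving coefficients each contribute roughly one unit of penalty, mimicking an L0 count.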

Details

Language :
English
ISSN :
1873-2860
Volume :
96
Database :
MEDLINE
Journal :
Artificial intelligence in medicine
Publication Type :
Academic Journal
Accession number :
31164207
Full Text :
https://doi.org/10.1016/j.artmed.2019.04.004