Back to Search
Start Over
Efficient sparse semismooth Newton methods for the clustered lasso problem
- Publication Year :
- 2018
- Publisher :
- arXiv, 2018.
-
Abstract
- We focus on solving the clustered lasso problem, which is a least squares problem with the $\ell_1$-type penalties imposed on both the coefficients and their pairwise differences to learn the group structure of the regression parameters. Here we first reformulate the clustered lasso regularizer as a weighted ordered-lasso regularizer, which is essential in reducing the computational cost from $O(n^2)$ to $O(n\log (n))$. We then propose an inexact semismooth Newton augmented Lagrangian ({\sc Ssnal}) algorithm to solve the clustered lasso problem or its dual via this equivalent formulation, depending on whether the sample size is larger than the dimension of the features. An essential component of the {\sc Ssnal} algorithm is the computation of the generalized Jacobian of the proximal mapping of the clustered lasso regularizer. Based on the new formulation, we derive an efficient procedure for its computation. Comprehensive results on the global convergence and local linear convergence of the {\sc Ssnal} algorithm are established. For the purpose of exposition and comparison, we also summarize/design several first-order methods that can be used to solve the problem under consideration, but with the key improvement from the new formulation of the clustered lasso regularizer. As a demonstration of the applicability of our algorithms, numerical experiments on the clustered lasso problem are performed. The experiments show that the {\sc Ssnal} algorithm substantially outperforms the best alternative algorithm for the clustered lasso problem.
- Subjects :
- FOS: Computer and information sciences
0211 other engineering and technologies
MathematicsofComputing_NUMERICALANALYSIS
Machine Learning (stat.ML)
010103 numerical & computational mathematics
02 engineering and technology
01 natural sciences
Least squares
Theoretical Computer Science
Statistics::Machine Learning
Lasso (statistics)
Statistics - Machine Learning
FOS: Mathematics
Applied mathematics
0101 mathematics
Mathematics - Optimization and Control
Mathematics
021103 operations research
Augmented Lagrangian method
Group (mathematics)
Applied Mathematics
ComputingMethodologies_PATTERNRECOGNITION
Optimization and Control (math.OC)
Convex optimization
Pairwise comparison
Focus (optics)
Software
Subjects
Details
- Database :
- OpenAIRE
- Accession number :
- edsair.doi.dedup.....83732268f36eb9381a518951fad0d98f
- Full Text :
- https://doi.org/10.48550/arxiv.1808.07181