HyperSTAR: Task-Aware Hyperparameter Recommendation for Training and Compression.

Authors :
Liu, Chang
Mittal, Gaurav
Karianakis, Nikolaos
Fragoso, Victor
Yu, Ye
Fu, Yun
Chen, Mei
Source :
International Journal of Computer Vision. Jun 2024, Vol. 132, Issue 6, pp. 1913-1927. 15 pp.
Publication Year :
2024

Abstract

Hyperparameter optimization (HPO) methods alleviate the significant effort required to obtain hyperparameters that perform optimally on visual learning problems. Existing methods are computationally inefficient because they are task agnostic (i.e., they do not adapt to a given task). We present HyperSTAR (System for Task Aware Hyperparameter Recommendation), a task-aware HPO algorithm that improves HPO efficiency for a target dataset by using prior knowledge from previous hyperparameter searches to recommend effective hyperparameters conditioned on the target dataset. HyperSTAR ranks and recommends hyperparameters by predicting their performance on the target dataset. To do so, it learns a joint dataset-hyperparameter space in an end-to-end manner that enables its performance predictor to use previously found effective hyperparameters for other similar tasks. The hyperparameter recommendations of HyperSTAR combined with existing HPO techniques lead to a task-aware HPO system that reduces the time to find the optimal hyperparameters for the target learning problem. Our experiments on image classification, object detection, and model pruning validate that HyperSTAR reduces the evaluation of different hyperparameter configurations by about 50% compared to existing methods and, when combined with Hyperband, uses only 25% of the budget required by the vanilla Hyperband and Bayesian Optimized Hyperband to achieve the best performance. [ABSTRACT FROM AUTHOR]
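
The ranking step described in the abstract (score each candidate configuration with a performance predictor conditioned on the target dataset, then recommend the highest-scoring ones) can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and is not taken from the paper: `rank_configs`, `toy_predictor`, the one-dimensional `task_repr`, and the single `lr` hyperparameter are hypothetical, and the toy scoring rule stands in for the learned predictor, whose architecture this record does not describe.

```python
import math
from typing import Callable, Dict, List, Sequence, Tuple

Config = Dict[str, float]
Predictor = Callable[[Sequence[float], Config], float]


def rank_configs(
    predict: Predictor,
    task_repr: Sequence[float],
    candidates: List[Config],
    top_k: int,
) -> List[Tuple[float, Config]]:
    """Score every candidate for one target task and return the top_k by predicted performance."""
    scored = [(predict(task_repr, cfg), cfg) for cfg in candidates]
    # Sort on the predicted score only, highest first.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:top_k]


def toy_predictor(task_repr: Sequence[float], cfg: Config) -> float:
    # Hypothetical stand-in for a learned predictor: the score peaks when
    # log10(learning rate) matches the first entry of the task representation.
    return -abs(math.log10(cfg["lr"]) - task_repr[0])


candidates = [{"lr": 0.1}, {"lr": 0.01}, {"lr": 0.0001}]
recommended = rank_configs(toy_predictor, [-2.0], candidates, top_k=2)
```

In a setup like the one the abstract outlines, a recommended list of this kind would be handed to an existing HPO method such as Hyperband as its initial set of configurations; how that hand-off is done in HyperSTAR is not specified in this record.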

Details

Language :
English
ISSN :
0920-5691
Volume :
132
Issue :
6
Database :
Academic Search Index
Journal :
International Journal of Computer Vision
Publication Type :
Academic Journal
Accession number :
177595859
Full Text :
https://doi.org/10.1007/s11263-023-01961-0