Can weight sharing outperform random architecture search? An investigation with TuNAS

Authors :: Bender, Gabriel
Liu, Hanxiao
Chen, Bo
Chu, Grace
Cheng, Shuyang
Kindermans, Pieter-Jan
Le, Quoc
Source :: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020, pp. 14323-14332
Publication Year :: 2020
Abstract: Efficient Neural Architecture Search methods based on weight sharing have shown good promise in democratizing Neural Architecture Search for computer vision models. There is, however, an ongoing debate whether these efficient methods are significantly better than random search. Here we perform a thorough comparison between efficient and random search methods on a family of progressively larger and more challenging search spaces for image classification and detection on ImageNet and COCO. While the efficacies of both methods are problem-dependent, our experiments demonstrate that there are large, realistic tasks where efficient search methods can provide substantial gains over random search. In addition, we propose and evaluate techniques which improve the quality of searched architectures and reduce the need for manual hyper-parameter tuning. Source code and experiment data are available at https://github.com/google-research/google-research/tree/master/tunas<br />Comment: Published at CVPR 2020

Subjects :: Computer Science - Machine Learning
Computer Science - Computer Vision and Pattern Recognition
Statistics - Machine Learning
I.2.10

Database :: arXiv
Journal :: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020, pp. 14323-14332
Publication Type :: Report
Accession number :: edsarx.2008.06120
Document Type :: Working Paper

Tools