
Graph interpolating activation improves both natural and robust accuracies in data-efficient deep learning.

Authors :
WANG, BAO
OSHER, STAN J.
Source :
European Journal of Applied Mathematics. Jun 2021, Vol. 32 Issue 3, p540-569. 30p.
Publication Year :
2021

Abstract

Improving the accuracy and robustness of deep neural nets (DNNs) and adapting them to small training data are primary tasks in deep learning (DL) research. In this paper, we replace the output activation function of DNNs, typically the data-agnostic softmax function, with a graph Laplacian-based high-dimensional interpolating function which, in the continuum limit, converges to the solution of a Laplace–Beltrami equation on a high-dimensional manifold. Furthermore, we propose end-to-end training and testing algorithms for this new architecture. The proposed DNN with graph interpolating activation integrates the advantages of both deep learning and manifold learning. Compared to conventional DNNs with the softmax function as output activation, the new framework demonstrates the following major advantages: First, it is better suited to data-efficient learning, in which we train high-capacity DNNs without a large amount of training data. Second, it markedly improves both natural accuracy on clean images and robust accuracy on adversarial images crafted by both white-box and black-box adversarial attacks. Third, it is a natural choice for semi-supervised learning. This paper is a significant extension of our earlier work published at NeurIPS 2018. For reproducibility, the code is available at https://github.com/BaoWangMath/DNN-DataDependentActivation. [ABSTRACT FROM AUTHOR]
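To make the idea of a graph-based interpolating output activation concrete, the following is a minimal sketch of harmonic extension on a Gaussian-weighted graph: class labels known on a few "template" points are propagated to unlabeled points by solving a graph Laplacian system. This is an illustrative assumption-laden toy, not the authors' exact weighted nonlocal Laplacian (WNLL) implementation from the linked repository; the function name, bandwidth parameter `sigma`, and the toy data are all hypothetical.

```python
# Minimal sketch (assumed, not the paper's exact method): graph Laplacian
# interpolation of class labels from labeled to unlabeled feature points.
import numpy as np

def laplacian_interpolate(feat_labeled, labels, feat_unlabeled, n_classes, sigma=1.0):
    """Return class scores for unlabeled features via harmonic extension."""
    X = np.vstack([feat_labeled, feat_unlabeled])
    n_l = len(feat_labeled)

    # Pairwise Gaussian weights w_ij = exp(-||x_i - x_j||^2 / sigma^2)
    d2 = ((X[:, None, :] - X[None, :, :]) ** 2).sum(-1)
    W = np.exp(-d2 / sigma**2)
    np.fill_diagonal(W, 0.0)

    # Unnormalised graph Laplacian L = D - W
    L = np.diag(W.sum(axis=1)) - W

    # One-hot encode the labeled points
    F_l = np.eye(n_classes)[labels]

    # Harmonic-extension system on the unlabeled block: L_uu F_u = W_ul F_l
    L_uu = L[n_l:, n_l:]
    W_ul = W[n_l:, :n_l]
    F_u = np.linalg.solve(L_uu, W_ul @ F_l)
    return F_u  # row-wise class scores; argmax gives the predicted class

# Toy usage: two well-separated clusters, one labeled example per class
rng = np.random.default_rng(0)
X_l = np.array([[0.0, 0.0], [5.0, 5.0]])
y_l = np.array([0, 1])
X_u = np.vstack([rng.normal(0, 0.3, (5, 2)), rng.normal(5, 0.3, (5, 2))])
print(laplacian_interpolate(X_l, y_l, X_u, n_classes=2).argmax(axis=1))
```

In the paper's setting, the "features" would be the penultimate-layer outputs of the DNN and the interpolated scores would replace the softmax output; the sketch above only illustrates the interpolation step on fixed feature vectors.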

Details

Language :
English
ISSN :
0956-7925
Volume :
32
Issue :
3
Database :
Academic Search Index
Journal :
European Journal of Applied Mathematics
Publication Type :
Academic Journal
Accession number :
150152077
Full Text :
https://doi.org/10.1017/S0956792520000406