Back to Search
Start Over
When Not to Classify: Detection of Reverse Engineering Attacks on DNN Image Classifiers
- Publication Year :
- 2018
-
Abstract
- This paper addresses detection of a reverse engineering (RE) attack targeting a deep neural network (DNN) image classifier; by querying, RE's aim is to discover the classifier's decision rule. RE can enable test-time evasion attacks, which require knowledge of the classifier. Recently, we proposed a quite effective approach (ADA) to detect test-time evasion attacks. In this paper, we extend ADA to detect RE attacks (ADA-RE). We demonstrate our method is successful in detecting "stealthy" RE attacks before they learn enough to launch effective test-time evasion attacks.
Details
- Database :
- arXiv
- Publication Type :
- Report
- Accession number :
- edsarx.1811.02658
- Document Type :
- Working Paper