1. High-Frequency Adversarial Defense for Speech and Audio
- Author
-
Muhammad A. Shah, Bhiksha Raj, and Raphael Olivier
- Subjects
Masking (art) ,symbols.namesake ,Adversarial system ,Gaussian noise ,Computer science ,Speech recognition ,Frequency domain ,symbols ,Spectrogram ,Smoothing ,Task (project management) ,Domain (software engineering) - Abstract
Recent work suggests that adversarial examples are enabled by high-frequency components in the dataset. In the speech domain where spectrograms are used extensively, masking those components seems like a sound direction for defenses against attacks. We explore a smoothing approach based on additive noise masking in priority high frequencies. We show that this approach is much more robust than the naive noise filtering approach, and a promising research direction. We successfully apply our defense on a Librispeech speaker identification task, and on the UrbanSound8K audio classification dataset.
- Published
- 2021