Back to Search
Start Over
Attention-Based LSTM Algorithm for Audio Replay Detection in Noisy Environments
- Source :
- Applied Sciences, Vol 9, Iss 8, p 1539 (2019)
- Publication Year :
- 2019
- Publisher :
- MDPI AG, 2019.
-
Abstract
- Even though audio replay detection has improved in recent years, its performance is known to severely deteriorate with the existence of strong background noises. Given the fact that different frames of an utterance have different impacts on the performance of spoofing detection, this paper introduces attention-based long short-term memory (LSTM) to extract representative frames for spoofing detection in noisy environments. With this attention mechanism, the specific and representative frame-level features will be automatically selected by adjusting their weights in the framework of attention-based LSTM. The experiments, conducted using the ASVspoof 2017 dataset version 2.0, show that the equal error rate (EER) of the proposed approach was about 13% lower than the constant Q cepstral coefficients-Gaussian mixture model (CQCC-GMM) baseline in noisy environments with four different signal-to-noise ratios (SNR). Meanwhile, the proposed algorithm also improved the performance of traditional LSTM on audio replay detection systems in noisy environments. Experiments using bagging with different frame lengths were also conducted to further improve the proposed approach.
Details
- Language :
- English
- ISSN :
- 20763417 and 27694739
- Volume :
- 9
- Issue :
- 8
- Database :
- Directory of Open Access Journals
- Journal :
- Applied Sciences
- Publication Type :
- Academic Journal
- Accession number :
- edsdoj.549650cd27694739874274c96cf1b1ae
- Document Type :
- article
- Full Text :
- https://doi.org/10.3390/app9081539