1. ASVtorch toolkit: Speaker verification with deep neural networks
- Author
-
Kong Aik Lee, Ville Vestman, and Tomi Kinnunen
- Subjects
Speaker recognition ,PyTorch ,Deep learning ,Computer software ,QA76.75-76.765 - Abstract
The human voice differs substantially between individuals. This facilitates automatic speaker verification (ASV) — recognizing a person from his/her voice. ASV accuracy has substantially increased throughout the past decade due to recent advances in machine learning, particularly deep learning methods. An unfortunate downside has been substantially increased complexity of ASV systems. To help non-experts to kick-start reproducible ASV development, a state-of-the-art toolkit implementing various ASV pipelines and functionalities is required. To this end, we introduce a new open-source toolkit, ASVtorch, implemented in Python using the widely used PyTorch machine learning framework.
- Published
- 2021
- Full Text
- View/download PDF