Back to Search
Start Over
End-to-End Residual CNN with L-GM Loss Speaker Verification System
- Source :
- DSL
- Publication Year :
- 2018
- Publisher :
- arXiv, 2018.
-
Abstract
- We propose an end-to-end speaker verification system based on the neural network and trained by a loss function with less computational complexity. The end-to-end speaker verification system in this paper consists of a ResNet architecture to extract features from utterance, then produces utterance-level speaker embeddings, and train using the large-margin Gaussian Mixture loss function. Influenced by the large-margin and likelihood regularization, large-margin Gaussian Mixture loss function benefits the speaker verification performance. Experimental results demonstrate that the Residual CNN with large-margin Gaussian Mixture loss outperforms DNN-based i-vector baseline by more than 10% improvement in accuracy rate.<br />Comment: 5 pages. arXiv admin note: text overlap with arXiv:1803.02988, arXiv:1705.02304, arXiv:1706.08612 by other authors
- Subjects :
- FOS: Computer and information sciences
Sound (cs.SD)
Artificial neural network
Computational complexity theory
Computer science
Speech recognition
Gaussian
Feature extraction
02 engineering and technology
Function (mathematics)
Residual
Regularization (mathematics)
Computer Science - Sound
030507 speech-language pathology & audiology
03 medical and health sciences
symbols.namesake
0202 electrical engineering, electronic engineering, information engineering
symbols
020201 artificial intelligence & image processing
0305 other medical science
Hidden Markov model
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- DSL
- Accession number :
- edsair.doi.dedup.....05999fd2feb97af5113294b483765b64
- Full Text :
- https://doi.org/10.48550/arxiv.1805.00645