Back to Search
Start Over
Optimizing Neural Network Embeddings Using a Pair-Wise Loss for Text-Independent Speaker Verification
- Source :
- ASRU
- Publication Year :
- 2019
- Publisher :
- IEEE, 2019.
-
Abstract
- This paper proposes a new loss function called the “quartet” loss for the better optimization of the neural networks for matching tasks. For such tasks, where neural network embeddings are the key component, the optimization of the network for better embeddings is critical. The embeddings are required to be class discriminative, resulting in minimal inter-class variation and maximal intra-class variation even for unseen classes for better generalization of the network. The quartet loss explicitly computes the distance metric between pairs of inputs and increases the gap between the similarity score distributions between the same class pairs and the different class pairs. We evaluate on the speaker verification task and demonstrate the performance of the loss on our proposed neural network.
Details
- Database :
- OpenAIRE
- Journal :
- 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)
- Accession number :
- edsair.doi...........dadd90ed3373f2437b5ff8937cf2c6c7
- Full Text :
- https://doi.org/10.1109/asru46091.2019.9003794