Back to Search Start Over

Optimizing Neural Network Embeddings Using a Pair-Wise Loss for Text-Independent Speaker Verification

Authors :
Tianyan Zhou
Hira Dhamyal
Rita Singh
Bhiksha Raj
Source :
ASRU
Publication Year :
2019
Publisher :
IEEE, 2019.

Abstract

This paper proposes a new loss function called the “quartet” loss for the better optimization of the neural networks for matching tasks. For such tasks, where neural network embeddings are the key component, the optimization of the network for better embeddings is critical. The embeddings are required to be class discriminative, resulting in minimal inter-class variation and maximal intra-class variation even for unseen classes for better generalization of the network. The quartet loss explicitly computes the distance metric between pairs of inputs and increases the gap between the similarity score distributions between the same class pairs and the different class pairs. We evaluate on the speaker verification task and demonstrate the performance of the loss on our proposed neural network.

Details

Database :
OpenAIRE
Journal :
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)
Accession number :
edsair.doi...........dadd90ed3373f2437b5ff8937cf2c6c7
Full Text :
https://doi.org/10.1109/asru46091.2019.9003794