Back to Search Start Over

Randomization effect on iterative-based speaker diarization system for telephone conversations

Authors :
Ami Moyal
Tal Furmanov
Lidiya Aminov
Itshak Lapidot
Source :
2014 IEEE 28th Convention of Electrical & Electronics Engineers in Israel (IEEEI).
Publication Year :
2014
Publisher :
IEEE, 2014.

Abstract

The primary objective of speaker diarization system is to designate speech segments to one of K speakers in the conversation. We use a hidden-distortion-model (HDM)-based system. HDM allows using different emission models as speaker models. We investigate the effect of randomization in two different levels. One level is stochastic training versus deterministic training and the other, random model initialization versus preserving initialization from the previous iteration. The emission models were codebooks (CBs) trained using K-means algorithm, both, batch and stochastic versions, as well as a self-organizing map (SOM) in its stochastic version. The evaluation performed on 108 telephone conversations from the LDC CallHome corpus. We will show that randomizing is always outperforming the deterministic training. Stochastic training demonstrated relative improvement of 3.5%. Random initialization achieved relative improvement of 7.28% comparing to preservation of initialization from the previous iteration.

Details

Database :
OpenAIRE
Journal :
2014 IEEE 28th Convention of Electrical & Electronics Engineers in Israel (IEEEI)
Accession number :
edsair.doi...........e5dba10197b042dea9a9d0bad3e7b0b9
Full Text :
https://doi.org/10.1109/eeei.2014.7005738