Back to Search Start Over

Binaural Multichannel Blind Speaker Separation With a Causal Low-Latency and Low-Complexity Approach

Authors :
Nils L. Westhausen
Bernd T. Meyer
Source :
IEEE Open Journal of Signal Processing, Vol 5, Pp 238-247 (2024)
Publication Year :
2024
Publisher :
IEEE, 2024.

Abstract

In this article, we introduce a causal low-latency low-complexity approach for binaural multichannel blind speaker separation in noisy reverberant conditions. The model, referred to as Group Communication Binaural Filter and Sum Network (GCBFSnet) predicts complex filters for filter-and-sum beamforming in the time-frequency domain. We apply Group Communication (GC), i.e., latent model variables are split into groups and processed with a shared sequence model with the aim of reducing the complexity of a simple model only containing one convolutional and one recurrent module. With GC we are able to reduce the size of the model by up to 83% and the complexity up to 73% compared to the model without GC, while mostly retaining performance. Even for the smallest model configuration, GCBFSnet matches the performance of a low-complexity TasNet baseline in most metrics despite the larger size and higher number of required operations of the baseline.

Details

Language :
English
ISSN :
26441322
Volume :
5
Database :
Directory of Open Access Journals
Journal :
IEEE Open Journal of Signal Processing
Publication Type :
Academic Journal
Accession number :
edsdoj.43417e269aa94b4e9462dfe9903494ac
Document Type :
article
Full Text :
https://doi.org/10.1109/OJSP.2023.3343320