Back to Search
Start Over
Binaural Multichannel Blind Speaker Separation With a Causal Low-Latency and Low-Complexity Approach
- Source :
- IEEE Open Journal of Signal Processing, Vol 5, Pp 238-247 (2024)
- Publication Year :
- 2024
- Publisher :
- IEEE, 2024.
-
Abstract
- In this article, we introduce a causal low-latency low-complexity approach for binaural multichannel blind speaker separation in noisy reverberant conditions. The model, referred to as Group Communication Binaural Filter and Sum Network (GCBFSnet) predicts complex filters for filter-and-sum beamforming in the time-frequency domain. We apply Group Communication (GC), i.e., latent model variables are split into groups and processed with a shared sequence model with the aim of reducing the complexity of a simple model only containing one convolutional and one recurrent module. With GC we are able to reduce the size of the model by up to 83% and the complexity up to 73% compared to the model without GC, while mostly retaining performance. Even for the smallest model configuration, GCBFSnet matches the performance of a low-complexity TasNet baseline in most metrics despite the larger size and higher number of required operations of the baseline.
Details
- Language :
- English
- ISSN :
- 26441322
- Volume :
- 5
- Database :
- Directory of Open Access Journals
- Journal :
- IEEE Open Journal of Signal Processing
- Publication Type :
- Academic Journal
- Accession number :
- edsdoj.43417e269aa94b4e9462dfe9903494ac
- Document Type :
- article
- Full Text :
- https://doi.org/10.1109/OJSP.2023.3343320