A review on speech separation in cocktail party environment: challenges and approaches.

Authors :
Agrawal, Jharna
Gupta, Manish
Garg, Hitendra
Source :
Multimedia Tools & Applications; Aug2023, Vol. 82 Issue 20, p31035-31067, 33p
Publication Year :
2023

Abstract

The cocktail party problem, which is tracing and identifying a specific speaker's speech while numerous speakers communicate concurrently, is one of the crucial problems still to be addressed for automated speech recognition (ASR) and speaker recognition. In this study, we attempt to thoroughly explore traditional methods for speech separation in a cocktail party environment and further analyze traditional single-channel methods, for example source-driven methods like Computational Auditory Scene Analysis (CASA), data-driven methods like non-negative matrix factorization (NMF), and model-driven methods; customary multi-channel methods such as beamforming and multi-channel blind source separation; and the newly developed deep learning approaches such as meta-learning-based methods and self-supervised learning. This paper further highlights numerous datasets and evaluation metrics in the domain of speech processing and compares traditional methods with deep-learning-based methods for speech separation. This study provides a basic understanding and comprehensive knowledge of state-of-the-art research in the area of speech separation and serves as a brief overview for new researchers. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
1380-7501
Volume :
82
Issue :
20
Database :
Complementary Index
Journal :
Multimedia Tools & Applications
Publication Type :
Academic Journal
Accession number :
167307515
Full Text :
https://doi.org/10.1007/s11042-023-14649-x