Back to Search Start Over

Focus on the Sound around You: Monaural Target Speaker Extraction via Distance and Speaker Information

Authors :
Lin, Jiuxin
Wang, Peng
Dinkel, Heinrich
Chen, Jun
Wu, Zhiyong
Yan, Zhiyong
Wang, Yongqing
Zhang, Junbo
Wang, Yujun
Publication Year :
2023
Publisher :
arXiv, 2023.

Abstract

Previously, Target Speaker Extraction (TSE) has yielded outstanding performance in certain application scenarios for speech enhancement and source separation. However, obtaining auxiliary speaker-related information is still challenging in noisy environments with significant reverberation. inspired by the recently proposed distance-based sound separation, we propose the near sound (NS) extractor, which leverages distance information for TSE to reliably extract speaker information without requiring previous speaker enrolment, called speaker embedding self-enrollment (SESE). Full- & sub-band modeling is introduced to enhance our NS-Extractor's adaptability towards environments with significant reverberation. Experimental results on several cross-datasets demonstrate the effectiveness of our improvements and the excellent performance of our proposed NS-Extractor in different application scenarios.<br />Accepted by InterSpeech2023

Details

Database :
OpenAIRE
Accession number :
edsair.doi.dedup.....e305fc21b7e08d89a669f83bbc503acd
Full Text :
https://doi.org/10.48550/arxiv.2306.16241