Audio Source Separation Based on Convolutive Transfer Function and Frequency-Domain Lasso Optimization

Authors :
Radu Horaud
Xiaofei Li
Laurent Girin
Interpretation and Modelling of Images and Videos (PERCEPTION)
Inria Grenoble - Rhône-Alpes
Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP)-Laboratoire Jean Kuntzmann (LJK)
Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes [2016-2019] (UGA [2016-2019])-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes [2016-2019] (UGA [2016-2019])
GIPSA - Cognitive Robotics, Interactive Systems, & Speech Processing (GIPSA-CRISSP)
Département Parole et Cognition (GIPSA-DPC)
Grenoble Images Parole Signal Automatique (GIPSA-lab)
Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP)-Institut Polytechnique de Grenoble - Grenoble Institute of Technology-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes [2016-2019] (UGA [2016-2019])-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP)-Institut Polytechnique de Grenoble - Grenoble Institute of Technology-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes [2016-2019] (UGA [2016-2019])-Grenoble Images Parole Signal Automatique (GIPSA-lab)
Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP)-Institut Polytechnique de Grenoble - Grenoble Institute of Technology-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes [2016-2019] (UGA [2016-2019])-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP)-Institut Polytechnique de Grenoble - Grenoble Institute of Technology-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes [2016-2019] (UGA [2016-2019])
European Project: 609465,EC:FP7:ICT,FP7-ICT-2013-10,EARS(2014)
European Project: 340113,EC:FP7:ERC,ERC-2013-ADG,VHIA(2014)
Source :
ICASSP 2017 - IEEE International Conference on Acoustics, Speech and Signal Processing, Mar 2017, New Orleans, United States. pp. 541-545, ⟨10.1109/ICASSP.2017.7952214⟩
Publication Year :
2017
Publisher :
HAL CCSD, 2017.

Abstract

This paper addresses the problem of under-determined convolutive audio source separation in a semi-oracle configuration where the mixing filters are assumed to be known. We propose a separation procedure based on the convolutive transfer function (CTF), which is a more appropriate model for strongly reverberant signals than the widely used multiplicative transfer function approximation. In the short-time Fourier transform domain, source signals are estimated by minimizing the mixture fitting cost using Lasso optimization, with an $\ell_1$-norm regularization to exploit the spectral sparsity of source signals. Experiments show that the proposed method achieves satisfactory performance on highly reverberant speech mixtures, with a much lower computational cost compared to time-domain dual techniques.
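The Lasso formulation summarized in the abstract can be illustrated with a minimal sketch for a single frequency bin. The abstract does not say which solver the authors use, so this example substitutes a generic proximal-gradient method (ISTA). All names here (`ctf_forward`, `ctf_adjoint`, `soft_threshold`, `lasso_ctf`, `lasso_cost`, and the arrays `a`, `s`, `x`) are hypothetical and chosen for illustration. It assumes the CTF model $x_i(p) = \sum_j \sum_q a_{ij}(q)\, s_j(p-q)$ over STFT frames $p$, with known filters $a$, and minimizes $\tfrac{1}{2}\|x - a \star s\|^2 + \lambda \|s\|_1$ over the complex source coefficients.

```python
import numpy as np

def ctf_forward(a, s):
    """Apply the CTF mixing for one frequency bin.

    a: complex filters, shape (I, J, Q) = (channels, sources, CTF taps).
    s: complex source STFT coefficients, shape (J, P) = (sources, frames).
    Returns x with x[i, p] = sum_j sum_q a[i, j, q] * s[j, p - q].
    """
    I, J, Q = a.shape
    P = s.shape[1]
    x = np.zeros((I, P), dtype=complex)
    for q in range(min(Q, P)):
        x[:, q:] += a[:, :, q] @ s[:, :P - q]
    return x

def ctf_adjoint(a, r):
    """Adjoint (conjugate transpose) of ctf_forward, applied to r of shape (I, P)."""
    I, J, Q = a.shape
    P = r.shape[1]
    y = np.zeros((J, P), dtype=complex)
    for q in range(min(Q, P)):
        y[:, :P - q] += a[:, :, q].conj().T @ r[:, q:]
    return y

def soft_threshold(z, t):
    """Complex soft-thresholding: shrink magnitudes by t, keep phases."""
    mag = np.maximum(np.abs(z), 1e-12)
    return z * np.maximum(0.0, 1.0 - t / mag)

def lasso_cost(x, a, s, lam):
    """Mixture fitting cost plus l1 penalty on the source coefficients."""
    resid = x - ctf_forward(a, s)
    return 0.5 * np.sum(np.abs(resid) ** 2) + lam * np.sum(np.abs(s))

def lasso_ctf(x, a, lam, n_iter=200):
    """Estimate sources with ISTA (a generic solver, assumed here for illustration)."""
    J, Q = a.shape[1], a.shape[2]
    # Upper bound on the Lipschitz constant of the gradient of the data term.
    lip = sum(np.linalg.norm(a[:, :, q], 2) for q in range(Q)) ** 2
    step = 1.0 / lip
    s = np.zeros((J, x.shape[1]), dtype=complex)
    for _ in range(n_iter):
        grad = ctf_adjoint(a, ctf_forward(a, s) - x)
        s = soft_threshold(s - step * grad, step * lam)
    return s

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    I, J, Q, P = 2, 3, 4, 60  # under-determined: fewer channels than sources
    a = rng.standard_normal((I, J, Q)) + 1j * rng.standard_normal((I, J, Q))
    s_true = rng.standard_normal((J, P)) + 1j * rng.standard_normal((J, P))
    s_true[rng.random((J, P)) > 0.2] = 0.0  # synthetic sparse sources
    x = ctf_forward(a, s_true)
    s_hat = lasso_ctf(x, a, lam=0.05)
    print("cost at zero:", lasso_cost(x, a, np.zeros_like(s_hat), 0.05))
    print("cost at estimate:", lasso_cost(x, a, s_hat, 0.05))
```

In a full system this routine would be run independently for each frequency bin of the mixture STFT, and the regularization weight `lam` would need tuning; both points are assumptions of this sketch rather than details taken from the record.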

Details

Language :
English
Database :
OpenAIRE
Journal :
ICASSP 2017 - IEEE International Conference on Acoustics, Speech and Signal Processing, Mar 2017, New Orleans, United States. pp. 541-545, ⟨10.1109/ICASSP.2017.7952214⟩
Accession number :
edsair.doi.dedup.....8fc636053ec53087c6b1139173f1bd07
Full Text :
https://doi.org/10.1109/ICASSP.2017.7952214