Author: "Cédric Févotte" / Topic: non-negative matrix factorization - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Cédric Févotte"' showing total 39 results

Start Over Author "Cédric Févotte" Topic non-negative matrix factorization

39 results on '"Cédric Févotte"'

1. Bayesian mean-parameterized nonnegative binary matrix factorization

Author: Alberto Lumbreras, Cédric Févotte, Louis Filstroff, Criteo AI Lab, Criteo [Paris], Signal et Communications (IRIT-SC), Institut de recherche en informatique de Toulouse (IRIT), Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse - Jean Jaurès (UT2J)-Université Toulouse III - Paul Sabatier (UT3), Université Fédérale Toulouse Midi-Pyrénées-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées, Centre National de la Recherche Scientifique (CNRS), and ANR-19-P3IA-0004,ANITI,Artificial and Natural Intelligence Toulouse Institute(2019)
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Networks and Communications, Computer science, Posterior probability, Machine Learning (stat.ML), 02 engineering and technology, [STAT.OT]Statistics [stat]/Other Statistics [stat.ML], Data matrix (multivariate statistics), Machine Learning (cs.LG), Non-negative matrix factorization, symbols.namesake, [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG], Factorization, Statistics - Machine Learning, 020204 information systems, 0202 electrical engineering, electronic engineering, information engineering, Logical matrix, Missing data, Computer Science Applications, Binary data, symbols, 020201 artificial intelligence & image processing, Algorithm, Information Systems, Gibbs sampling
Abstract: International audience; Binary data matrices can represent many types of data such as social networks, votes, or gene expression. In some cases, the analysis of binary matrices can be tackled with nonneg-ative matrix factorization (NMF), where the observed data matrix is approximated by the product of two smaller nonnegative matrices. In this context, probabilistic NMF assumes a generative model where the data is usually Bernoulli-distributed. Often, a link function is used to map the factorization to the [0, 1] range, ensuring a valid Bernoulli mean parameter. However, link functions have the potential disadvantage to lead to uninterpretable models. Mean-parameterized NMF, on the contrary, overcomes this problem. We propose a unified framework for Bayesian mean-parameterized nonnegative binary matrix factorization models (NBMF). We analyze three models which correspond to three possible constraints that respect the mean-parameterization without the need for link functions. Furthermore, we derive a novel collapsed Gibbs sampler and a collapsed variational algorithm to infer the posterior distribution of the factors. Next, we extend the proposed models to a nonpara-metric setting where the number of used latent dimensions is automatically driven by the observed data. We analyze the performance of our NBMF methods in multiple datasets for different tasks such as dictionary learning and prediction of missing data. Experiments show that our methods provide similar or superior results than the state of the art, while automatically detecting the number of relevant components.
Published: 2020

2. A Comparative Study of Gamma Markov Chains for Temporal Non-Negative Factorization

Author: Olivier Gouvert, Cédric Févotte, Olivier Cappé, Louis Filstroff, Aalto University School of Science and Technology [Aalto, Finland], Signal et Communications (IRIT-SC), Institut de recherche en informatique de Toulouse (IRIT), Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse - Jean Jaurès (UT2J)-Université Toulouse III - Paul Sabatier (UT3), Université Fédérale Toulouse Midi-Pyrénées-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées, Montreal Institute for Learning Algorithms [Montréal] (MILA), Centre de Recherches Mathématiques [Montréal] (CRM), Université de Montréal (UdeM)-Université de Montréal (UdeM), Centre National de la Recherche Scientifique (CNRS), Value from Data (VALDA ), Département d'informatique - ENS Paris (DI-ENS), Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-École normale supérieure - Paris (ENS Paris), Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-École normale supérieure - Paris (ENS Paris), Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-Inria de Paris, Institut National de Recherche en Informatique et en Automatique (Inria), This work has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program under grant agreement N° 681839 (project FACTORY)., ANR-19-P3IA-0004,ANITI,Artificial and Natural Intelligence Toulouse Institute(2019), European Project: CoG-6681839,ERC FACTORY, Université Toulouse 1 Capitole (UT1)-Université Toulouse - Jean Jaurès (UT2J)-Université Toulouse III - Paul Sabatier (UT3), Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse 1 Capitole (UT1)-Université Toulouse - Jean Jaurès (UT2J)-Université Toulouse III - Paul Sabatier (UT3), Département d'informatique de l'École normale supérieure (DI-ENS), École normale supérieure - Paris (ENS Paris), Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS), Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-École normale supérieure - Paris (ENS Paris), Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Inria de Paris, Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université de Toulouse (UT)-Toulouse Mind & Brain Institut (TMBI), Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT), École normale supérieure - Paris (ENS-PSL), and Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-École normale supérieure - Paris (ENS-PSL)
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Computer science, Markov process, Inference, Machine Learning (stat.ML), 02 engineering and technology, Poisson distribution, Matrix decomposition, Non-negative matrix factorization, Machine Learning (cs.LG), symbols.namesake, [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing, [STAT.ML]Statistics [stat]/Machine Learning [stat.ML], Statistics - Machine Learning, Gamma Markov chains, 0202 electrical engineering, electronic engineering, information engineering, Gamma distribution, MAP estimation, Electrical and Electronic Engineering, Time series, Time series data, Markov chain, Probabilistic logic, 020206 networking & telecommunications, Exponential function, Signal Processing, symbols, Algorithm
Abstract: Non-negative matrix factorization (NMF) has become a well-established class of methods for the analysis of non-negative data. In particular, a lot of effort has been devoted to probabilistic NMF, namely estimation or inference tasks in probabilistic models describing the data, based for example on Poisson or exponential likelihoods. When dealing with time series data, several works have proposed to model the evolution of the activation coefficients as a non-negative Markov chain, most of the time in relation with the Gamma distribution, giving rise to so-called temporal NMF models. In this paper, we review four Gamma Markov chains of the NMF literature, and show that they all share the same drawback: the absence of a well-defined stationary distribution. We then introduce a fifth process, an overlooked model of the time series literature named BGAR(1), which overcomes this limitation. These temporal NMF models are then compared in a MAP framework on a prediction task, in the context of the Poisson likelihood., Comment: Code available at https://github.com/lfilstro/TemporalNMF
Published: 2021

3. Positive Semidefinite Matrix Factorization: A Connection with Phase Retrieval and Affine Rank Minimization

Author: Yanbin Lang, Dana Lahat, Vincent Y. F. Tan, Cédric Févotte, Signal et Communications (IRIT-SC), Institut de recherche en informatique de Toulouse (IRIT), Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse - Jean Jaurès (UT2J)-Université Toulouse III - Paul Sabatier (UT3), Université Fédérale Toulouse Midi-Pyrénées-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées, National University of Singapore (NUS), and ANR-19-P3IA-0004,ANITI,Artificial and Natural Intelligence Toulouse Institute(2019)
Subjects: Signal Processing (eess.SP), FOS: Computer and information sciences, Optimization, Computer Science - Machine Learning, Optimization problem, Computer science, Diagonal, Machine Learning (stat.ML), 02 engineering and technology, Positive-definite matrix, Tensors, Statistics - Computation, Machine Learning (cs.LG), Non-negative matrix factorization, Symmetric matrices, Matrix (mathematics), Factorization, [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing, [STAT.ML]Statistics [stat]/Machine Learning [stat.ML], Statistics - Machine Learning, Linear programming, FOS: Electrical engineering, electronic engineering, information engineering, 0202 electrical engineering, electronic engineering, information engineering, Symmetric matrix, Nonnegative matrix, Electrical Engineering and Systems Science - Signal Processing, Electrical and Electronic Engineering, Computation (stat.CO), Prediction algorithms, Approximation algorithm, 020206 networking & telecommunications, Approximation algorithms, Signal Processing, Combinatorial optimization, Affine transformation, Phase retrieval, Signal processing algorithms, Algorithm
Abstract: Positive semidefinite matrix factorization (PSDMF) expresses each entry of a nonnegative matrix as the inner product of two positive semidefinite (psd) matrices. When all these psd matrices are constrained to be diagonal, this model is equivalent to nonnegative matrix factorization. Applications include combinatorial optimization, quantum-based statistical models, and recommender systems, among others. However, despite the increasing interest in PSDMF, only a few PSDMF algorithms were proposed in the literature. In this work, we provide a collection of tools for PSDMF, by showing that PSDMF algorithms can be designed based on phase retrieval (PR) and affine rank minimization (ARM) algorithms. This procedure allows a shortcut in designing new PSDMF algorithms, as it allows to leverage some of the useful numerical properties of existing PR and ARM methods to the PSDMF framework. Motivated by this idea, we introduce a new family of PSDMF algorithms based on iterative hard thresholding (IHT). This family subsumes previously-proposed projected gradient PSDMF methods. We show that there is high variability among PSDMF optimization problems that makes it beneficial to try a number of methods based on different principles to tackle difficult problems. In certain cases, our proposed methods are the only algorithms able to find a solution. In certain other cases, they converge faster. Our results support our claim that the PSDMF framework can inherit desired numerical properties from PR and ARM algorithms, leading to more efficient PSDMF algorithms, and motivate further study of the links between these models., Comment: 18 pages (16 paper + 2 supplementary material), 9 figures, accepted for publication in the IEEE Transactions on Signal Processing. This is a revised version: there is a new additional PSDMF algorithm based on CGIHT, more numerical experiments, and some background material moved to Supplementary Material (pages 17 and 18 in this document). Supplementary Material also contains some extra figures
Published: 2021

4. On the Identifiability of Transform Learning for Non-negative Matrix Factorization

Author: Cédric Févotte, Emmanuel Soubies, Sixin Zhang, Signal et Communications (IRIT-SC), Institut de recherche en informatique de Toulouse (IRIT), Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse - Jean Jaurès (UT2J)-Université Toulouse III - Paul Sabatier (UT3), Université Fédérale Toulouse Midi-Pyrénées-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées, Centre National de la Recherche Scientifique (CNRS), ANR-19-P3IA-0004,ANITI,Artificial and Natural Intelligence Toulouse Institute(2019), Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université de Toulouse (UT)-Toulouse Mind & Brain Institut (TMBI), Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT), European Project: 6681839, Université Toulouse 1 Capitole (UT1)-Université Toulouse - Jean Jaurès (UT2J)-Université Toulouse III - Paul Sabatier (UT3), Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), and Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse 1 Capitole (UT1)-Université Toulouse - Jean Jaurès (UT2J)-Université Toulouse III - Paul Sabatier (UT3)
Subjects: Computer Science::Machine Learning, Linear programming, Computer science, Gaussian, Statistical estimation, 02 engineering and technology, Non-negative matrix factorization, Matrix decomposition, symbols.namesake, Joint-diagonalization, [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing, 0202 electrical engineering, electronic engineering, information engineering, Source separation, Discrete cosine transform, joint diagonalization, [INFO]Computer Science [cs], NMF, Index Terms-NMF, Electrical and Electronic Engineering, Applied Mathematics, 020206 networking & telecommunications, Nonnegative Matrix Factorization, Quasi-Newton method, identifiability, Computer Science::Numerical Analysis, Orthogonal basis, Nonconvex optimization, Signal Processing, source separation, symbols, Identifiability, transform learning, Likelihood function, Algorithm
Abstract: International audience; Non-negative matrix factorization with transform learning (TL-NMF) aims at estimating a short-time orthogonal transform that projects temporal data into a domain that is more amenable to NMF than off-the-shelf time-frequency transforms. In this work, we study the identifiability of TL-NMF under the Gaussian composite model. We prove that one can uniquely identify row-spaces of the orthogonal transform by optimizing the likelihood function of the model. This result is illustrated on a toy source separation problem which demonstrates the ability of TL-NMF to learn a suitable orthogonal basis.
Published: 2020

5. A Ranking Model Motivated by Nonnegative Matrix Factorization with Applications to Tennis Tournaments

Author: Rui Xia, Louis Filstroff, Vincent Y. F. Tan, Cédric Févotte, Févotte, Cédric, Department of Electrical Engineering and Computer Science (EECS), Massachusetts Institute of Technology (MIT), Signal et Communications (IRIT-SC), Institut de recherche en informatique de Toulouse (IRIT), Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse - Jean Jaurès (UT2J)-Université Toulouse III - Paul Sabatier (UT3), Université Fédérale Toulouse Midi-Pyrénées-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse 1 Capitole (UT1), and Université Fédérale Toulouse Midi-Pyrénées
Subjects: Signal Processing (eess.SP), FOS: Computer and information sciences, Computer Science - Machine Learning, Theoretical computer science, Computer science, Machine Learning (stat.ML), Low-rank approximation, 02 engineering and technology, Latent variable, Machine Learning (cs.LG), Non-negative matrix factorization, Nonnegative matrix factorization, [STAT.ML]Statistics [stat]/Machine Learning [stat.ML], Statistics - Machine Learning, FOS: Electrical engineering, electronic engineering, information engineering, 0202 electrical engineering, electronic engineering, information engineering, Majorization-minimization, Electrical Engineering and Systems Science - Signal Processing, BTL ranking model, 020206 networking & telecommunications, Statistical model, Sports analytics, [STAT.ML] Statistics [stat]/Machine Learning [stat.ML], Probability model, Ranking, Key (cryptography), 020201 artificial intelligence & image processing, Majorization minimization
Abstract: We propose a novel ranking model that combines the Bradley-Terry-Luce probability model with a nonnegative matrix factorization framework to model and uncover the presence of latent variables that influence the performance of top tennis players. We derive an efficient, provably convergent, and numerically stable majorization-minimization-based algorithm to maximize the likelihood of datasets under the proposed statistical model. The model is tested on datasets involving the outcomes of matches between 20 top male and female tennis players over 14 major tournaments for men (including the Grand Slams and the ATP Masters 1000) and 16 major tournaments for women over the past 10 years. Our model automatically infers that the surface of the court (e.g., clay or hard court) is a key determinant of the performances of male players, but less so for females. Top players on various surfaces over this longitudinal period are also identified in an objective manner., Comment: 16 pages, 2 figures, 9 tables. Accepted and to be presented at the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML/PKDD) 2019. Supplementary material, code and datasets can be found in this URL https://github.com/XiaRui1996/btl-nmf
Published: 2020

6. A Quasi-Newton Algorithm on the Orthogonal Manifold for NMF with Transform Learning

Author: Cédric Févotte, Dylan Fagot, Pierre Ablin, Herwig Wendt, Alexandre Gramfort, Modelling brain structure, function and variability based on high-field MRI data (PARIETAL), Inria Saclay - Ile de France, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Service NEUROSPIN (NEUROSPIN), Direction de Recherche Fondamentale (CEA) (DRF (CEA)), Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Université Paris-Saclay-Direction de Recherche Fondamentale (CEA) (DRF (CEA)), Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Université Paris-Saclay, Signal et Communications (IRIT-SC), Institut de recherche en informatique de Toulouse (IRIT), Université Toulouse 1 Capitole (UT1)-Université Toulouse - Jean Jaurès (UT2J)-Université Toulouse III - Paul Sabatier (UT3), Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse 1 Capitole (UT1)-Université Toulouse - Jean Jaurès (UT2J)-Université Toulouse III - Paul Sabatier (UT3), Université Fédérale Toulouse Midi-Pyrénées, CoMputational imagINg anD viSion (IRIT-MINDS), Joseph Louis LAGRANGE (LAGRANGE), Université Nice Sophia Antipolis (... - 2019) (UNS), COMUE Université Côte d'Azur (2015 - 2019) (COMUE UCA)-COMUE Université Côte d'Azur (2015 - 2019) (COMUE UCA)-Observatoire de la Côte d'Azur, Université Côte d'Azur (UCA)-COMUE Université Côte d'Azur (2015 - 2019) (COMUE UCA)-Université Côte d'Azur (UCA)-Centre National de la Recherche Scientifique (CNRS), A Supported by the Center for Data Science, funded by the IDEX Paris-Saclay, ANR-11-IDEX-0003-02, and the European Research Council (ERCSLAB-StG-676943).‡Supported by the European Research Council (ERCFACTORY-CoG6681839)., ANR-11-IDEX-0003-02/11-IDEX-0003,IPS,IPS(2011), European Project: 676943,H2020,SLAB(2016), Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse - Jean Jaurès (UT2J)-Université Toulouse III - Paul Sabatier (UT3), Université Fédérale Toulouse Midi-Pyrénées-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse 1 Capitole (UT1), Centre National de la Recherche Scientifique (CNRS), Institut National de Recherche en Informatique et en Automatique (Inria), ANR-11-IDEX-0003,IPS,Idex Paris-Saclay(2011), European Project: 681839,H2020,ERC-2015-CoG,FACTORY(2016), ANR: 11-IDEX-0003,IPS,Idex Paris-Saclay(2011), Service NEUROSPIN (NEUROSPIN), Université Paris-Saclay-Direction de Recherche Fondamentale (CEA) (DRF (CEA)), Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Inria Saclay - Ile de France, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria), Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université de Toulouse (UT)-Toulouse Mind & Brain Institut (TMBI), Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Université Toulouse Capitole (UT Capitole), and Université de Toulouse (UT)
Subjects: FOS: Computer and information sciences, Hessian matrix, Computer Science - Machine Learning, Optimization problem, Computer science, Machine Learning (stat.ML), 02 engineering and technology, [STAT.OT]Statistics [stat]/Other Statistics [stat.ML], computer.software_genre, Machine Learning (cs.LG), Matrix decomposition, Non-negative matrix factorization, symbols.namesake, [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing, Statistics - Machine Learning, Non-convex optimization, 0202 electrical engineering, electronic engineering, information engineering, Source separation, Trigonometric functions, Orthogonal matrix, Audio signal processing, Manifolds, Transform learning, ComputingMilieux_MISCELLANEOUS, Approximation algorithm, 020206 networking & telecommunications, Fourier transform, ComputingMethodologies_PATTERNRECOGNITION, symbols, 020201 artificial intelligence & image processing, computer, Algorithm, Nonnegative matrix factorization (NMF)
Abstract: International audience; Nonnegative matrix factorization (NMF) is a popular method for audio spectral unmixing. While NMF is traditionally applied to off-the-shelf time-frequency representations based on the short-time Fourier or Cosine transforms, the ability to learn transforms from raw data attracts increasing attention. However, this adds an important computational overhead. When assumed orthogonal (like the Fourier or Cosine transforms), learning the transform yields a non-convex optimization problem on the orthogonal matrix manifold. In this paper, we derive a quasi-Newton method on the manifold using sparse approximations of the Hessian. Experiments on synthetic and real audio data show that the proposed algorithm out-performs state-of-the-art first-order and coordinate-descent methods by orders of magnitude. A Python package for fast TL-NMF is released online at https://github.com/pierreablin/tlnmf.
Published: 2019

7. Majorization-minimization algorithms for convolutive NMF with the beta-divergence

Author: Paris Smaragdis, Dylan Fagot, Herwig Wendt, Cédric Févotte, Centre National de la Recherche Scientifique - CNRS (FRANCE), Institut National Polytechnique de Toulouse - Toulouse INP (FRANCE), Université Toulouse III - Paul Sabatier - UT3 (FRANCE), Université Toulouse - Jean Jaurès - UT2J (FRANCE), Université Toulouse 1 Capitole - UT1 (FRANCE), University of Illinois at Urbana-Champaign - UIUC (USA), Signal et Communications (IRIT-SC), Institut de recherche en informatique de Toulouse (IRIT), Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse - Jean Jaurès (UT2J)-Université Toulouse III - Paul Sabatier (UT3), Université Fédérale Toulouse Midi-Pyrénées-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées, CoMputational imagINg anD viSion (IRIT-MINDS), Centre National de la Recherche Scientifique (CNRS), University of Illinois at Urbana-Champaign [Urbana], University of Illinois System, Université Toulouse 1 Capitole (UT1)-Université Toulouse - Jean Jaurès (UT2J)-Université Toulouse III - Paul Sabatier (UT3), Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse 1 Capitole (UT1)-Université Toulouse - Jean Jaurès (UT2J)-Université Toulouse III - Paul Sabatier (UT3), Institut National Polytechnique de Toulouse - INPT (FRANCE), Adobe Systems Inc. (Adobe Advanced Technology Labs), and Adobe Systems Inc.
Subjects: Heuristic (computer science), BETA (programming language), Computer science, 020206 networking & telecommunications, Majorization-minimization(MM), 02 engineering and technology, Nonnegative matrix factorization(NMF), Divergence, Non-negative matrix factorization, [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing, 0202 electrical engineering, electronic engineering, information engineering, Spectrogram, Traitement du signal et de l'image, 020201 artificial intelligence & image processing, Divergence (statistics), Algorithm, computer, Majorization minimization, computer.programming_language
Abstract: International audience; Nonnegative matrix factorization (NMF) has become a method of choice for spectrogram decomposition. However, its inability to capture dependencies across columns of the input motivated the introduction of a variant, convolutive NMF. While algorithms for solving the convolutive NMF problem were previously proposed, they rely on the use of a heuristic that does not insure the convergence of the algorithm (in particular in terms of objective function values). The goal of this work is to propose rigorous update rules, based on a majorization-minimization (MM) approach, for convolutive NMF with the β-divergence (a standard family of measures of fit). Specifically , we derive and study two variants of a convolutive NMF algorithm that are guaranteed to decrease the objective function value at each iteration. The complexity of the algorithms is studied, and the performance in terms of execution time and objective function are evaluated and compared in several numerical experiments using real-world audio data. Experiments show that the proposed MM algorithms consistently provide lower values of the objective function than the heuristic, at similar computational cost.
Published: 2019

8. Factor analysis of dynamic PET images: beyond Gaussian noise

Author: Nicolas Dobigeon, Clovis Tauber, Cédric Févotte, Simon Stute, Maria-Joao Ribeiro, Yanna Cruz Cavalcanti, Thomas Oberlin, Ecole Nationale Supérieure d'Electrotechnique, d'Electronique, d'Informatique, d'Hydraulique et de Télécommunications (ENSEEIHT), Institut National Polytechnique (Toulouse) (Toulouse INP), Université de Toulouse (UT)-Université de Toulouse (UT), Signal et Communications (IRIT-SC), Institut de recherche en informatique de Toulouse (IRIT), Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université de Toulouse (UT)-Toulouse Mind & Brain Institut (TMBI), Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT), Centre National de la Recherche Scientifique (CNRS), Imagerie Moléculaire in Vivo (IMIV - U1023 - ERL9218), Service Hospitalier Frédéric Joliot (SHFJ), Université Paris-Saclay-Direction de Recherche Fondamentale (CEA) (DRF (CEA)), Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Université Paris-Saclay-Direction de Recherche Fondamentale (CEA) (DRF (CEA)), Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Institut National de la Santé et de la Recherche Médicale (INSERM)-Centre National de la Recherche Scientifique (CNRS), Imagerie et cerveau (iBrain - Inserm U1253 - UNIV Tours ), Université de Tours (UT)-Institut National de la Santé et de la Recherche Médicale (INSERM), Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées, Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse - Jean Jaurès (UT2J)-Université Toulouse III - Paul Sabatier (UT3), Université Fédérale Toulouse Midi-Pyrénées-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées, Direction de Recherche Fondamentale (CEA) (DRF (CEA)), Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Université Paris-Saclay-Institut National de la Santé et de la Recherche Médicale (INSERM)-Centre National de la Recherche Scientifique (CNRS), Commissariat à l'Energie Atomique et aux énergies alternatives - CEA (FRANCE), Centre National de la Recherche Scientifique - CNRS (FRANCE), Institut National Polytechnique de Toulouse - Toulouse INP (FRANCE), Institut National de la Santé et de la Recherche Médicale - INSERM (FRANCE), Université Paris-Saclay (FRANCE), Université Toulouse III - Paul Sabatier - UT3 (FRANCE), Université Toulouse - Jean Jaurès - UT2J (FRANCE), Université Toulouse 1 Capitole - UT1 (FRANCE), Université de Tours (FRANCE), Université Paris-Sud 11 (FRANCE), Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Université Paris-Saclay-Centre National de la Recherche Scientifique (CNRS)-Institut National de la Santé et de la Recherche Médicale (INSERM), Université de Tours-Institut National de la Santé et de la Recherche Médicale (INSERM), Institut National Polytechnique de Toulouse - INPT (FRANCE), Févotte, Cédric, Université Fédérale Toulouse Midi-Pyrénées-Toulouse Mind & Brain Institut (TMBI), Université Toulouse - Jean Jaurès (UT2J)-Université Toulouse III - Paul Sabatier (UT3), Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse III - Paul Sabatier (UT3), and Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse Capitole (UT Capitole)
Subjects: FOS: Computer and information sciences, Computer science, Computer Vision and Pattern Recognition (cs.CV), Gaussian, Normal Distribution, Computer Science - Computer Vision and Pattern Recognition, 030218 nuclear medicine & medical imaging, Nonnegative matrix factorization, Traitement des images, 0302 clinical medicine, [STAT.ML]Statistics [stat]/Machine Learning [stat.ML], Statistics - Machine Learning, Image Processing, Computer-Assisted, Poisson noise, Radiological and Ultrasound Technology, Phantoms, Imaging, Image and Video Processing (eess.IV), Brain, Computer Science Applications, [INFO.INFO-TI] Computer Science [cs]/Image Processing [eess.IV], [INFO.INFO-TI]Computer Science [cs]/Image Processing [eess.IV], symbols, β-divergence, Factor analysis, Algorithm, Algorithms, FOS: Physical sciences, Machine Learning (stat.ML), Image processing, Dynamic PET, Iterative reconstruction, Non-negative matrix factorization, 03 medical and health sciences, symbols.namesake, FOS: Electrical engineering, electronic engineering, information engineering, Humans, NMF, Electrical and Electronic Engineering, Divergence (statistics), Shot noise, Unmixing, Electrical Engineering and Systems Science - Image and Video Processing, [STAT.ML] Statistics [stat]/Machine Learning [stat.ML], Noise, Gaussian noise, Positron-Emission Tomography, Physics - Data Analysis, Statistics and Probability, Factor Analysis, Statistical, Data Analysis, Statistics and Probability (physics.data-an), Software
Abstract: Factor analysis has proven to be a relevant tool for extracting tissue time-activity curves (TACs) in dynamic PET images, since it allows for an unsupervised analysis of the data. Reliable and interpretable results are possible only if considered with respect to suitable noise statistics. However, the noise in reconstructed dynamic PET images is very difficult to characterize, despite the Poissonian nature of the count-rates. Rather than explicitly modeling the noise distribution, this work proposes to study the relevance of several divergence measures to be used within a factor analysis framework. To this end, the $\beta$-divergence, widely used in other applicative domains, is considered to design the data-fitting term involved in three different factor models. The performances of the resulting algorithms are evaluated for different values of $\beta$, in a range covering Gaussian, Poissonian and Gamma-distributed noises. The results obtained on two different types of synthetic images and one real image show the interest of applying non-standard values of $\beta$ to improve factor analysis., Comment: This manuscript has been accepted for publication in IEEE Trans. Medical Imaging
Published: 2019

9. Unmixing dynamic PET images: Combining spatial heterogeneity and non-gaussian noise

Author: Thomas Oberlin, Yanna Cruz Cavalcant, Clovis Taube, Cédric Févotte, Nicolas Dobigeo, Simon Stute, Signal et Communications (IRIT-SC), Institut de recherche en informatique de Toulouse (IRIT), Université Toulouse 1 Capitole (UT1)-Université Toulouse - Jean Jaurès (UT2J)-Université Toulouse III - Paul Sabatier (UT3), Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse 1 Capitole (UT1)-Université Toulouse - Jean Jaurès (UT2J)-Université Toulouse III - Paul Sabatier (UT3), Université Fédérale Toulouse Midi-Pyrénées, Imagerie Moléculaire in Vivo (IMIV - U1023 - ERL9218), Service Hospitalier Frédéric Joliot (SHFJ), Direction de Recherche Fondamentale (CEA) (DRF (CEA)), Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Université Paris-Saclay-Direction de Recherche Fondamentale (CEA) (DRF (CEA)), Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Université Paris-Saclay-Institut National de la Santé et de la Recherche Médicale (INSERM)-Centre National de la Recherche Scientifique (CNRS), Imagerie et cerveau, Université de Tours-Institut National de la Santé et de la Recherche Médicale (INSERM), Commissariat à l'Energie Atomique et aux énergies alternatives - CEA (FRANCE), Centre National de la Recherche Scientifique - CNRS (FRANCE), Institut National Polytechnique de Toulouse - Toulouse INP (FRANCE), Institut National de la Santé et de la Recherche Médicale - INSERM (FRANCE), Université Paris-Saclay (FRANCE), Université Toulouse III - Paul Sabatier - UT3 (FRANCE), Université Toulouse - Jean Jaurès - UT2J (FRANCE), Université Toulouse 1 Capitole - UT1 (FRANCE), Université de Tours (FRANCE), Université Paris-Sud 11 (FRANCE), Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université de Toulouse (UT)-Toulouse Mind & Brain Institut (TMBI), Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT), Institut National Polytechnique (Toulouse) (Toulouse INP), Centre National de la Recherche Scientifique (CNRS), Université Paris-Saclay-Direction de Recherche Fondamentale (CEA) (DRF (CEA)), Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA), Imagerie et cerveau (iBrain - Inserm U1253 - UNIV Tours ), Université de Tours (UT)-Institut National de la Santé et de la Recherche Médicale (INSERM), Institut National Polytechnique de Toulouse - INPT (FRANCE), Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Université Paris-Saclay, Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse - Jean Jaurès (UT2J)-Université Toulouse III - Paul Sabatier (UT3), Université Fédérale Toulouse Midi-Pyrénées-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), and Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse 1 Capitole (UT1)
Subjects: Computer science, Gaussian, factor analysis, 02 engineering and technology, Poisson distribution, Blind signal separation, 030218 nuclear medicine & medical imaging, Non-negative matrix factorization, Nonnegative matrix factorization, 03 medical and health sciences, symbols.namesake, Traitement des images, 0302 clinical medicine, [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing, [STAT.ML]Statistics [stat]/Machine Learning [stat.ML], blind source separation, 0202 electrical engineering, electronic engineering, information engineering, Divergence (statistics), Dynamic PET imaging, Shot noise, Index Terms-Dynamic PET imaging, nonnegative matrix factorization, Noise, Gaussian noise, [INFO.INFO-TI]Computer Science [cs]/Image Processing [eess.IV], symbols, Blind source separation, β-divergence, 020201 artificial intelligence & image processing, Factor analysis, Algorithm
Abstract: International audience; An important task when processing dynamic PET images is to identify the time-activity curves (TACs) of the pure tissues, along with their corresponding spatial proportions. This step, often referred to as unmixing or factor analysis, is based on a loss function which measures the discrepancy between the observed data and the model. This loss function should be chosen according to the statistical properties of the noise, which is in this case hard to characterize. Indeed, while dynamic PET images results from a decay process that can be statistically described by a Poisson distribution, acquisition and post-filtering reconstruction drastically change the nature of the noise. In the literature dedicated to factor analysis of dynamic PET images, a common and underlying assumption consists in assuming that the dynamic PET images are corrupted by an additive Gaussian or by a Poisson noise. These assumptions lead to the choice of the squared Euclidian distance and the Kullback-Leibler divergence. We propose here to consider the β-divergence, which is able to encompass a wide family of divergence measures corresponding to various noise distributions. This loss function is incorporated into three different factor models and evaluated using four sets of synthetic data.
Published: 2019

10. Estimation with Low-Rank Time-Frequency Synthesis Models

Author: Matthieu Kowalski, Cédric Févotte, Signal et Communications (IRIT-SC), Institut de recherche en informatique de Toulouse (IRIT), Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse - Jean Jaurès (UT2J)-Université Toulouse III - Paul Sabatier (UT3), Université Fédérale Toulouse Midi-Pyrénées-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées, Centre National de la Recherche Scientifique (CNRS), Laboratoire des signaux et systèmes (L2S), Université Paris-Sud - Paris 11 (UP11)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS), European Project: 681839,H2020,ERC-2015-CoG,FACTORY(2016), Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université de Toulouse (UT)-Toulouse Mind & Brain Institut (TMBI), Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Université Toulouse Capitole (UT Capitole), and Université de Toulouse (UT)
Subjects: Signal Processing (eess.SP), FOS: Computer and information sciences, Computer Science - Machine Learning, Rank (linear algebra), Computer science, 02 engineering and technology, computer.software_genre, Non-negative matrix factorization, Machine Learning (cs.LG), Factorization, [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing, Audio and Speech Processing (eess.AS), 0202 electrical engineering, electronic engineering, information engineering, FOS: Electrical engineering, electronic engineering, information engineering, Electrical and Electronic Engineering, Electrical Engineering and Systems Science - Signal Processing, Audio signal processing, Shrinkage, 020206 networking & telecommunications, Time–frequency analysis, Generative model, Compressed sensing, Signal Processing, Spectrogram, 020201 artificial intelligence & image processing, Algorithm, computer, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: International audience; Many state-of-the art signal decomposition techniques rely on a low-rank factorization of a time-frequency (t-f) transform. In particular, nonnegative matrix factorization (NMF) of the spectrogram has been considered in many audio applications. This is an analysis approach in the sense that the factorization is applied to the squared magnitude of the analysis coefficients returned by the t-f transform. In this paper we instead propose a synthesis approach, where low-rankness is imposed to the synthesis coefficients of the data signal over a given t-f dictionary (such as a Gabor frame). As such we offer a novel modeling paradigm that bridges t-f synthesis modeling and traditional analysis-based NMF approaches. The proposed generative model allows in turn to design more sophisticated multi-layer representations that can efficiently capture diverse forms of structure. Additionally, the generative modeling allows to exploit t-f low-rankness for compressive sensing. We present efficient iterative shrinkage algorithms to perform estimation in the proposed models and illustrate the capabilities of the new modeling paradigm over audio signal processing examples.
Published: 2018

11. Single-channel audio source separation with NMF: divergences, constraints and algorithms

Author: Cédric Févotte, Emmanuel Vincent, Alexey Ozerov, Institut de recherche en informatique de Toulouse ( IRIT ), Institut National Polytechnique [Toulouse] ( INP ) -Université Toulouse 1 Capitole ( UT1 ) -Université Toulouse - Jean Jaurès ( UT2J ) -Université Paul Sabatier - Toulouse 3 ( UPS ) -Centre National de la Recherche Scientifique ( CNRS ), Speech Modeling for Facilitating Oral-Based Communication ( MULTISPEECH ), Inria Nancy - Grand Est, Institut National de Recherche en Informatique et en Automatique ( Inria ) -Institut National de Recherche en Informatique et en Automatique ( Inria ) -Department of Natural Language Processing & Knowledge Discovery ( LORIA - NLPKD ), Laboratoire Lorrain de Recherche en Informatique et ses Applications ( LORIA ), Institut National de Recherche en Informatique et en Automatique ( Inria ) -Université de Lorraine ( UL ) -Centre National de la Recherche Scientifique ( CNRS ) -Institut National de Recherche en Informatique et en Automatique ( Inria ) -Université de Lorraine ( UL ) -Centre National de la Recherche Scientifique ( CNRS ) -Laboratoire Lorrain de Recherche en Informatique et ses Applications ( LORIA ), Institut National de Recherche en Informatique et en Automatique ( Inria ) -Université de Lorraine ( UL ) -Centre National de la Recherche Scientifique ( CNRS ) -Université de Lorraine ( UL ) -Centre National de la Recherche Scientifique ( CNRS ), Technicolor R & I [Cesson Sévigné], Technicolor, Signal et Communications (IRIT-SC), Institut de recherche en informatique de Toulouse (IRIT), Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université de Toulouse (UT)-Toulouse Mind & Brain Institut (TMBI), Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT), Centre National de la Recherche Scientifique (CNRS), Speech Modeling for Facilitating Oral-Based Communication (MULTISPEECH), Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Department of Natural Language Processing & Knowledge Discovery (LORIA - NLPKD), Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS), Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse - Jean Jaurès (UT2J)-Université Toulouse III - Paul Sabatier (UT3), Université Fédérale Toulouse Midi-Pyrénées-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées, Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), and Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)
Subjects: Computer Science::Machine Learning, [ INFO.INFO-TS ] Computer Science [cs]/Signal and Image Processing, Channel (digital image), Computer science, 020206 networking & telecommunications, 02 engineering and technology, computer.software_genre, Measure (mathematics), Non-negative matrix factorization, Matrix decomposition, Statistics::Machine Learning, [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing, Factorization, Computer Science::Sound, 0202 electrical engineering, electronic engineering, information engineering, Source separation, 020201 artificial intelligence & image processing, Audio signal processing, Divergence (statistics), computer, Algorithm
Abstract: International audience; Spectral decomposition by nonnegative matrix factorisation (NMF) has become state-of-the-art practice in many audio signal processing tasks, such as source separation, enhancement or transcription. This chapter reviews the fundamentals of NMF-based audio decomposition, in unsupervised and informed settings. We formulate NMF as an optimisation problem and discuss the choice of the measure of fit. We present the standard majorisation-minimisation strategy to address optimisation for NMF with common beta-divergence, a family of measures of fit that takes the quadratic cost, the generalised Kullback-Leibler divergence and the Itakura-Saito divergence as special cases. We discuss the reconstruction of time-domain components from the spectral factorisation and present common variants of NMF-based spectral decomposition: supervised and informed settings, regularised versions, temporal models.
Published: 2018

12. An introduction to multichannel NMF for audio source separation

Author: Cédric Févotte, Emmanuel Vincent, Alexey Ozerov, Technicolor R & I [Cesson Sévigné], Technicolor, Institut de recherche en informatique de Toulouse ( IRIT ), Institut National Polytechnique [Toulouse] ( INP ) -Université Toulouse 1 Capitole ( UT1 ) -Université Toulouse - Jean Jaurès ( UT2J ) -Université Paul Sabatier - Toulouse 3 ( UPS ) -Centre National de la Recherche Scientifique ( CNRS ), Speech Modeling for Facilitating Oral-Based Communication ( MULTISPEECH ), Inria Nancy - Grand Est, Institut National de Recherche en Informatique et en Automatique ( Inria ) -Institut National de Recherche en Informatique et en Automatique ( Inria ) -Department of Natural Language Processing & Knowledge Discovery ( LORIA - NLPKD ), Laboratoire Lorrain de Recherche en Informatique et ses Applications ( LORIA ), Institut National de Recherche en Informatique et en Automatique ( Inria ) -Université de Lorraine ( UL ) -Centre National de la Recherche Scientifique ( CNRS ) -Institut National de Recherche en Informatique et en Automatique ( Inria ) -Université de Lorraine ( UL ) -Centre National de la Recherche Scientifique ( CNRS ) -Laboratoire Lorrain de Recherche en Informatique et ses Applications ( LORIA ), Institut National de Recherche en Informatique et en Automatique ( Inria ) -Université de Lorraine ( UL ) -Centre National de la Recherche Scientifique ( CNRS ) -Université de Lorraine ( UL ) -Centre National de la Recherche Scientifique ( CNRS ), Signal et Communications (IRIT-SC), Institut de recherche en informatique de Toulouse (IRIT), Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse - Jean Jaurès (UT2J)-Université Toulouse III - Paul Sabatier (UT3), Université Fédérale Toulouse Midi-Pyrénées-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées, Centre National de la Recherche Scientifique (CNRS), Speech Modeling for Facilitating Oral-Based Communication (MULTISPEECH), Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Department of Natural Language Processing & Knowledge Discovery (LORIA - NLPKD), Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS), Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université de Toulouse (UT)-Toulouse Mind & Brain Institut (TMBI), Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Université Toulouse Capitole (UT Capitole), and Université de Toulouse (UT)
Subjects: [ INFO.INFO-TS ] Computer Science [cs]/Signal and Image Processing, Computer science, Gaussian, Principal (computer security), 020206 networking & telecommunications, 02 engineering and technology, Non-negative matrix factorization, symbols.namesake, [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing, Computer Science::Sound, 0202 electrical engineering, electronic engineering, information engineering, symbols, Source separation, 020201 artificial intelligence & image processing, Joint (audio engineering), Algorithm
Abstract: International audience; This chapter introduces multichannel nonnegative matrix factorization (NMF) methods for audio source separation. All the methods and some of their extensions are introduced within a more general local Gaussian modeling (LGM) framework. These methods are very attractive since allow combining spatial and spectral cues in a joint and principal way, but also are natural extensions and generalizations of many single-channel NMF-based methods to the multichannel case. The chapter introduces the spectral (NMF-based) and spatial models, as well as the way to combine them within the LGM framework. Model estimation criteria and algorithms are described as well, while going deeper into details of some of them.
Published: 2018

13. Jacobi Algorithm for Nonnegative Matrix Factorization with Transform Learning

Author: Herwig Wendt, Dylan Fagot, Cédric Févotte, Institut National Polytechnique de Toulouse - Toulouse INP (FRANCE), Centre National de la Recherche Scientifique - CNRS (FRANCE), Université Toulouse III - Paul Sabatier - UT3 (FRANCE), Université Toulouse - Jean Jaurès - UT2J (FRANCE), Université Toulouse 1 Capitole - UT1 (FRANCE), Institut National Polytechnique de Toulouse - INPT (FRANCE), CoMputational imagINg anD viSion (IRIT-MINDS), Institut de recherche en informatique de Toulouse (IRIT), Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse - Jean Jaurès (UT2J)-Université Toulouse III - Paul Sabatier (UT3), Université Fédérale Toulouse Midi-Pyrénées-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées, Centre National de la Recherche Scientifique (CNRS), and Signal et Communications (IRIT-SC)
Subjects: MathematicsofComputing_NUMERICALANALYSIS, Single-channel source separation, 020206 networking & telecommunications, 02 engineering and technology, Matrix decomposition, Non-negative matrix factorization, 03 medical and health sciences, symbols.namesake, Matrix (mathematics), 0302 clinical medicine, Jacobi eigenvalue algorithm, Fourier transform, ComputingMethodologies_PATTERNRECOGNITION, Factorization, [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing, 0202 electrical engineering, electronic engineering, information engineering, symbols, Discrete cosine transform, Traitement du signal et de l'image, Orthogonal matrix, Algorithm, Transform learning, 030217 neurology & neurosurgery, Nonnegative matrix factorization (NMF), Mathematics
Abstract: Nonnegative matrix factorization (NMF) is the state-of-the-art approach to unsupervised audio source separation. It relies on the factorization of a given short-time frequency transform into a dictionary of spectral patterns and an activation matrix. Recently, we introduced transform learning for NMF (TL-NMF), in which the short-time transform is learnt together with the nonnegative factors. We imposed the transform to be orthogonal likewise the usual Fourier or Cosine transform. TL-NMF yields an original non-convex optimization problem over the manifold of orthogonal matrices, for which we proposed a projected gradient descent algorithm in our previous work. In this contribution we describe a new Jacobi approach in which the orthogonal matrix is represented as a randomly chosen product of elementary Givens matrices. The new approach performs favorably as compared to the gradient approach, in particular in terms of robustness with respect to initialization, as illustrated with synthetic and audio decomposition experiments. Index Terms-Nonnegative matrix factorization (NMF), transform learning, single-channel source separation.
Published: 2018

14. Nonnegative Matrix Factorization with Transform Learning

Author: Cédric Févotte, Herwig Wendt, Dylan Fagot, Institut National Polytechnique de Toulouse - Toulouse INP (FRANCE), Centre National de la Recherche Scientifique - CNRS (FRANCE), Université Toulouse III - Paul Sabatier - UT3 (FRANCE), Université Toulouse - Jean Jaurès - UT2J (FRANCE), Université Toulouse 1 Capitole - UT1 (FRANCE), Signal et Communications (IRIT-SC), Institut de recherche en informatique de Toulouse (IRIT), Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse - Jean Jaurès (UT2J)-Université Toulouse III - Paul Sabatier (UT3), Université Fédérale Toulouse Midi-Pyrénées-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées, CoMputational imagINg anD viSion (IRIT-MINDS), Centre National de la Recherche Scientifique (CNRS), and Institut National Polytechnique de Toulouse - INPT (FRANCE)
Subjects: FOS: Computer and information sciences, Optimization problem, Computer science, 02 engineering and technology, computer.software_genre, Matrix decomposition, Non-negative matrix factorization, Machine Learning (cs.LG), [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing, Factorization, 0202 electrical engineering, electronic engineering, information engineering, Discrete cosine transform, Source separation, Traitement du signal et de l'image, Audio signal processing, Transform learning, Single-channel source separation, 020206 networking & telecommunications, Stationary point, Computer Science - Learning, Spectrogram, 020201 artificial intelligence & image processing, computer, Algorithm, Nonnegative matrix factorization (NMF)
Abstract: International audience; Traditional NMF-based signal decomposition relies on the factor-ization of spectral data, which is typically computed by means of short-time frequency transform. In this paper we propose to relax the choice of a pre-fixed transform and learn a short-time orthogonal transform together with the factorization. To this end, we formulate a regularized optimization problem reminiscent of conventional NMF, yet with the transform as additional unknown parameters, and design a novel block-descent algorithm enabling to find stationary points of this objective function. The proposed joint transform learning and factorization approach is tested for two audio signal processing ex-periments, illustrating its conceptual and practical benefits.
Published: 2017
Full Text: View/download PDF

15. Smooth Nonnegative Matrix Factorization for Unsupervised Audiovisual Document Structuring

Author: Cédric Févotte, Slim Essid, Département Images, Données, Signal (IDS), Télécom ParisTech, Signal, Statistique et Apprentissage (S2A), Laboratoire Traitement et Communication de l'Information (LTCI), Institut Mines-Télécom [Paris] (IMT)-Télécom Paris-Institut Mines-Télécom [Paris] (IMT)-Télécom Paris, and Institut Mines-Télécom [Paris] (IMT)-Télécom Paris
Subjects: Computer science, Speech recognition, 02 engineering and technology, computer.software_genre, Structuring, Matrix decomposition, Non-negative matrix factorization, [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing, Histogram, 0202 electrical engineering, electronic engineering, information engineering, Media Technology, Electrical and Electronic Engineering, Hidden Markov model, Audio signal processing, ComputingMilieux_MISCELLANEOUS, business.industry, 020206 networking & telecommunications, Pattern recognition, Computer Science Applications, Speaker diarisation, ComputingMethodologies_PATTERNRECOGNITION, Signal Processing, Unsupervised learning, 020201 artificial intelligence & image processing, Artificial intelligence, business, computer
Abstract: This paper introduces a new paradigm for unsupervised audiovisual document structuring. In this paradigm, a novel Nonnegative Matrix Factorization (NMF) algorithm is applied on histograms of counts (relating to a bag of features representation of the content) to jointly discover latent structuring patterns and their activations in time. Our NMF variant employs the Kullback-Leibler divergence as a cost function and imposes a temporal smoothness constraint to the activations. It is solved by a majorization-minimization technique. The approach proposed is meant to be generic and is particularly well suited to applications where the structuring patterns may overlap in time. As such, it is evaluated on two person-oriented video structuring tasks (one using the visual modality and the second the audio). This is done using a challenging database of political debate videos. Our results outperform reference results obtained by a method using Hidden Markov Models. Further, we show the potential that our general approach has for audio speaker diarization.
Published: 2013

16. Maximum Marginal Likelihood Estimation for Nonnegative Dictionary Learning in the Gamma-Poisson Model

Author: Onur Dikmen and Cédric Févotte
Subjects: ta113, sparse coding, business.industry, model order selection, nonnegative matrix factorization, Pattern recognition, Maximization, Overfitting, Poisson distribution, Automatic relevance determination, variational EM, Marginal likelihood, Non-negative matrix factorization, Matrix decomposition, Monte Carlo EM, symbols.namesake, ComputingMethodologies_PATTERNRECOGNITION, Signal Processing, symbols, Artificial intelligence, Pruning (decision trees), Electrical and Electronic Engineering, business, Divergence (statistics), Mathematics
Abstract: In this paper we describe an alternative to standard nonnegative matrix factorization (NMF) for nonnegative dictionary learning, i.e., the task of learning a dictionary with nonnegative values from nonnegative data, under the assumption of nonnegative expansion coefficients. A popular cost function used for NMF is the Kullback-Leibler divergence, which underlies a Poisson observation model. NMF can thus be considered as maximization of the joint likelihood of the dictionary and the expansion coefficients. This approach lacks optimality because the number of parameters (which include the expansion coefficients) grows with the number of observations. In this paper we describe variational Bayes and Monte-Carlo EM algorithms for optimization of the marginal likelihood, i.e., the likelihood of the dictionary where the expansion coefficients have been integrated out (given a Gamma prior). We compare the output of both maximum joint likelihood estimation (i.e., standard NMF) and maximum marginal likelihood estimation (MMLE) on real and synthetical datasets. In particular we present face reconstruction results on CBCL dataset and text retrieval results over the musiXmatch dataset, a collection of word counts in song lyrics. The MMLE approach is shown to prevent overfitting by automatically pruning out irrelevant dictionary columns, i.e., embedding automatic model order selection.
Published: 2012

17. Algorithms for Nonnegative Matrix Factorization with the β-Divergence

Author: Cédric Févotte, Jérôme Idier, Laboratoire Traitement et Communication de l'Information (LTCI), Télécom ParisTech-Institut Mines-Télécom [Paris] (IMT)-Centre National de la Recherche Scientifique (CNRS), Institut de Recherche en Communications et en Cybernétique de Nantes (IRCCyN), Mines Nantes (Mines Nantes)-École Centrale de Nantes (ECN)-Ecole Polytechnique de l'Université de Nantes (EPUN), Université de Nantes (UN)-Université de Nantes (UN)-PRES Université Nantes Angers Le Mans (UNAM)-Centre National de la Recherche Scientifique (CNRS), and ANR-09-JCJC-0073,TANGERINE(2009)
Subjects: multiplicative algorithms, Cognitive Neuroscience, majorization-equalization (ME), Multiplicative function, Parameterized complexity, 020206 networking & telecommunications, Monotonic function, 02 engineering and technology, Auxiliary function, majorization-minimization (MM), Non-negative matrix factorization, Euclidean distance, Computer Science - Learning, [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG], [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing, Arts and Humanities (miscellaneous), 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Penalty method, Divergence (statistics), [SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing, Algorithm, Nonnegative matrix factorization (NMF), beta-divergence, Mathematics
Abstract: This paper describes algorithms for nonnegative matrix factorization (NMF) with the beta-divergence (beta-NMF). The beta-divergence is a family of cost functions parametrized by a single shape parameter beta that takes the Euclidean distance, the Kullback-Leibler divergence and the Itakura-Saito divergence as special cases (beta = 2,1,0, respectively). The proposed algorithms are based on a surrogate auxiliary function (a local majorization of the criterion function). We first describe a majorization-minimization (MM) algorithm that leads to multiplicative updates, which differ from standard heuristic multiplicative updates by a beta-dependent power exponent. The monotonicity of the heuristic algorithm can however be proven for beta in (0,1) using the proposed auxiliary function. Then we introduce the concept of majorization-equalization (ME) algorithm which produces updates that move along constant level sets of the auxiliary function and lead to larger steps than MM. Simulations on synthetic and real data illustrate the faster convergence of the ME approach. The paper also describes how the proposed algorithms can be adapted to two common variants of NMF : penalized NMF (i.e., when a penalty function of the factors is added to the criterion function) and convex-NMF (when the dictionary is assumed to belong to a known subspace)., Comment: \`a para\^itre dans Neural Computation
Published: 2011

18. Multichannel Nonnegative Matrix Factorization in Convolutive Mixtures for Audio Source Separation

Author: Cédric Févotte and Alexey Ozerov
Subjects: Acoustics and Ultrasonics, Underdetermined system, business.industry, Pattern recognition, computer.software_genre, Blind signal separation, Matrix decomposition, Non-negative matrix factorization, Computer Science::Sound, Expectation–maximization algorithm, Source separation, Spectrogram, Artificial intelligence, Electrical and Electronic Engineering, business, Audio signal processing, computer, Mathematics
Abstract: We consider inference in a general data-driven object-based model of multichannel audio data, assumed generated as a possibly underdetermined convolutive mixture of source signals. We work in the short-time Fourier transform (STFT) domain, where convolution is routinely approximated as linear instantaneous mixing in each frequency band. Each source STFT is given a model inspired from nonnegative matrix factorization (NMF) with the Itakura-Saito divergence, which underlies a statistical model of superimposed Gaussian components. We address estimation of the mixing and source parameters using two methods. The first one consists of maximizing the exact joint likelihood of the multichannel data using an expectation-maximization (EM) algorithm. The second method consists of maximizing the sum of individual likelihoods of all channels using a multiplicative update algorithm inspired from NMF methodology. Our decomposition algorithms are applied to stereo audio source separation in various settings, covering blind and supervised separation, music and speech sources, synthetic instantaneous and convolutive mixtures, as well as professionally produced music recordings. Our EM method produces competitive results with respect to state-of-the-art as illustrated on two tasks from the international Signal Separation Evaluation Campaign (SiSEC 2008).
Published: 2010

19. Soft Nonnegative Matrix Co-Factorization

Author: Olivier Cappé, Nicolas Seichepine, Cédric Févotte, Slim Essid, Signal, Statistique et Apprentissage (S2A), Laboratoire Traitement et Communication de l'Information (LTCI), Institut Mines-Télécom [Paris] (IMT)-Télécom Paris-Institut Mines-Télécom [Paris] (IMT)-Télécom Paris, Département Traitement du Signal et des Images (TSI), Centre National de la Recherche Scientifique (CNRS)-Télécom ParisTech, Joseph Louis LAGRANGE (LAGRANGE), Université Côte d'Azur (UCA)-Université Nice Sophia Antipolis (... - 2019) (UNS), COMUE Université Côte d'Azur (2015-2019) (COMUE UCA)-COMUE Université Côte d'Azur (2015-2019) (COMUE UCA)-Observatoire de la Côte d'Azur, and Université Côte d'Azur (UCA)-COMUE Université Côte d'Azur (2015-2019) (COMUE UCA)-Institut national des sciences de l'Univers (INSU - CNRS)-Centre National de la Recherche Scientifique (CNRS)
Subjects: Optimization problem, business.industry, Pattern recognition, Non-negative matrix factorization, Euclidean distance, Speaker diarisation, Factorization, [STAT.ML]Statistics [stat]/Machine Learning [stat.ML], Signal Processing, Source separation, Nonnegative matrix, Artificial intelligence, Electrical and Electronic Engineering, Divergence (statistics), business, Algorithm, ComputingMilieux_MISCELLANEOUS, Mathematics
Abstract: This work introduces a new framework for nonnegative matrix factorization (NMF) in multisensor or multimodal data configurations, where taking into account the mutual dependence that exists between the related parallel streams of data is expected to improve performance. In contrast with previous works that focused on co-factorization methods -where some factors are shared by the different modalities-we propose a soft co-factorization scheme which accounts for possible local discrepancies across modalities or channels. This objective is formalized as an optimization problem where concurrent factorizations are jointly performed while being tied by a coupling term that penalizes differences between the related factor matrices associated with different modalities. We provide majorization-minimization (MM) algorithms for three common measures of fit-the squared Euclidean norm, the Kullback-Leibler divergence and the Itakura-Saito divergence-and two possible coupling variants, using either the l1 or the squared Euclidean norm of differences. The approach is shown to achieve promising performance in two audio-related tasks: multimodal speaker diarization using audiovisual data and audio source separation using stereo data.
Published: 2014

20. Alternating direction method of multipliers for non-negative matrix factorization with the beta-divergence

Author: Cédric Févotte and Dennis L. Sun
Subjects: Mathematical optimization, Class (set theory), Simple (abstract algebra), Convergence (routing), Nonnegative matrix, Function (mathematics), Divergence (statistics), Algorithm, Mathematics, Non-negative matrix factorization, Matrix decomposition
Abstract: Non-negative matrix factorization (NMF) is a popular method for learning interpretable features from non-negative data, such as counts or magnitudes. Different cost functions are used with NMF in different applications. We develop an algorithm, based on the alternating direction method of multipliers, that tackles NMF problems whose cost function is a beta-divergence, a broad class of divergence functions. We derive simple, closed-form updates for the most commonly used beta-divergences. We demonstrate experimentally that this algorithm has faster convergence and yields superior results to state-of-the-art algorithms for this problem.
Published: 2014

21. Piecewise constant nonnegative matrix factorization

Author: Nicolas Seichepine, Olivier Cappé, Cédric Févotte, Slim Essid, Département Images, Données, Signal (IDS), Télécom ParisTech, Signal, Statistique et Apprentissage (S2A), Laboratoire Traitement et Communication de l'Information (LTCI), Institut Mines-Télécom [Paris] (IMT)-Télécom Paris-Institut Mines-Télécom [Paris] (IMT)-Télécom Paris, and Institut Mines-Télécom [Paris] (IMT)-Télécom Paris
Subjects: Optimization problem, business.industry, 020206 networking & telecommunications, Pattern recognition, 02 engineering and technology, Synthetic data, Matrix decomposition, Non-negative matrix factorization, Matrix (mathematics), [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing, 0202 electrical engineering, electronic engineering, information engineering, Piecewise, 020201 artificial intelligence & image processing, Segmentation, Artificial intelligence, business, Cluster analysis, Algorithm, ComputingMilieux_MISCELLANEOUS, Mathematics
Abstract: In this paper we propose a non-negative matrix factorization (NMF) model with piecewise-constant activation coefficients. This structure is enforced using a total variation penalty on the rows of the activation matrix. The resulting optimization problem is solved with a majorization-minimization procedure. The proposed algorithm is well suited to analyze data explained by underlying piecewise-constant sequences of states. Its properties are first illustrated using synthetic data. We then use it to solve a video structuring problem that involves both segmentation and clustering tasks. An improvement over a state-of-the-art temporally smoothed NMF algorithm of both clustering and segmentation quality measures is observed.
Published: 2014

22. Soft nonnegative matrix co-factorizationwith application to multimodal speaker diarization

Author: Slim Essid, Cédric Févotte, Nicolas Seichepine, Olivier Cappé, Département Images, Données, Signal (IDS), Télécom ParisTech, Signal, Statistique et Apprentissage (S2A), Laboratoire Traitement et Communication de l'Information (LTCI), Institut Mines-Télécom [Paris] (IMT)-Télécom Paris-Institut Mines-Télécom [Paris] (IMT)-Télécom Paris, and Institut Mines-Télécom [Paris] (IMT)-Télécom Paris
Subjects: Modalities, business.industry, Speech recognition, Feature vector, 020206 networking & telecommunications, Pattern recognition, 02 engineering and technology, Speaker recognition, Matrix decomposition, Non-negative matrix factorization, Speaker diarisation, [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing, [STAT.ML]Statistics [stat]/Machine Learning [stat.ML], 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Artificial intelligence, Nonnegative matrix, business, ComputingMilieux_MISCELLANEOUS, Mathematics, Curse of dimensionality
Abstract: This paper presents a new method for bimodal nonnegative matrix factorization (NMF). This method is well-suited to situations where two streams of data are concurrently analyzed and are expected to be related by loosely common factors. It allows for a soft co-factorization, which takes into account the relationship that exists between the modalities being processed, but returns different factors for distinct modalities. There is no need that the data related with each modality live in the same feature space; there is also no need that they have the same dimensionality. The co-factorization is obtained via a majorization-minimization (MM) algorithm. The behavior of the method is illustrated on both synthetic and real-world data. In particular, we show that exploiting the correlation between audio and video modalities in edited talk-show videos improve speaker diarization results.
Published: 2013

23. Automatic relevance determination in nonnegative matrix factorization with the β-divergence

Author: Cédric Févotte and Vincent Y. F. Tan
Subjects: Kullback–Leibler divergence, Databases, Factual, Economics, Overfitting, Bayesian inference, Matrix decomposition, Non-negative matrix factorization, Pattern Recognition, Automated, Artificial Intelligence, Maximum a posteriori estimation, Humans, Computer Simulation, Swimming, Mathematics, Models, Statistical, business.industry, Applied Mathematics, Pattern recognition, Bayes Theorem, Models, Theoretical, Computational Theory and Mathematics, Principal component analysis, Computer Vision and Pattern Recognition, Artificial intelligence, business, Scale parameter, Software, Algorithms
Abstract: This paper addresses the estimation of the latent dimensionality in nonnegative matrix factorization (NMF) with the β-divergence. The β-divergence is a family of cost functions that includes the squared euclidean distance, Kullback-Leibler (KL) and Itakura-Saito (IS) divergences as special cases. Learning the model order is important as it is necessary to strike the right balance between data fidelity and overfitting. We propose a Bayesian model based on automatic relevance determination (ARD) in which the columns of the dictionary matrix and the rows of the activation matrix are tied together through a common scale parameter in their prior. A family of majorization-minimization (MM) algorithms is proposed for maximum a posteriori (MAP) estimation. A subset of scale parameters is driven to a small lower bound in the course of inference, with the effect of pruning the corresponding spurious components. We demonstrate the efficacy and robustness of our algorithms by performing extensive experiments on synthetic data, the swimmer dataset, a music decomposition example, and a stock price prediction task.
Published: 2013

24. Non-negative dynamical system with application to speech and audio

Author: Cédric Févotte, John R. Hershey, and Jonathan Le Roux
Subjects: Signal processing, Dynamical systems theory, business.industry, Computer science, Noise (signal processing), Speech recognition, Speech coding, Markov process, Pattern recognition, Dynamical system, computer.software_genre, Linear dynamical system, Matrix decomposition, Non-negative matrix factorization, Speech enhancement, symbols.namesake, Computer Science::Sound, symbols, Artificial intelligence, Audio signal processing, business, computer
Abstract: Non-negative data arise in a variety of important signal processing domains, such as power spectra of signals, pixels in images, and count data. This paper introduces a novel non-negative dynamical system (NDS) for sequences of such data, and describes its application to modeling speech and audio power spectra. The NDS model can be interpreted both as an adaptation of linear dynamical systems (LDS) to non-negative data, and as an extension of non-negative matrix factorization (NMF) to support Markovian dynamics. Learning and inference algorithms were derived and experiments on speech enhancement were conducted by training sparse non-negative dynamical systems on speech data and adapting a noise model to the unknown noise condition. Results show that the model can capture the dynamics of speech in a useful way.
Published: 2013

25. Robust nonnegative matrix factorization for nonlinear unmixing of hyperspectral images

Author: Nicolas Dobigeon, Cédric Févotte, Centre National de la Recherche Scientifique - CNRS (FRANCE), Institut National Polytechnique de Toulouse - Toulouse INP (FRANCE), Université Nice Sophia Antipolis (FRANCE), Observatoire de la Côte d'Azur (FRANCE), Université Toulouse III - Paul Sabatier - UT3 (FRANCE), Université Toulouse - Jean Jaurès - UT2J (FRANCE), Université Toulouse 1 Capitole - UT1 (FRANCE), Signal et Communications (IRIT-SC), Institut de recherche en informatique de Toulouse (IRIT), Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse - Jean Jaurès (UT2J)-Université Toulouse III - Paul Sabatier (UT3), Université Fédérale Toulouse Midi-Pyrénées-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées, Institut National Polytechnique (Toulouse) (Toulouse INP), Joseph Louis LAGRANGE (LAGRANGE), Université Côte d'Azur (UCA)-Université Nice Sophia Antipolis (... - 2019) (UNS), COMUE Université Côte d'Azur (2015-2019) (COMUE UCA)-COMUE Université Côte d'Azur (2015-2019) (COMUE UCA)-Observatoire de la Côte d'Azur, Université Côte d'Azur (UCA)-COMUE Université Côte d'Azur (2015-2019) (COMUE UCA)-Institut national des sciences de l'Univers (INSU - CNRS)-Centre National de la Recherche Scientifique (CNRS), and Institut National Polytechnique de Toulouse - INPT (FRANCE)
Subjects: 0211 other engineering and technologies, 02 engineering and technology, Non-negative matrix factorization, [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI], Traitement des images, [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing, 0202 electrical engineering, electronic engineering, information engineering, Robust nonnegative matrix factorization, Robustnonnegative matrix factorization, Traitement du signal et de l'image, Nonlinear unmixing, Computer vision, Synthèse d'image et réalité virtuelle, 021101 geological & geomatics engineering, Mathematics, Spectral signature, business.industry, Multiplicative function, Group-sparsity, Linear model, Hyperspectral imaging, [INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV], 020206 networking & telecommunications, Vision par ordinateur et reconnaissance de formes, Intelligence artificielle, [INFO.INFO-GR]Computer Science [cs]/Graphics [cs.GR], Nonlinear system, [INFO.INFO-TI]Computer Science [cs]/Image Processing [eess.IV], Hyperspectral imagery, Artificial intelligence, business, Algorithm
Abstract: International audience; This paper introduces a robust linear model to describe hyperspectral data arising from the mixture of several pure spectral signatures. This new model not only generalizes the commonly used linear mixing model but also allows for possible nonlinear effects to be handled, relying on mild assumptions regarding these nonlinearities. Based on this model, a nonlinear unmixing procedure is proposed. The standard nonnegativity and sum-to-one constraints inherent to spectral unmixing are coupled with a group-sparse constraint imposed on the nonlinearity component. The resulting objective function is minimized using a multiplicative algorithm. Simulation results obtained on synthetic and real data show that the proposed strategy competes with state-of-the-art linear and nonlinear unmixing methods.
Published: 2013

26. Decomposing the video editing structure of a talk-show using nonnegative matrix factorization

Author: Slim Essid, Cédric Févotte, Département Images, Données, Signal (IDS), Télécom ParisTech, Signal, Statistique et Apprentissage (S2A), Laboratoire Traitement et Communication de l'Information (LTCI), Institut Mines-Télécom [Paris] (IMT)-Télécom Paris-Institut Mines-Télécom [Paris] (IMT)-Télécom Paris, and Institut Mines-Télécom [Paris] (IMT)-Télécom Paris
Subjects: business.industry, Pattern recognition, 02 engineering and technology, Structuring, Matrix decomposition, Non-negative matrix factorization, ComputingMethodologies_PATTERNRECOGNITION, Video editing, Image representation, [INFO.INFO-TI]Computer Science [cs]/Image Processing [eess.IV], 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Artificial intelligence, business, Hidden Markov model, Bag of features, ComputingMilieux_MISCELLANEOUS, Mathematics, Count data
Abstract: We introduce a novel video structuring scheme that exploits nonnegative matrix factorization (NMF) on count data (in a bag of features representation of the visual stream) to jointly discover latent structuring patterns and their activations in time. Our NMF variant employs the Kullback-Leibler divergence as a cost function and imposes a temporal smoothness constraint to the activations. It is solved by a majorization-minimization technique. Our method is shown to be successful for decomposing the high-level editing structure of talk-shows. It is evaluated using a challenging database of TV political-debate programs, and found to clearly outperform a reference HMM method.
Published: 2012

27. Optimal cost function and magnitude power for NMF-based speech separation and music interpolation

Author: Cédric Févotte, Paris Smaragdis, and Brian King
Subjects: Mathematical optimization, Source separation, Bilinear interpolation, Function (mathematics), Speech processing, Matrix decomposition, Mathematics, Non-negative matrix factorization, Power (physics), Interpolation
Abstract: There has been a significant amount of research in new algorithms and applications for nonnegative matrix factorization, but relatively little has been published on practical considerations for real-world applications, such as choosing optimal parameters for a particular application. In this paper, we will look at two applications, single-channel source separation of speech and interpolating missing music data. We will present the optimal parameters found for the experiments as well as discuss how parameters affect performance.
Published: 2012

28. Majorization-minimization algorithm for smooth Itakura-Saito nonnegative matrix factorization

Author: Cédric Févotte
Subjects: Audio signal, Markov process, computer.software_genre, Non-negative matrix factorization, Matrix decomposition, symbols.namesake, Computer Science::Sound, symbols, Source separation, Spectrogram, Minification, Audio signal processing, Algorithm, computer, Mathematics
Abstract: Nonnegative matrix factorization (NMF) with the Itakura-Saito divergence has proven efficient for audio source separation and music transcription, where the signal power spectrogram is factored into a “dictionary” matrix times an “activation” matrix. Given the nature of audio signals it is expected that the activation coefficients exhibit smoothness along time frames. This may be enforced by penalizing the NMF objective function with an extra term reflecting smoothness of the activation coefficients. We propose a novel regularization term that solves some deficiencies of our previous work and leads to an efficient implementation using a majorization-minimization procedure.
Published: 2011

29. Itakura-Saito nonnegative matrix factorization with group sparsity

Author: Francis Bach, Augustin Lefèvre, Cédric Févotte, Laboratoire d'informatique de l'école normale supérieure (LIENS), Département d'informatique - ENS Paris (DI-ENS), École normale supérieure - Paris (ENS-PSL), Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-École normale supérieure - Paris (ENS-PSL), Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS), Laboratoire Traitement et Communication de l'Information (LTCI), Télécom ParisTech-Institut Mines-Télécom [Paris] (IMT)-Centre National de la Recherche Scientifique (CNRS), Statistical Machine Learning and Parsimony (SIERRA), Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Inria Paris-Rocquencourt, Institut National de Recherche en Informatique et en Automatique (Inria), European Project: 239993,EC:FP7:ERC,ERC-2009-StG,SIERRA(2009), Bach, Francis, Sparse Structured Methods for Machine Learning - SIERRA - - EC:FP7:ERC2009-12-01 - 2014-11-30 - 239993 - VALID, École normale supérieure - Paris (ENS Paris), Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-Centre National de la Recherche Scientifique (CNRS), Département d'informatique de l'École normale supérieure (DI-ENS), Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-École normale supérieure - Paris (ENS Paris), Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-École normale supérieure - Paris (ENS Paris), Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-École normale supérieure - Paris (ENS Paris), Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL), and Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-Inria Paris-Rocquencourt
Subjects: [INFO.INFO-TS] Computer Science [cs]/Signal and Image Processing, 02 engineering and technology, computer.software_genre, Blind signal separation, Synthetic data, Non-negative matrix factorization, Matrix decomposition, Statistics::Machine Learning, [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG], [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing, 0202 electrical engineering, electronic engineering, information engineering, Source separation, Test statistic, Audio signal processing, [SPI.SIGNAL] Engineering Sciences [physics]/Signal and Image processing, Mathematics, business.industry, 020206 networking & telecommunications, Pattern recognition, [INFO.INFO-LG] Computer Science [cs]/Machine Learning [cs.LG], ComputingMethodologies_PATTERNRECOGNITION, Computer Science::Sound, Unsupervised learning, 020201 artificial intelligence & image processing, Artificial intelligence, business, [SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing, computer
Abstract: International audience; We propose an unsupervised inference procedure for audio source separation. Components in nonnegative matrix factorization (NMF) are grouped automatically in audio sources via a penalized maximum likelihood approach. The penalty term we introduce favors sparsity at the group level, and is motivated by the assumption that the local amplitude of the sources are independent. Our algorithm extends multiplicative updates for NMF; moreover we propose a test statistic to tune hyperparameters in our model, and illustrate its adequacy on synthetic data. Results on real audio tracks show that our sparsity prior allows to identify audio sources without knowledge on their spectral properties.
Published: 2011

30. Online algorithms for Nonnegative Matrix Factorization with the Itakura-Saito divergence

Author: Francis Bach, Cédric Févotte, Augustin Lefèvre, Statistical Machine Learning and Parsimony (SIERRA), Département d'informatique - ENS Paris (DI-ENS), École normale supérieure - Paris (ENS-PSL), Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-École normale supérieure - Paris (ENS-PSL), Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Inria Paris-Rocquencourt, Institut National de Recherche en Informatique et en Automatique (Inria), Laboratoire d'informatique de l'école normale supérieure (LIENS), Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS), Laboratoire Traitement et Communication de l'Information (LTCI), Télécom ParisTech-Institut Mines-Télécom [Paris] (IMT)-Centre National de la Recherche Scientifique (CNRS), ANR-09-JCJC-0073,TANGERINE(2009), European Project: 239993,EC:FP7:ERC,ERC-2009-StG,SIERRA(2009), Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-École normale supérieure - Paris (ENS Paris), Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-École normale supérieure - Paris (ENS Paris), Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-Inria Paris-Rocquencourt, Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL), Département d'informatique de l'École normale supérieure (DI-ENS), École normale supérieure - Paris (ENS Paris), Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-École normale supérieure - Paris (ENS Paris), and Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-Centre National de la Recherche Scientifique (CNRS)
Subjects: FOS: Computer and information sciences, online algorithms, Speech recognition, Machine Learning (stat.ML), 02 engineering and technology, computer.software_genre, matrix factorization, Blind signal separation, Matrix decomposition, Non-negative matrix factorization, Dimension (vector space), [STAT.ML]Statistics [stat]/Machine Learning [stat.ML], [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing, Statistics - Machine Learning, 0202 electrical engineering, electronic engineering, information engineering, Source separation, Online algorithm, Audio signal processing, Mathematics, Audio signal, audio source separation, 020206 networking & telecommunications, Bregman divergences, machine learning, 020201 artificial intelligence & image processing, computer, [SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing
Abstract: Nonnegative matrix factorization (NMF) is now a common tool for audio source separation. When learning NMF on large audio databases, one major drawback is that the complexity in time is O(FKN) when updating the dictionary (where (F;N) is the dimension of the input power spectrograms, and K the number of basis spectra), thus forbidding its application on signals longer than an hour. We provide an online algorithm with a complexity of O(FK) in time and memory for updates in the dictionary. We show on audio simulations that the online approach is faster for short audio signals and allows to analyze audio signals of several hours.
Published: 2011
Full Text: View/download PDF

31. MAXIMUM MARGINAL LIKELIHOOD ESTIMATION FOR NONNEGATIVE DICTIONARY LEARNING

Author: Onur Dikmen, Cédric Févotte, Laboratoire Traitement et Communication de l'Information (LTCI), Télécom ParisTech-Institut Mines-Télécom [Paris] (IMT)-Centre National de la Recherche Scientifique (CNRS), and Févotte, Cédric
Subjects: Restricted maximum likelihood, [INFO.INFO-TS] Computer Science [cs]/Signal and Image Processing, sparse coding, model order selection, 02 engineering and technology, Poisson distribution, Conjugate prior, variational EM, Non-negative matrix factorization, 03 medical and health sciences, symbols.namesake, 0302 clinical medicine, [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing, automatic relevance determination, Expectation–maximization algorithm, 0202 electrical engineering, electronic engineering, information engineering, Divergence (statistics), Mathematics, [SPI.SIGNAL] Engineering Sciences [physics]/Signal and Image processing, Nonnegative matrix factorisation, business.industry, Pattern recognition, Maximum likelihood sequence estimation, Marginal likelihood, Statistics::Computation, symbols, 020201 artificial intelligence & image processing, Artificial intelligence, business, Algorithm, [SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing, 030217 neurology & neurosurgery
Abstract: Submitted to ICASSP'2011; We describe an alternative to standard nonnegative matrix factorisation (NMF) for nonnegative dictionary learning. NMF with the Kullback-Leibler divergence can be seen as maximisation of the joint likelihood of the dictionary and the expansion coefficients under Poisson observation noise. This approach lacks optimality because the number of parameters (which include the expansion coefficients) grows with the number of observations. As such, we describe a variational EM algorithm for optimisation of the marginal likelihood, i.e., the likelihood of the dictionary where the expansion coefficients have been integrated out given a Gamma conjugate prior. We compare the output of both maximum joint likelihood estimation (i.e., standard NMF) and maximum marginal likelihood estimation (MMLE) on real and synthetical data. The MMLE approach is shown to embed automatic model order selection, similar to automatic relevance determination.
Published: 2010

32. Multichannel nonnegative matrix factorization in convolutive mixtures. With application to blind audio source separation

Author: Cédric Févotte and Alexey Ozerov
Subjects: business.industry, Statistical model, Pattern recognition, Blind signal separation, Matrix decomposition, Non-negative matrix factorization, symbols.namesake, Expectation–maximization algorithm, symbols, Source separation, Artificial intelligence, business, Divergence (statistics), Gaussian process, Mathematics
Abstract: We consider inference in a general data-driven object-based model of multichannel audio data, assumed generated as a possibly under-determined convolutive mixture of source signals. Each source is given a model inspired from nonnegative matrix factorization (NMF) with the Itakura-Saito divergence, which underlies a statistical model of superimposed Gaussian components. We address estimation of the mixing and source parameters using two methods. The first one consists of maximizing the exact joint likelihood of the multichannel data using an expectation-maximization algorithm. The second method consists of maximizing the sum of individual likelihoods of all channels using a multiplicative update algorithm inspired from NMF methodology. Our decomposition algorithms were applied to stereo music and assessed in terms of blind source separation performance.
Published: 2009

33. A tempering approach for Itakura-Saito non-negative matrix factorization. With application to music transcription

Author: Roland Badeau, Nancy Bertin, Cédric Févotte, Laboratoire Traitement et Communication de l'Information (LTCI), Télécom ParisTech-Institut Mines-Télécom [Paris] (IMT)-Centre National de la Recherche Scientifique (CNRS), and Badeau, Roland
Subjects: [INFO.INFO-TS] Computer Science [cs]/Signal and Image Processing, business.industry, 020206 networking & telecommunications, Pattern recognition, Context (language use), 02 engineering and technology, Function (mathematics), Matrix decomposition, Non-negative matrix factorization, Euclidean distance, Maxima and minima, [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing, Computer Science::Sound, Convergence (routing), 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Artificial intelligence, business, Divergence (statistics), Algorithm, [SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing, [SPI.SIGNAL] Engineering Sciences [physics]/Signal and Image processing, Mathematics
Abstract: International audience; In this paper we are interested in non-negative matrix factorization (NMF) with the Itakura-Saito (IS) divergence. Previous work has demonstrated the relevance of this cost function for the decomposition of audio power spectrograms. This is in particular due to its scale invariance, which makes it more robust to the wide dynamics of audio, a property which is not shared by other popular costs such as the Euclidean distance or the generalized Kulback-Leibler (KL) divergence. However, while the latter two cost functions are convex, the IS divergence is not, which makes it more prone to convergence to irrelevant local minima, as observed empirically. Thus, the aim of this paper is to propose a tempering scheme that favors convergence of IS-NMF to global minima. Our algorithm is based on NMF with the beta-divergence, where the shape parameter beta acts as a temperature parameter. Results on both synthetical and music data (in a transcription context) show the relevance of our approach.
Published: 2009

34. Nonnegative matrix factorization with the Itakura-Saito divergence: with application to music analysis

Author: Nancy Bertin, Jean-Louis Durrieu, and Cédric Févotte
Subjects: Mathematical optimization, Models, Statistical, Sound Spectrography, Time Factors, Markov chain, Cognitive Neuroscience, Information Storage and Retrieval, Statistical model, Function (mathematics), Markov Chains, Non-negative matrix factorization, Pattern Recognition, Automated, Euclidean distance, Arts and Humanities (miscellaneous), Acoustic Stimulation, Prior probability, Spectrogram, Humans, Divergence (statistics), Pitch Perception, Algorithm, Algorithms, Music, Mathematics
Abstract: This letter presents theoretical, algorithmic, and experimental results about nonnegative matrix factorization (NMF) with the Itakura-Saito (IS) divergence. We describe how IS-NMF is underlaid by a well-defined statistical model of superimposed gaussian components and is equivalent to maximum likelihood estimation of variance parameters. This setting can accommodate regularization constraints on the factors through Bayesian priors. In particular, inverse-gamma and gamma Markov chain priors are considered in this work. Estimation can be carried out using a space-alternating generalized expectation-maximization (SAGE) algorithm; this leads to a novel type of NMF algorithm, whose convergence to a stationary point of the IS cost function is guaranteed. We also discuss the links between the IS divergence and other cost functions used in NMF, in particular, the Euclidean distance and the generalized Kullback-Leibler (KL) divergence. As such, we describe how IS-NMF can also be performed using a gradient multiplicative algorithm (a standard algorithm structure in NMF) whose convergence is observed in practice, though not proven. Finally, we report a furnished experimental comparative study of Euclidean-NMF, KL-NMF, and IS-NMF algorithms applied to the power spectrogram of a short piano sequence recorded in real conditions, with various initializations and model orders. Then we show how IS-NMF can successfully be employed for denoising and upmix (mono to stereo conversion) of an original piece of early jazz music. These experiments indicate that IS-NMF correctly captures the semantics of audio and is better suited to the representation of music signals than NMF with the usual Euclidean and KL costs.
Published: 2008

35. Negative Binomial Matrix Factorization

Author: Thomas Oberlin, Olivier Gouvert, Cédric Févotte, Signal et Communications (IRIT-SC), Institut de recherche en informatique de Toulouse (IRIT), Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse - Jean Jaurès (UT2J)-Université Toulouse III - Paul Sabatier (UT3), Université Fédérale Toulouse Midi-Pyrénées-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées, Centre National de la Recherche Scientifique (CNRS), ANR-19-P3IA-0004,ANITI,Artificial and Natural Intelligence Toulouse Institute(2019), Centre National de la Recherche Scientifique - CNRS (FRANCE), Institut National Polytechnique de Toulouse - Toulouse INP (FRANCE), Institut Supérieur de l'Aéronautique et de l'Espace - ISAE-SUPAERO (FRANCE), Université Toulouse III - Paul Sabatier - UT3 (FRANCE), Université Toulouse - Jean Jaurès - UT2J (FRANCE), and Université Toulouse 1 Capitole - UT1 (FRANCE)
Subjects: Poisson factorization, Applied Mathematics, Multiplicative function, Collaborative filtering, Negative binomial distribution, Informatique et langage, 020206 networking & telecommunications, Non-negative matrix factorization, 02 engineering and technology, Over-dispersion, Matrix decomposition, Matrix (mathematics), Overdispersion, Factorization, [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG], [STAT.ML]Statistics [stat]/Machine Learning [stat.ML], Signal Processing, 0202 electrical engineering, electronic engineering, information engineering, Applied mathematics, Majorization-minimization, Electrical and Electronic Engineering, Mathematics, Count data
Abstract: International audience; We introduce negative binomial matrix factoriza-tion (NBMF), a matrix factorization technique specially designed for analyzing over-dispersed count data. It can be viewed as an extension of Poisson factorization (PF) perturbed by a multiplicative term which models exposure. This term brings a degree of freedom for controlling the dispersion, making NBMF more robust to outliers. We describe a majorization-minimization (MM) algorithm for a maximum likelihood estimation of the parameters. We provide results on a recommendation task and demonstrate the ability of NBMF to efficiently exploit raw data.
Full Text: View/download PDF

36. Adversarially-Trained Nonnegative Matrix Factorization

Author: Ting Cai, Vincent Y. F. Tan, Cédric Févotte, National University of Singapore (NUS), Signal et Communications (IRIT-SC), Institut de recherche en informatique de Toulouse (IRIT), Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse - Jean Jaurès (UT2J)-Université Toulouse III - Paul Sabatier (UT3), Université Fédérale Toulouse Midi-Pyrénées-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées, Singapore NRF Fellowship (R-263-000-D02-281), ANR-19-P3IA-0004,ANITI,Artificial and Natural Intelligence Toulouse Institute(2019), European Project: CoG-6681839,ERC FACTORY, Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université de Toulouse (UT)-Toulouse Mind & Brain Institut (TMBI), Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Université Toulouse Capitole (UT Capitole), and Université de Toulouse (UT)
Subjects: Signal Processing (eess.SP), FOS: Computer and information sciences, Computer Science - Machine Learning, Computer science, Generalization, MathematicsofComputing_NUMERICALANALYSIS, 02 engineering and technology, Non-negative Matrix Factorization, Data matrix (multivariate statistics), Matrix decomposition, Non-negative matrix factorization, Machine Learning (cs.LG), Matrix (mathematics), [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing, [STAT.ML]Statistics [stat]/Machine Learning [stat.ML], 0202 electrical engineering, electronic engineering, information engineering, FOS: Electrical engineering, electronic engineering, information engineering, Adversarial Training, Matrix Completion, Electrical and Electronic Engineering, Electrical Engineering and Systems Science - Signal Processing, Matrix completion, Applied Mathematics, Dimensionality reduction, 020206 networking & telecommunications, Norm (mathematics), Signal Processing, Algorithm
Abstract: We consider an adversarially-trained version of the nonnegative matrix factorization, a popular latent dimensionality reduction technique. In our formulation, an attacker adds an arbitrary matrix of bounded norm to the given data matrix. We design efficient algorithms inspired by adversarial training to optimize for dictionary and coefficient matrices with enhanced generalization abilities. Extensive simulations on synthetic and benchmark datasets demonstrate the superior predictive performance on matrix completion tasks of our proposed method compared to state-of-the-art competitors, including other variants of adversarial nonnegative matrix factorization., Accepted to the IEEE Signal Processing Letters; 5 pages, 4 figures
Full Text: View/download PDF

37. Convex nonnegative matrix factorization with missing data

Author: Cédric Févotte, Valentin Emiya, Ronan Hamon, éQuipe AppRentissage et MultimediA [Marseille] (QARMA), Laboratoire d'informatique Fondamentale de Marseille (LIF), Aix Marseille Université (AMU)-École Centrale de Marseille (ECM)-Centre National de la Recherche Scientifique (CNRS)-Aix Marseille Université (AMU)-École Centrale de Marseille (ECM)-Centre National de la Recherche Scientifique (CNRS), Signal et Communications (IRIT-SC), Institut de recherche en informatique de Toulouse (IRIT), Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université de Toulouse (UT)-Toulouse Mind & Brain Institut (TMBI), Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT), Joseph Louis LAGRANGE (LAGRANGE), Université Nice Sophia Antipolis (1965 - 2019) (UNS), COMUE Université Côte d'Azur (2015-2019) (COMUE UCA)-COMUE Université Côte d'Azur (2015-2019) (COMUE UCA)-Institut national des sciences de l'Univers (INSU - CNRS)-Observatoire de la Côte d'Azur, COMUE Université Côte d'Azur (2015-2019) (COMUE UCA)-Université Côte d'Azur (UCA)-Université Côte d'Azur (UCA)-Centre National de la Recherche Scientifique (CNRS), Centre National de la Recherche Scientifique (CNRS), ANR-14-CE27-0002,MAD,Inpainting de données audio manquantes(2014), Centre National de la Recherche Scientifique (CNRS)-École Centrale de Marseille (ECM)-Aix Marseille Université (AMU)-Centre National de la Recherche Scientifique (CNRS)-École Centrale de Marseille (ECM)-Aix Marseille Université (AMU), Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse - Jean Jaurès (UT2J)-Université Toulouse III - Paul Sabatier (UT3), Université Fédérale Toulouse Midi-Pyrénées-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées, Université Nice Sophia Antipolis (... - 2019) (UNS), Université Côte d'Azur (UCA)-COMUE Université Côte d'Azur (2015-2019) (COMUE UCA)-Université Côte d'Azur (UCA)-Centre National de la Recherche Scientifique (CNRS), Aix-Marseille Université - AMU (FRANCE), Centre National de la Recherche Scientifique - CNRS (FRANCE), Institut National Polytechnique de Toulouse - INPT (FRANCE), Université Toulouse III - Paul Sabatier - UT3 (FRANCE), Université Toulouse - Jean Jaurès - UT2J (FRANCE), Université Toulouse 1 Capitole - UT1 (FRANCE), and Institut National Polytechnique de Toulouse - Toulouse INP (FRANCE)
Subjects: Matrix completion, Optimization problem, inpainting, Nonnegativity, Context (language use), 02 engineering and technology, low rank, Synthetic data, Data matrix (multivariate statistics), Matrix decomposition, Non-negative matrix factorization, Nonnegative matrix factorization, 030507 speech-language pathology & audiology, 03 medical and health sciences, Traitement des images, missing data, [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing, [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG], 0202 electrical engineering, electronic engineering, information engineering, Traitement du signal et de l'image, Convex combination, Synthèse d'image et réalité virtuelle, Spectrogram inpainting, Mathematics, business.industry, Matrix factorization, Pattern recognition, Vision par ordinateur et reconnaissance de formes, Intelligence artificielle, Missing data, Low-rankness, 020201 artificial intelligence & image processing, Artificial intelligence, 0305 other medical science, business, Algorithm, matrix completion
Abstract: International audience; Convex nonnegative matrix factorization (CNMF) is a variant of nonnegative matrix factorization (NMF) in which the components are a convex combination of atoms of a known dictionary. In this contribution, we propose to extend CNMF to the case where the data matrix and the dictionary have missing entries. After a formulation of the problem in this context of missing data, we propose a majorization-minimization algorithm for the solving of the optimization problem incurred. Experimental results with synthetic data and audio spectrograms highlight an improvement of the performance of reconstruction with respect to standard NMF. The performance gap is particularly significant when the task of reconstruction becomes arduous, e.g. when the ratio of missing data is high, the noise is steep, or the complexity of data is high.

38. Positive Semidefinite Matrix Factorization: A Link to Phase Retrieval And A Block Gradient Algorithm

Author: Cédric Févotte, Dana Lahat, Signal et Communications (IRIT-SC), Institut de recherche en informatique de Toulouse (IRIT), Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université de Toulouse (UT)-Toulouse Mind & Brain Institut (TMBI), Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT), Centre National de la Recherche Scientifique (CNRS), ANR-19-P3IA-0004,ANITI,Artificial and Natural Intelligence Toulouse Institute(2019), European Project: 681839,H2020,ERC-2015-CoG,FACTORY(2016), Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse - Jean Jaurès (UT2J)-Université Toulouse III - Paul Sabatier (UT3), Université Fédérale Toulouse Midi-Pyrénées-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse 1 Capitole (UT1), and Université Fédérale Toulouse Midi-Pyrénées
Subjects: Computer science, 0211 other engineering and technologies, Low-rank approximation, 02 engineering and technology, Positive-definite matrix, matrix factorization, Non-negative matrix factorization, Matrix decomposition, Matrix (mathematics), Positive semidefinite factorization, Factorization, [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing, 0202 electrical engineering, electronic engineering, information engineering, Nonnegative matrix, nonnegative factorizations, low-rank matrix recovery, Semidefinite programming, phase retrieval, low-rank approximation, 021103 operations research, 020206 networking & telecommunications, rank minimization, affine rank minimization, semidefinite programming, Phase retrieval, Algorithm
Abstract: International audience; This paper deals with positive semidefinite matrix factorization (PSDMF). PSDMF writes each entry of a nonnegative matrix as the inner product of two symmetric positive semidefinite matrices. PSDMF generalizes nonnegative matrix factorization. Exact PSDMF has found applications in combinatorial optimization, quantum communication complexity, and quantum information theory, among others. In this paper, we show, for the first time, a link between PSDMF and the problem of matrix recovery from phaseless measurements, which includes phase retrieval. We demonstrate the usefulness of this observation by proposing a new type of local optimization scheme for PSDMF, which is based on a generalization of the Wirtinger flow method for phase retrieval. Numerical experiments show that our algorithm can performs as well as state-of-the-art algorithms, in certain setups. We suggest that this link between the two types of problems, which have until now been addressed separately, opens the door to new applications, algorithms, and insights.
Full Text: View/download PDF

39. Positive Semidefinite Matrix Factorization Based on Truncated Wirtinger Flow

Author: Cédric Févotte, Dana Lahat, Signal et Communications (IRIT-SC), Institut de recherche en informatique de Toulouse (IRIT), Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse - Jean Jaurès (UT2J)-Université Toulouse III - Paul Sabatier (UT3), Université Fédérale Toulouse Midi-Pyrénées-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées, Centre National de la Recherche Scientifique (CNRS), ANR-19-P3IA-0004,ANITI,Artificial and Natural Intelligence Toulouse Institute(2019), European Project: 681839,H2020,ERC-2015-CoG,FACTORY(2016), Lahat, Dana, Artificial and Natural Intelligence Toulouse Institute - - ANITI2019 - ANR-19-P3IA-0004 - P3IA - VALID, New paradigms for latent factor estimation - FACTORY - - H20202016-09-01 - 2021-08-31 - 681839 - VALID, Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université de Toulouse (UT)-Toulouse Mind & Brain Institut (TMBI), Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Université Toulouse Capitole (UT Capitole), and Université de Toulouse (UT)
Subjects: Low rank matrix approximation, Linear programming, [INFO.INFO-TS] Computer Science [cs]/Signal and Image Processing, Matrix factorization, 020206 networking & telecommunications, 02 engineering and technology, Positive-definite matrix, Nonnegative Matrix Factorization, Non-negative matrix factorization, Local convergence, Matrix decomposition, Matrix (mathematics), Positive semidefinite factorization, Factorization, [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing, 0202 electrical engineering, electronic engineering, information engineering, Applied mathematics, Optimization Algorithms, 020201 artificial intelligence & image processing, Affine transformation, Affine rank minimization, Mathematics, Phase retrieval
Abstract: Accepted to EUSIPCO 2020; International audience; This paper deals with algorithms for positive semidefinite matrix factorization (PSDMF). PSDMF is a recently-proposed extension of nonnegative matrix factorization with applications in combinatorial optimization, among others. In this paper, we focus on improving the local convergence of an alternating block gradient (ABG) method for PSDMF in a noise-free setting by replacing the quadratic objective function with the Poisson log-likelihood. This idea is based on truncated Wirtinger flow (TWF), a phase retrieval (PR) method that trims outliers in the gradient and thus regularizes it. Our motivation is a recent result linking PR with PSDMF. Our numerical experiments validate that the numerical benefits of TWF may carry over to PSDMF despite the more challenging setting, when initialized within its region of convergence. We then extend TWF from PR to affine rank minimization (ARM), and show that although the outliers are no longer an issue in the ARM setting, PSDMF with the new objective function may still achieves a smaller error for the same number of iterations. In a broader view, our results indicate that a proper choice of objective function may enhance convergence of matrix (or tensor) factorization methods.
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Journal

Database

Publisher

39 results on '"Cédric Févotte"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources