Back to Search
Start Over
A tempering approach for Itakura-Saito non-negative matrix factorization. With application to music transcription
- Source :
- Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2009, Taipei, Taiwan. pp.1545--1548, ICASSP
- Publication Year :
- 2009
- Publisher :
- HAL CCSD, 2009.
-
Abstract
- International audience; In this paper we are interested in non-negative matrix factorization (NMF) with the Itakura-Saito (IS) divergence. Previous work has demonstrated the relevance of this cost function for the decomposition of audio power spectrograms. This is in particular due to its scale invariance, which makes it more robust to the wide dynamics of audio, a property which is not shared by other popular costs such as the Euclidean distance or the generalized Kulback-Leibler (KL) divergence. However, while the latter two cost functions are convex, the IS divergence is not, which makes it more prone to convergence to irrelevant local minima, as observed empirically. Thus, the aim of this paper is to propose a tempering scheme that favors convergence of IS-NMF to global minima. Our algorithm is based on NMF with the beta-divergence, where the shape parameter beta acts as a temperature parameter. Results on both synthetical and music data (in a transcription context) show the relevance of our approach.
- Subjects :
- [INFO.INFO-TS] Computer Science [cs]/Signal and Image Processing
business.industry
020206 networking & telecommunications
Pattern recognition
Context (language use)
02 engineering and technology
Function (mathematics)
Matrix decomposition
Non-negative matrix factorization
Euclidean distance
Maxima and minima
[INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing
Computer Science::Sound
Convergence (routing)
0202 electrical engineering, electronic engineering, information engineering
020201 artificial intelligence & image processing
Artificial intelligence
business
Divergence (statistics)
Algorithm
[SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing
[SPI.SIGNAL] Engineering Sciences [physics]/Signal and Image processing
Mathematics
Subjects
Details
- Language :
- English
- Database :
- OpenAIRE
- Journal :
- Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2009, Taipei, Taiwan. pp.1545--1548, ICASSP
- Accession number :
- edsair.doi.dedup.....4e5cba21c09fc70aea3b32b0a9cb2034