Back to Search
Start Over
Nonnegative Tucker Decomposition with Beta-divergence for Music Structure Analysis of audio signals
- Source :
- GRETSI, XXVIIIème Colloque Francophone de Traitement du Signal et des Images, GRETSI, XXVIIIème Colloque Francophone de Traitement du Signal et des Images, Sep 2022, Nancy, France, HAL
- Publication Year :
- 2022
- Publisher :
- HAL CCSD, 2022.
-
Abstract
- Nonnegative Tucker decomposition (NTD), a tensor decomposition model, has received increased interest in the recent years because of its ability to blindly extract meaningful patterns, in particular in Music Information Retrieval. Nevertheless, existing algorithms to compute NTD are mostly designed for the Euclidean loss. This work proposes a multiplicative updates algorithm to compute NTD with the beta-divergence loss, often considered a better loss for audio processing. We notably show how to implement efficiently the multiplicative rules using tensor algebra. Finally, we show on a music structure analysis task that unsupervised NTD fitted with beta-divergence loss outperforms earlier results obtained with the Euclidean loss.<br />Comment: 4 pages, 2 figures, 1 table, 1 algorithm. To be published in GRETSI2022. The algorithm is available at https://gitlab.inria.fr/amarmore/nonnegative-factorization
- Subjects :
- FOS: Computer and information sciences
Computer Science - Machine Learning
Sound (cs.SD)
ACM: H.: Information Systems/H.5: INFORMATION INTERFACES AND PRESENTATION (e.g., HCI)/H.5.5: Sound and Music Computing
G.1.6
H.5.5
Numerical Analysis (math.NA)
[INFO.INFO-NA]Computer Science [cs]/Numerical Analysis [cs.NA]
Computer Science - Sound
Machine Learning (cs.LG)
[INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing
Audio and Speech Processing (eess.AS)
ACM: G.: Mathematics of Computing/G.1: NUMERICAL ANALYSIS/G.1.6: Optimization
[INFO.INFO-SD]Computer Science [cs]/Sound [cs.SD]
FOS: Mathematics
FOS: Electrical engineering, electronic engineering, information engineering
Mathematics - Numerical Analysis
15-04
Electrical Engineering and Systems Science - Audio and Speech Processing
Subjects
Details
- Language :
- English
- Database :
- OpenAIRE
- Journal :
- GRETSI, XXVIIIème Colloque Francophone de Traitement du Signal et des Images, GRETSI, XXVIIIème Colloque Francophone de Traitement du Signal et des Images, Sep 2022, Nancy, France, HAL
- Accession number :
- edsair.doi.dedup.....116daea96f0a84d7fd7b228b4766894b