
Decomposing the video editing structure of a talk-show using nonnegative matrix factorization

Authors :
Slim Essid
Cédric Févotte
Département Images, Données, Signal (IDS)
Télécom ParisTech
Signal, Statistique et Apprentissage (S2A)
Laboratoire Traitement et Communication de l'Information (LTCI)
Institut Mines-Télécom [Paris] (IMT)-Télécom Paris
Source :
2012 19th IEEE International Conference on Image Processing (ICIP 2012), Sep 2012, Orlando, United States. pp.3105-3108, ⟨10.1109/ICIP.2012.6467557⟩
Publication Year :
2012
Publisher :
HAL CCSD, 2012.

Abstract

We introduce a novel video structuring scheme that applies nonnegative matrix factorization (NMF) to count data (a bag-of-features representation of the visual stream) to jointly discover latent structuring patterns and their activations in time. Our NMF variant uses the Kullback-Leibler divergence as its cost function and imposes a temporal smoothness constraint on the activations. It is solved by a majorization-minimization technique. Our method is shown to be successful at decomposing the high-level editing structure of talk-shows. It is evaluated on a challenging database of TV political-debate programs and is found to clearly outperform a reference HMM method.
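
To make the factorization in the abstract concrete, the following is a minimal sketch of plain KL-divergence NMF using the standard multiplicative updates, which are themselves a majorization-minimization scheme. It is not the authors' algorithm: the temporal smoothness penalty is omitted because the abstract does not give its form, and the names `kl_nmf`, `V`, `W`, `H` and the synthetic Poisson data are assumptions made for illustration. Here `V` is a features-by-frames count matrix, the columns of `W` play the role of latent patterns, and the rows of `H` are their activations over time.

```python
import numpy as np


def kl_divergence(V, V_hat, eps=1e-12):
    """Generalized KL divergence D(V || V_hat), summed over all entries."""
    V = np.asarray(V, dtype=float)
    return float(np.sum(V * np.log((V + eps) / (V_hat + eps)) - V + V_hat))


def kl_nmf(V, n_components, n_iter=200, seed=0, eps=1e-12):
    """Factorize nonnegative V (features x frames) as W @ H under KL cost.

    Illustrative only: no temporal smoothness term is applied to H.
    """
    rng = np.random.default_rng(seed)
    V = np.asarray(V, dtype=float)
    n_features, n_frames = V.shape
    # Strictly positive initialization keeps the multiplicative updates valid.
    W = rng.random((n_features, n_components)) + 0.1
    H = rng.random((n_components, n_frames)) + 0.1
    history = []
    for _ in range(n_iter):
        # Update activations H with patterns W held fixed.
        V_hat = W @ H + eps
        H *= (W.T @ (V / V_hat)) / (W.sum(axis=0)[:, None] + eps)
        # Update patterns W with activations H held fixed.
        V_hat = W @ H + eps
        W *= ((V / V_hat) @ H.T) / (H.sum(axis=1)[None, :] + eps)
        history.append(kl_divergence(V, W @ H))
    return W, H, history


# Synthetic count data standing in for bag-of-features histograms (assumed).
demo_rng = np.random.default_rng(1)
V_demo = demo_rng.poisson(10.0 * demo_rng.random((20, 3)) @ demo_rng.random((3, 50)))
W_demo, H_demo, cost_history = kl_nmf(V_demo, n_components=3)
```

Each update multiplies the current factor by a nonnegative ratio, so `W` and `H` stay nonnegative without any projection step, and the recorded cost should not increase from one iteration to the next. A smoothness penalty on the rows of `H` would change the `H` update and would need its own derivation.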

Details

Language :
English
Database :
OpenAIRE
Journal :
2012 19th IEEE International Conference on Image Processing (ICIP 2012), Sep 2012, Orlando, United States. pp.3105-3108, ⟨10.1109/ICIP.2012.6467557⟩
Accession number :
edsair.doi.dedup.....4ea2f8a7d2f923add1cac029e61b075f