Start Over

Audiovisual head orientation estimation with particle filtering in multisensor scenarios

Authors :: Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
Universitat Politècnica de Catalunya. GPI - Grup de Processament d'Imatge i Vídeo
Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
Canton Ferrer, Cristian
Segura Perales, Carlos
Casas Pla, Josep Ramon
Pardàs Feliu, Montse
Hernando Pericás, Francisco Javier
Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
Universitat Politècnica de Catalunya. GPI - Grup de Processament d'Imatge i Vídeo
Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
Canton Ferrer, Cristian
Segura Perales, Carlos
Casas Pla, Josep Ramon
Pardàs Feliu, Montse
Hernando Pericás, Francisco Javier
Publication Year :: 2008
Abstract: This article presents a multimodal approach to head pose estimation of individuals in environments equipped with multiple cameras and microphones, such as SmartRooms or automatic video conferencing. Determining the individuals head orientation is the basis for many forms of more sophisticated interactions between humans and technical devices and can also be used for automatic sensor selection (camera, microphone) in communications or video surveillance systems. The use of particle filters as a unified framework for the estimation of the head orientation for both monomodal and multimodal cases is proposed. In video, we estimate head orientation from color information by exploiting spatial redundancy among cameras. Audio information is processed to estimate the direction of the voice produced by a speaker making use of the directivity characteristics of the head radiation pattern. Furthermore, two different particle filter multimodal information fusion schemes for combining the audio and video streams are analyzed in terms of accuracy and robustness. In the first one, fusion is performed at a decision level by combining each monomodal head pose estimation, while the second one uses a joint estimation system combining information at data level. Experimental results conducted over the CLEAR 2006 evaluation database are reported and the comparison of the proposed multimodal head pose estimation algorithms with the reference monomodal approaches proves the effectiveness of the proposed approach.<br />Postprint (published version)

Details

Database :: OAIster
Notes :: 12 p., application/pdf, English
Publication Type :: Electronic Resource
Accession number :: edsoai.on1132951349
Document Type :: Electronic Resource

Tools

Email
Cite

Printer

Authors Abstract Subjects Details

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Audiovisual head orientation estimation with particle filtering in multisensor scenarios

Abstract

Details

Tools

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Audiovisual head orientation estimation with particle filtering in multisensor scenarios

Abstract

Details

Tools

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources