Back to Search Start Over

The TA2 Database – A Multi-Modal Database From HomeEntertainment

Authors :
Petr Motlicek
Stefan Duffner
Danil Korchagin
Source :
International Journal of Computer and Electrical Engineering. :670-673
Publication Year :
2012
Publisher :
IACSIT Press, 2012.

Abstract

This paper presents a new database containing high-definition audio and video recordings in a rather unconstrained video-conferencing-like environment. The database consists of recordings of people sitting around a table in two separate rooms communicating and playing online games with each other. Extensive annotation of head positions, voice activity and word transcription has been performed on the dataset, making it especially useful for evaluating automatic speech-recognition, voice activity detection, speaker localisation, multi-face detection and tracking, and other audio-visual analysis algorithms.

Details

ISSN :
17938163
Database :
OpenAIRE
Journal :
International Journal of Computer and Electrical Engineering
Accession number :
edsair.doi...........6a1e2517491a06dcd76c8fa94560976e
Full Text :
https://doi.org/10.7763/ijcee.2012.v4.581