MODALITY corpus - SPEAKER 17 - COMMANDS C5

Authors :
Czyżewski, Andrzej
Kostek, Bożena
Bratoszewski, Piotr
Szykulski, Marcin
Kotus, Józef
Zaporowski, Szymon
Spaleniak, Paweł
Odya, Piotr
Publication Year :
2016
Publisher :
Gdańsk University of Technology, 2016.

Abstract

The MODALITY corpus is a multimodal database of word recordings in English. It consists of over 30 hours of recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. Because every utterance is labelled, the corpus can be used to develop audio-visual speech recognition (AVSR) systems. Recordings made in noisy conditions can be used to test the robustness of speech recognition systems.

Details

Database :
OpenAIRE
Accession number :
edsair.r3b099df0bc2..0167a83834f7f069e803ce56ad2ae816
Full Text :
https://doi.org/10.34808/paf5-a642