Back to Search Start Over

MODALITY corpus - SPEAKER 35 - COMMANDS C3

Authors :
Czyżewski, Andrzej
Kostek, Bożena
Bratoszewski, Piotr
Szykulski, Marcin
Kotus, Józef
Zaporowski, Szymon
Spaleniak, Paweł
Odya, Piotr
Publication Year :
2016
Publisher :
Gdańsk University of Technology, 2016.

Abstract

The MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system, as every utterance was labelled. Recordings in noisy conditions can be used to test the robustness of speech recognition systems.

Details

Database :
OpenAIRE
Accession number :
edsair.r3b099df0bc2..ef9978bd22a3b5765d01c421d3b2dcac
Full Text :
https://doi.org/10.34808/gwrz-t064