MODALITY corpus - SPEAKER 05 - SEQUENCE S1

Authors :
Czyżewski, Andrzej
Kostek, Bożena
Bratoszewski, Piotr
Szykulski, Marcin
Kotus, Józef
Zaporowski, Szymon
Spaleniak, Paweł
Odya, Piotr
Publication Year :
2016
Publisher :
Gdańsk University of Technology, 2016.

Abstract

The MODALITY corpus is a multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. Because every utterance is labelled, the corpus can be used to develop an audio-visual speech recognition (AVSR) system. Recordings made in noisy conditions can be used to test the robustness of speech recognition systems.

Details

Database :
OpenAIRE
Accession number :
edsair.r3b099df0bc2..57569a3897ecfd1432112c18d02346ea
Full Text :
https://doi.org/10.34808/wssz-df89