Back to Search
Start Over
MODALITY corpus - SPEAKER 17 - COMMANDS C5
- Publication Year :
- 2016
- Publisher :
- Gdańsk University of Technology, 2016.
-
Abstract
- The MODALITY corpus is one of the multimodal database of word recordings in English. It consists of over 30 hours of multimodal recordings. The database contains high-resolution, high-framerate stereoscopic video streams and audio signals obtained from a microphone array and a laptop microphone. The corpus can be employed to develop an AVSR system, as every utterance was labelled. Recordings in noisy conditions can be used to test the robustness of speech recognition systems.
- Subjects :
- language
native speakers
multimodal
recordings
Subjects
Details
- Database :
- OpenAIRE
- Accession number :
- edsair.r3b099df0bc2..0167a83834f7f069e803ce56ad2ae816
- Full Text :
- https://doi.org/10.34808/paf5-a642