1. Multimodal dataset of real-time 2D and static 3D MRI of healthy French speakers
- Author
-
Ioannis Douros, Yves Laprie, Pierre-André Vuissoz, Jacques Felblinger, Karyna Isaieva, Justine Leclere, Imagerie Adaptative Diagnostique et Interventionnelle (IADI), Institut National de la Santé et de la Recherche Médicale (INSERM)-Université de Lorraine (UL), Speech Modeling for Facilitating Oral-Based Communication (MULTISPEECH), Inria Nancy - Grand Est, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Department of Natural Language Processing & Knowledge Discovery (LORIA - NLPKD), Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL), Centre Hospitalier Universitaire de Reims (CHU Reims), Centre d'Investigation Clinique - Innovation Technologique [Nancy] (CIC-IT), Centre d'investigation clinique [Nancy] (CIC), Centre Hospitalier Régional Universitaire de Nancy (CHRU Nancy)-Institut National de la Santé et de la Recherche Médicale (INSERM)-Université de Lorraine (UL)-Centre Hospitalier Régional Universitaire de Nancy (CHRU Nancy)-Institut National de la Santé et de la Recherche Médicale (INSERM)-Université de Lorraine (UL), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS), and Isaieva, Karyna
- Subjects
Adult ,Male ,Statistics and Probability ,Data Descriptor ,Oral anatomy ,Speech production ,Computer science ,Science ,Speech recognition ,Context (language use) ,Library and Information Sciences ,01 natural sciences ,[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL] ,030218 nuclear medicine & medical imaging ,Education ,Young Adult ,03 medical and health sciences ,Imaging, Three-Dimensional ,0302 clinical medicine ,Vocal tract images ,0103 physical sciences ,Humans ,Speech ,Segmentation ,Articulatory gestures ,010301 acoustics ,Language ,[SDV.IB] Life Sciences [q-bio]/Bioengineering ,Communication ,Middle Aged ,Magnetic Resonance Imaging ,Computer Science Applications ,Metadata ,[INFO.INFO-CL] Computer Science [cs]/Computation and Language [cs.CL] ,Female ,[SDV.IB]Life Sciences [q-bio]/Bioengineering ,France ,Statistics, Probability and Uncertainty ,Mr images ,Articulation (phonetics) ,Vocal tract ,Information Systems - Abstract
The study of articulatory gestures has a wide spectrum of applications, notably in speech production and recognition. Sets of phonemes, as well as their articulation, are language-specific; however, existing MRI databases mostly include English speakers. In our present work, we introduce a dataset acquired with MRI from 10 healthy native French speakers. A corpus consisting of synthetic sentences was used to ensure a good coverage of the French phonetic context. A real-time MRI technology with temporal resolution of 20 ms was used to acquire vocal tract images of the participants speaking. The sound was recorded simultaneously with MRI, denoised and temporally aligned with the images. The speech was transcribed to obtain phoneme-wise segmentation of sound. We also acquired static 3D MR images for a wide list of French phonemes. In addition, we include annotations of spontaneous swallowing., Measurement(s)Vocal tract images • SpeechTechnology Type(s)Magnetic Resonance Imaging • Microphone DeviceSample Characteristic - OrganismHomo sapiens Machine-accessible metadata file describing the reported data: 10.6084/m9.figshare.16404453
- Published
- 2021
- Full Text
- View/download PDF