1. A Multimodal Real-Time MRI Articulatory Corpus of French for Speech Research
- Author
-
Arun A. Joseph, Jens Frahm, Dirk Voit, Yves Laprie, Karyna Isaieva, Freddy Odille, Ioannis Douros, Anastasiia Tsukanova, Jacques Felblinger, Pierre-André Vuissoz, Laprie, Yves, Speech Modeling for Facilitating Oral-Based Communication (MULTISPEECH), Inria Nancy - Grand Est, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Department of Natural Language Processing & Knowledge Discovery (LORIA - NLPKD), Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS), Imagerie Adaptative Diagnostique et Interventionnelle (IADI), Institut National de la Santé et de la Recherche Médicale (INSERM)-Université de Lorraine (UL), Biomedizinische NMR Forschungs GmbH [Göttingen], Max-Planck-Institut für Biophysikalische Chemie - Max Planck Institute for Biophysical Chemistry [Göttingen], Max-Planck-Gesellschaft-Max-Planck-Gesellschaft, and Université de Lorraine (UL)-Institut National de la Santé et de la Recherche Médicale (INSERM)
- Subjects
Larynx ,[INFO.INFO-AI] Computer Science [cs]/Artificial Intelligence [cs.AI] ,Speech production ,Epiglottis ,Computer science ,speech production ,Speech recognition ,02 engineering and technology ,speech syn- thesis ,[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI] ,030507 speech-language pathology & audiology ,03 medical and health sciences ,Tongue ,0202 electrical engineering, electronic engineering, information engineering ,medicine ,Index Terms: speech corpus ,Spontaneous speech ,multi-modal database ,020206 networking & telecommunications ,Real-time MRI ,medicine.anatomical_structure ,Duration (music) ,French language ,real-time MRI data ,0305 other medical science ,Vocal tract ,3D MRI data - Abstract
International audience; In this work we describe the creation of ArtSpeechMRIfr: a real-time as well as static magnetic resonance imaging (rtMRI, 3D MRI) database of the vocal tract. The database contains also processed data: denoised audio, its phonetically aligned annotation, articulatory contours, and vocal tract volume information , which provides a rich resource for speech research. The database is built on data from two male speakers of French. It covers a number of phonetic contexts in the controlled part, as well as spontaneous speech, 3D MRI scans of sustained vocalic articulations, and of the dental casts of the subjects. The corpus for rtMRI consists of 79 synthetic sentences constructed from a phonetized dictionary that makes possible to shorten the duration of acquisitions while keeping a very good coverage of the phonetic contexts which exist in French. The 3D MRI includes acquisitions for 12 French vowels and 10 consonants, each of which was pronounced in several vocalic contexts. Ar-ticulatory contours (tongue, jaw, epiglottis, larynx, velum, lips) as well as 3D volumes were manually drawn for a part of the images.
- Published
- 2019