Back to Search Start Over

A Multimodal Real-Time MRI Articulatory Corpus of French for Speech Research

Authors :
Arun A. Joseph
Jens Frahm
Dirk Voit
Yves Laprie
Karyna Isaieva
Freddy Odille
Ioannis Douros
Anastasiia Tsukanova
Jacques Felblinger
Pierre-André Vuissoz
Laprie, Yves
Speech Modeling for Facilitating Oral-Based Communication (MULTISPEECH)
Inria Nancy - Grand Est
Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Department of Natural Language Processing & Knowledge Discovery (LORIA - NLPKD)
Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA)
Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA)
Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)
Imagerie Adaptative Diagnostique et Interventionnelle (IADI)
Institut National de la Santé et de la Recherche Médicale (INSERM)-Université de Lorraine (UL)
Biomedizinische NMR Forschungs GmbH [Göttingen]
Max-Planck-Institut für Biophysikalische Chemie - Max Planck Institute for Biophysical Chemistry [Göttingen]
Max-Planck-Gesellschaft-Max-Planck-Gesellschaft
Université de Lorraine (UL)-Institut National de la Santé et de la Recherche Médicale (INSERM)
Source :
Interspeech 2019 20th Ann Conf Intern Speech Comm Ass, INTERSPEECH 2019-20th Annual Conference of the International Speech Communication Association, INTERSPEECH 2019-20th Annual Conference of the International Speech Communication Association, Sep 2019, Graz, Austria, INTERSPEECH, HAL
Publication Year :
2019
Publisher :
HAL CCSD, 2019.

Abstract

International audience; In this work we describe the creation of ArtSpeechMRIfr: a real-time as well as static magnetic resonance imaging (rtMRI, 3D MRI) database of the vocal tract. The database contains also processed data: denoised audio, its phonetically aligned annotation, articulatory contours, and vocal tract volume information , which provides a rich resource for speech research. The database is built on data from two male speakers of French. It covers a number of phonetic contexts in the controlled part, as well as spontaneous speech, 3D MRI scans of sustained vocalic articulations, and of the dental casts of the subjects. The corpus for rtMRI consists of 79 synthetic sentences constructed from a phonetized dictionary that makes possible to shorten the duration of acquisitions while keeping a very good coverage of the phonetic contexts which exist in French. The 3D MRI includes acquisitions for 12 French vowels and 10 consonants, each of which was pronounced in several vocalic contexts. Ar-ticulatory contours (tongue, jaw, epiglottis, larynx, velum, lips) as well as 3D volumes were manually drawn for a part of the images.

Details

Language :
English
Database :
OpenAIRE
Journal :
Interspeech 2019 20th Ann Conf Intern Speech Comm Ass, INTERSPEECH 2019-20th Annual Conference of the International Speech Communication Association, INTERSPEECH 2019-20th Annual Conference of the International Speech Communication Association, Sep 2019, Graz, Austria, INTERSPEECH, HAL
Accession number :
edsair.doi.dedup.....998f6cff80441f7a27a090aab8f59c77