Author: "Poli, Maxime" - Searchworks@Jio Institute Digital Library Search Results

Your search for author "Poli, Maxime" returned 7 results

1. Improving Spoken Language Modeling with Phoneme Classification: A Simple Fine-tuning Approach

Author: Poli, Maxime, Chemla, Emmanuel, and Dupoux, Emmanuel
Subjects: Computer Science - Computation and Language, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Recent progress in Spoken Language Modeling has shown that learning language directly from speech is feasible. Generating speech through a pipeline that operates at the text level typically loses nuances, intonations, and non-verbal vocalizations. Modeling directly from speech opens up the path to more natural and expressive systems. On the other hand, speech-only systems require up to three orders of magnitude more data to catch up to their text-based counterparts in terms of their semantic abilities. We show that fine-tuning speech representation models on phoneme classification leads to more context-invariant representations, and language models trained on these units achieve comparable lexical comprehension to ones trained on a hundred times more data.
Comment: Accepted at EMNLP 2024 main conference. 9 pages, 4 figures
Published: 2024

2. Introducing topography in convolutional neural networks

Author: Poli, Maxime, Dupoux, Emmanuel, and Riad, Rachid
Subjects: Computer Science - Neural and Evolutionary Computing, Computer Science - Machine Learning, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Parts of the brain that carry out sensory tasks are organized topographically: nearby neurons are responsive to the same properties of input signals. Thus, in this work, inspired by the neuroscience literature, we proposed a new topographic inductive bias in Convolutional Neural Networks (CNNs). To achieve this, we introduced a new topographic loss and an efficient implementation to topographically organize each convolutional layer of any CNN. We benchmarked our new method on 4 datasets and 3 models in vision and audio tasks and showed equivalent performance on all benchmarks. We also showcased the generalizability of our topographic loss by showing how it can be used with different topographic organizations in CNNs. Finally, we demonstrated that adding the topographic inductive bias made CNNs more resistant to pruning. Our approach provides a new avenue to obtain models that are more memory efficient while maintaining better accuracy.
Comment: Submitted to ICASSP 2023
Published: 2022

3. Shennong: A Python toolbox for audio speech features extraction

Author: Bernard, Mathieu, Poli, Maxime, Karadayi, Julien, and Dupoux, Emmanuel
Published: 2023
Full Text: View/download PDF

4. Shennong: a Python toolbox for audio speech features extraction

Author: Bernard, Mathieu, Poli, Maxime, Karadayi, Julien, and Dupoux, Emmanuel
Subjects: Computer Science - Computation and Language, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: We introduce Shennong, a Python toolbox and command-line utility for speech features extraction. It implements a wide range of well-established state-of-the-art algorithms including spectro-temporal filters such as Mel-Frequency Cepstral Filterbanks or Predictive Linear Filters, pre-trained neural networks, pitch estimators as well as speaker normalization methods and post-processing algorithms. Shennong is an open source, easy-to-use, reliable and extensible framework. The use of Python makes integration with other speech modeling and machine learning tools easy. It aims to replace or complement several heterogeneous software packages, such as Kaldi or Praat. After describing the Shennong software architecture, its core components and implemented algorithms, this paper illustrates its use on three applications: a comparison of speech feature performance on a phone discrimination task, an analysis of a Vocal Tract Length Normalization model as a function of the speech duration used for training, and a comparison of pitch estimation algorithms under various noise conditions.
Published: 2021
Full Text: View/download PDF

5. Modeling the initial state of early phonetic learning in infants

Author: Poli, Maxime, Schatz, Thomas, Dupoux, Emmanuel, and Lavechin, Marvin
Published: 2024
Full Text: View/download PDF

6. Introducing Topography in Convolutional Neural Networks

Author: Poli, Maxime, Dupoux, Emmanuel, and Riad, Rachid
Published: 2023
Full Text: View/download PDF

7. Valoriser et exploiter un fonds audiovisuel de films historiques et militants : du traitement documentaire au plan de communication. Le cas du Planning Familial

Author: Poli, Maxime, Institut national des techniques de la documentation (INTD-CNAM), Conservatoire National des Arts et Métiers [CNAM] (CNAM), Institut national des techniques de la documentation du CNAM, and Brigitte Guyot
Subjects: Moving image, Association, Audiovisual archives, [SHS.INFO.DOCU]Humanities and Social Sciences/Library and information sciences/domain_shs.info.docu, Planning Familial, Documentary collection, Legal aspects, France, Collection management
Abstract: This thesis presents the successive steps involved in processing and exploiting audiovisual archive collections that originate from the non-profit sector. To do so, we first placed the collection in its historical and political context. We then presented the image market and its audiences. The discovery of film reels at the Planning Familial made it possible to build an audiovisual collection. After an assessment of needs and objectives, the images receive intellectual processing and the rights are clarified. This leads to the production of promotional materials based on archival images and to the implementation of a communication plan consistent with the association's values, in order to publicize the collection and to promote the circulation and visibility of the images.
Published: 2011
