Start Over

Time-Domain Joint Training Strategies of Speech Enhancement and Intent Classification Neural Models

Authors :: Mohamed Nabih Ali
Daniele Falavigna
Alessio Brutti
Source :: Sensors, Vol 22, Iss 1, p 374 (2022)
Publication Year :: 2022
Publisher :: MDPI AG, 2022.
Abstract: Robustness against background noise and reverberation is essential for many real-world speech-based applications. One way to achieve this robustness is to employ a speech enhancement front-end that, independently of the back-end, removes the environmental perturbations from the target speech signal. However, although the enhancement front-end typically increases the speech quality from an intelligibility perspective, it tends to introduce distortions which deteriorate the performance of subsequent processing modules. In this paper, we investigate strategies for jointly training neural models for both speech enhancement and the back-end, which optimize a combined loss function. In this way, the enhancement front-end is guided by the back-end to provide more effective enhancement. Differently from typical state-of-the-art approaches employing on spectral features or neural embeddings, we operate in the time domain, processing raw waveforms in both components. As application scenario we consider intent classification in noisy environments. In particular, the front-end speech enhancement module is based on Wave-U-Net while the intent classifier is implemented as a temporal convolutional network. Exhaustive experiments are reported on versions of the Fluent Speech Commands corpus contaminated with noises from the Microsoft Scalable Noisy Speech Dataset, shedding light and providing insight about the most promising training approaches.

Subjects :: joint training
speech enhancement
intent classification
Chemical technology
TP1-1185

Details

Language :: English
ISSN :: 14248220
Volume :: 22
Issue :: 1
Database :: Directory of Open Access Journals
Journal :: Sensors
Publication Type :: Academic Journal
Accession number :: edsdoj.5084c273c0bb427aabe1d176ba50ec46
Document Type :: article
Full Text :: https://doi.org/10.3390/s22010374

Full Text Access

View/download PDF

Tools

Email
Cite

Printer

Authors Abstract Subjects Details

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Time-Domain Joint Training Strategies of Speech Enhancement and Intent Classification Neural Models

Abstract

Subjects

Details

Tools

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Time-Domain Joint Training Strategies of Speech Enhancement and Intent Classification Neural Models

Abstract

Subjects

Details

Tools

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources