Language and noise transfer in speech enhancement generative adversarial network

Authors :
Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
Pascual de la Puente, Santiago
Park, Maruchan
Serra, Joan
Bonafonte Cávez, Antonio
Ahn, Kang-hun
Publication Year :
2018

Abstract

©2018 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

Speech enhancement deep learning systems usually require large amounts of training data to operate in broad conditions or real applications. This makes the adaptability of those systems to new, low-resource environments an important topic. In this work, we present the results of adapting a speech enhancement generative adversarial network by fine-tuning the generator with small amounts of data. We investigate the minimum requirements to obtain stable behavior in terms of several objective metrics in two very different languages: Catalan and Korean. We also study the variability of test performance on unseen noise as a function of the number of different noise types available for training. Results show that adapting a pre-trained English model with 10 min of data already achieves performance comparable to that obtained with two orders of magnitude more data. They also demonstrate the relative stability of test performance with respect to the number of training noise types.

Peer Reviewed
Postprint (published version)

Details

Database :
OAIster
Notes :
5 p., application/pdf, English
Publication Type :
Electronic Resource
Accession number :
edsoai.on1073028389
Document Type :
Electronic Resource