Back to Search
Start Over
Discriminative importance weighting of augmented training data for acoustic model training
- Source :
- ICASSP, 42th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), 42th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), Mar 2017, New Orleans, United States
- Publication Year :
- 2017
- Publisher :
- IEEE, 2017.
-
Abstract
- Added missing sign in equations (2) and (3) + explanation about iteration 1 in Fig. 1; International audience; DNN based acoustic models require a large amount of training data. Parametric data augmentation techniques such as adding noise, reverberation, or changing the speech rate, are often employed to boost the dataset size and the ASR performance. The choice of augmentation techniques and the associated parameters has been handled heuristically so far. In this work we propose an algorithm to automatically weight data perturbed using a variety of augmentation techniques and/or parameters. The weights are learned in a discriminative fashion so as to minimize the frame error rate using the standard gradient descent algorithm in an iterative manner. Experiments were performed using the CHiME-3 dataset. Data augmentation was done by adding noise at different SNRs. A relative WER improvement of 15% was obtained with the proposed data weighting algorithm compared to the unweighted augmented dataset. Interestingly, the resulting distribution of SNRs in the weighted training set differs significantly from that of the test set.
- Subjects :
- [INFO.INFO-TS] Computer Science [cs]/Signal and Image Processing
Computer science
Speech recognition
02 engineering and technology
feature simulation
Data modeling
[INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing
ASR
CHiME
Discriminative model
0202 electrical engineering, electronic engineering, information engineering
Parametric statistics
Training set
business.industry
Acoustic model
020206 networking & telecommunications
Pattern recognition
Weighting
Noise
Test set
020201 artificial intelligence & image processing
Artificial intelligence
business
Gradient descent
data augmentation
DNN
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
- Accession number :
- edsair.doi.dedup.....3ca43664cc301ad6dedd70c9989e50fc
- Full Text :
- https://doi.org/10.1109/icassp.2017.7953085