Back to Search Start Over

Deep learning for unsupervised feature extraction in audio signals: Monaural source separation

Authors :
Edward T. Nykaza
Tim Oates
Matthew G. Blevins
Anton Netchaev
Steven L. Bunkley
Arnold P. Boedihardjo
Zhiguang Wang
Source :
The Journal of the Acoustical Society of America. 140:3424-3424
Publication Year :
2016
Publisher :
Acoustical Society of America (ASA), 2016.

Abstract

Deep learning is becoming ubiquitous; it is the underlying and driving force behind many heavily embedded technologies in society (e.g., search engines, fraud detection warning systems, and social-media facial recognition algorithms). Over the past few years there has been a steady increase in the number of audio related applications of deep learning. Recently, Nykaza et al. presented a pedagogical approach to understanding how the hidden layers recreate, separate, and classify environmental noise signals. That work presented some feature extraction examples using simple pure tone, chord, and environmental noise datasets. In this paper, we build upon this recent analysis and expand the datasets to include more realistic representations of those datasets with the inclusion of noise and overlapping signals. Additionally, we consider other related architectures (e.g., variant-autoencoders, recurrent neural networks, and fixing hidden nodes/layers), explore their advantages/drawbacks, and provide insights on ...

Details

ISSN :
00014966
Volume :
140
Database :
OpenAIRE
Journal :
The Journal of the Acoustical Society of America
Accession number :
edsair.doi...........eacad48dd5285abe09c65c15311cb310
Full Text :
https://doi.org/10.1121/1.4971016