Back to Search Start Over

A case study of machine learning hardware: Real-time source separation using Markov Random Fields via sampling-based inference

Authors :
Rob A. Rutenbar
Glenn G. Ko
Source :
ICASSP
Publication Year :
2017
Publisher :
IEEE, 2017.

Abstract

We explore sound source separation to isolate human voice from background noise on mobile phones, e.g. talking on your cell phone in an airport. The challenges involved are real-time execution and power constraints. As a solution, we present a novel hardware-based sound source separation implementation capable of real-time streaming performance. The implementation uses a recently introduced Markov Random Field (MRF) inference formulation of foreground/background separation, and targets voice separation on mobile phones with two microphones. We demonstrate a real-time streaming FPGA implementation running at 150 MHz with total of 207 KB RAM. Our implementation achieves a speedup of 20× over a conventional software implementation, achieves an SDR of 6.655 dB with 1.601 ms latency, and exhibits excellent perceived audio quality. A virtual ASIC design shows that this architecture is quite small (less than 10M gates), consumes only 69.977 mW running at 20 MHz (52× less than an ARM Cortex-A9 software reference), and appears amenable to additional optimization for power.

Details

Database :
OpenAIRE
Journal :
2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Accession number :
edsair.doi...........5c7eec47eeecf40bf69eace27e9d0ca9
Full Text :
https://doi.org/10.1109/icassp.2017.7952602