Energy-efficient MFCC extraction architecture in mixed-signal domain for automatic speech recognition

Authors :: Huifeng Zhu
Qi Wei
Qin Li
Xinjun Liu
Fei Qiao
Huazhong Yang
Source :: NANOARCH
Publication Year :: 2018
Publisher :: ACM, 2018.
Abstract: This paper proposes a novel processing architecture to extract Mel-Frequency Cepstrum Coefficients (MFCC) for automatic speech recognition. Inspired by the human ear, the energy-efficient analog-domain information processing is adopted to replace the energy-intensive Fourier Transform in conventional digital-domain. Moreover, the proposed architecture extracts the acoustic features in the mixed-signal domain, which significantly reduces the cost of Analog-to-Digital Converter (ADC) and the computational complexity. We carry out the circuit-level simulation based on 180nm CMOS technology, which shows an energy consumption of 2.4 nJ/frame, and a processing speed of 45.79 μs/frame. The proposed architecture achieves 97.2% energy saving and about 6.4× speedup than state of the art. Speech recognition simulation reaches the classification accuracy of 99% using the proposed MFCC features.

Subjects :: Speedup
Computer science
Speech recognition
Feature extraction
Frame (networking)
02 engineering and technology
Energy consumption
020202 computer hardware & architecture
ComputingMethodologies_PATTERNRECOGNITION
Frequency domain
0202 electrical engineering, electronic engineering, information engineering
Mel-frequency cepstrum
Energy (signal processing)
Efficient energy use

Database :: OpenAIRE
Journal :: Proceedings of the 14th IEEE/ACM International Symposium on Nanoscale Architectures
Accession number :: edsair.doi...........26d9ca0a08b3300c2c7769d167a80921
Full Text :: https://doi.org/10.1145/3232195.3232219

Full Text Access

Tools