1. A 28-nm 1.3-mW Speech-to-Text Accelerator for Edge AI Devices
- Author
-
Tsai, Yu-Hsuan, Lin, Yi-Cheng, Chen, Wen-Ching, Lin, Liang-Yi, Chang, Nian-Shyang, Lin, Chun-Pin, Chen, Shi-Hao, Chen, Chi-Shi, and Yang, Chia-Hsiang
- Abstract
Speech-to-text conversion has been extensively deployed for a variety of applications. To implement speech-to-text conversion on energy-constrained edge devices, a hybrid algorithm is adopted in this work. A bidirectional recurrent neural network (BRNN), composed of the light gated recurrent units (LiGRUs), is included to achieve a high speech-to-text accuracy with fewer network parameters. A network compression scheme, including scaling factor pruning (SFP), multi-bit clustering (MBC), and linear quantization (LQ), is proposed to minimize the complexity of the BRNN. The network size and the computational complexity are reduced by
$29.8\times $ $73.2\times $ $177\times $ $50\times $ - Published
- 2024
- Full Text
- View/download PDF