Back to Search Start Over

Generative Speech Recognition Error Correction with Large Language Models and Task-Activating Prompting

Authors :
Yang, Chao-Han Huck
Gu, Yile
Liu, Yi-Chieh
Ghosh, Shalini
Bulyko, Ivan
Stolcke, Andreas
Source :
Proc. IEEE ASRU Workshop, Dec. 2023
Publication Year :
2023

Abstract

We explore the ability of large language models (LLMs) to act as speech recognition post-processors that perform rescoring and error correction. Our first focus is on instruction prompting to let LLMs perform these task without fine-tuning, for which we evaluate different prompting schemes, both zero- and few-shot in-context learning, and a novel task activation prompting method that combines causal instructions and demonstration to increase its context windows. Next, we show that rescoring only by in-context learning with frozen LLMs achieves results that are competitive with rescoring by domain-tuned LMs, using a pretrained first-pass recognition system and rescoring output on two out-of-domain tasks (ATIS and WSJ). By combining prompting techniques with fine-tuning we achieve error rates below the N-best oracle level, showcasing the generalization power of the LLMs.<br />Comment: Accepted to IEEE Automatic Speech Recognition and Understanding (ASRU) 2023. 8 pages. 2nd version revised from Sep 29th's version

Details

Database :
arXiv
Journal :
Proc. IEEE ASRU Workshop, Dec. 2023
Publication Type :
Report
Accession number :
edsarx.2309.15649
Document Type :
Working Paper
Full Text :
https://doi.org/10.1109/ASRU57964.2023.10389673