Back to Search Start Over

Neurally Optimized Decoder for Low Bitrate Speech Codec.

Authors :
Kim, Hyung Yong
Yoon, Ji Won
Cho, Won Ik
Kim, Nam Soo
Source :
IEEE Signal Processing Letters; 2022, Vol. 29, p244-248, 5p
Publication Year :
2022

Abstract

Recently, a conventional neural decoder for speech codec has shown promising performance. However, it typically requires some prior knowledge of decoding such as bit allocation or dequantization scheme, which is not a universal solution for many different kinds of speech codecs. In order to address this limitation, we propose a neurally optimized decoder based on a generative model which can directly reconstruct the speech from the bitstream without a prior knowledge. The proposed decoder mainly consists of two components: 1) a dequantization model to group and dequantize related bits from the bitstream and 2) a generative model to restore the speech conditioned on the output of the dequantization model. Through experiments with mixed excitation linear prediction (MELP), Advanced multi-band excitation (AMBE), and SPEEX at around 2.4 kb/s, it is showed that the proposed model showed better performance in most of the objective and subjective evaluation compared to the conventional speech codecs. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
10709908
Volume :
29
Database :
Complementary Index
Journal :
IEEE Signal Processing Letters
Publication Type :
Academic Journal
Accession number :
155383879
Full Text :
https://doi.org/10.1109/LSP.2021.3132557