Back to Search
Start Over
Neurally Optimized Decoder for Low Bitrate Speech Codec.
- Source :
- IEEE Signal Processing Letters; 2022, Vol. 29, p244-248, 5p
- Publication Year :
- 2022
-
Abstract
- Recently, a conventional neural decoder for speech codec has shown promising performance. However, it typically requires some prior knowledge of decoding such as bit allocation or dequantization scheme, which is not a universal solution for many different kinds of speech codecs. In order to address this limitation, we propose a neurally optimized decoder based on a generative model which can directly reconstruct the speech from the bitstream without a prior knowledge. The proposed decoder mainly consists of two components: 1) a dequantization model to group and dequantize related bits from the bitstream and 2) a generative model to restore the speech conditioned on the output of the dequantization model. Through experiments with mixed excitation linear prediction (MELP), Advanced multi-band excitation (AMBE), and SPEEX at around 2.4 kb/s, it is showed that the proposed model showed better performance in most of the objective and subjective evaluation compared to the conventional speech codecs. [ABSTRACT FROM AUTHOR]
Details
- Language :
- English
- ISSN :
- 10709908
- Volume :
- 29
- Database :
- Complementary Index
- Journal :
- IEEE Signal Processing Letters
- Publication Type :
- Academic Journal
- Accession number :
- 155383879
- Full Text :
- https://doi.org/10.1109/LSP.2021.3132557