Full-Band LPCNet: A Real-Time Neural Vocoder for 48 kHz Audio With a CPU
- Source :
- IEEE Access, Vol 9, Pp 94923-94933 (2021)
- Publication Year :
- 2021
- Publisher :
- Institute of Electrical and Electronics Engineers (IEEE), 2021.
Abstract
- This paper investigates a neural speech synthesis system that runs in real time on a CPU and synthesizes high-fidelity 48 kHz speech waveforms, covering the entire frequency range audible to humans. Most previous studies on 48 kHz speech synthesis have used traditional source-filter vocoders or a WaveNet vocoder for waveform generation, but these vocoders have drawbacks in synthesis quality or inference speed. LPCNet was proposed as a neural vocoder that runs in real time on a mobile CPU, but its sampling frequency is only 16 kHz. In this paper, we propose Full-band LPCNet, which synthesizes high-fidelity 48 kHz speech waveforms on a CPU by introducing some simple but effective modifications to the conventional LPCNet. We then evaluate the synthesis quality using both normal speech and a singing voice. The results of these experiments demonstrate that the proposed Full-band LPCNet is the only neural vocoder that can synthesize high-quality 48 kHz speech waveforms while maintaining real-time capability on a CPU.
- Subjects :
- Mobile processor
General Computer Science
Computer science
Speech recognition
General Engineering
Full band
LPCNet
Speech synthesis
computer.software_genre
singing voice synthesis
TK1-9971
Frequency conversion
Quality (physics)
speech synthesis
neural vocoder
Waveform
General Materials Science
Electrical engineering. Electronics. Nuclear engineering
text-to-speech
Normal speech
Hardware_REGISTER-TRANSFER-LEVELIMPLEMENTATION
computer
Details
- ISSN :
- 2169-3536
- Volume :
- 9
- Database :
- OpenAIRE
- Journal :
- IEEE Access
- Accession number :
- edsair.doi.dedup.....6ac8d3f65059b4dfc330ad6d91057a71