Neural Pitch-Shifting and Time-Stretching with Controllable LPCNet

Authors :
Morrison, Max
Jin, Zeyu
Bryan, Nicholas J.
Caceres, Juan-Pablo
Pardo, Bryan
Publication Year :
2021

Abstract

Modifying the pitch and timing of an audio signal are fundamental audio editing operations with applications in speech manipulation, audio-visual synchronization, and singing voice editing and synthesis. Thus far, methods for pitch-shifting and time-stretching that use digital signal processing (DSP) have been favored over deep learning approaches due to their speed and relatively higher quality. However, even existing DSP-based methods for pitch-shifting and time-stretching induce artifacts that degrade audio quality. In this paper, we propose Controllable LPCNet (CLPCNet), an improved LPCNet vocoder capable of pitch-shifting and time-stretching of speech. For objective evaluation, we show that CLPCNet performs pitch-shifting of speech on unseen datasets with high accuracy relative to prior neural methods. For subjective evaluation, we demonstrate that the quality and naturalness of pitch-shifting and time-stretching with CLPCNet on unseen datasets meet or exceed those of competitive neural- or DSP-based approaches.

Comment: Submitted to ICASSP 2022

Details

Database :
arXiv
Publication Type :
Report
Accession number :
edsarx.2110.02360
Document Type :
Working Paper