Back to Search Start Over

Random Cycle Loss and Its Application to Voice Conversion.

Authors :
Sun H
Wang D
Li L
Chen C
Zheng TF
Source :
IEEE transactions on pattern analysis and machine intelligence [IEEE Trans Pattern Anal Mach Intell] 2023 Aug; Vol. 45 (8), pp. 10331-10345. Date of Electronic Publication: 2023 Jun 30.
Publication Year :
2023

Abstract

Speech disentanglement aims to decompose independent causal factors of speech signals into separate codes. Perfect disentanglement benefits to a broad range of speech processing tasks. This paper presents a simple but effective disentanglement approach based on cycle consistency loss and random factor substitution. This leads to a novel random cycle (RC) loss that enforces analysis-and-resynthesis consistency, a main principle of reductionism. We theoretically demonstrate that the proposed RC loss can achieve independent codes if well optimized, which in turn leads to superior disentanglement when combined with information bottleneck (IB). Extensive simulation experiments were conducted to understand the properties of the RC loss, and experimental results on voice conversion further demonstrate the practical merit of the proposal. Source code and audio samples can be found on the webpage http://rc.cslt.org.

Details

Language :
English
ISSN :
1939-3539
Volume :
45
Issue :
8
Database :
MEDLINE
Journal :
IEEE transactions on pattern analysis and machine intelligence
Publication Type :
Academic Journal
Accession number :
37030720
Full Text :
https://doi.org/10.1109/TPAMI.2023.3257839