Back to Search
Start Over
Random Cycle Loss and Its Application to Voice Conversion.
- Source :
-
IEEE transactions on pattern analysis and machine intelligence [IEEE Trans Pattern Anal Mach Intell] 2023 Aug; Vol. 45 (8), pp. 10331-10345. Date of Electronic Publication: 2023 Jun 30. - Publication Year :
- 2023
-
Abstract
- Speech disentanglement aims to decompose independent causal factors of speech signals into separate codes. Perfect disentanglement benefits to a broad range of speech processing tasks. This paper presents a simple but effective disentanglement approach based on cycle consistency loss and random factor substitution. This leads to a novel random cycle (RC) loss that enforces analysis-and-resynthesis consistency, a main principle of reductionism. We theoretically demonstrate that the proposed RC loss can achieve independent codes if well optimized, which in turn leads to superior disentanglement when combined with information bottleneck (IB). Extensive simulation experiments were conducted to understand the properties of the RC loss, and experimental results on voice conversion further demonstrate the practical merit of the proposal. Source code and audio samples can be found on the webpage http://rc.cslt.org.
- Subjects :
- Software
Computer Simulation
Algorithms
Speech
Subjects
Details
- Language :
- English
- ISSN :
- 1939-3539
- Volume :
- 45
- Issue :
- 8
- Database :
- MEDLINE
- Journal :
- IEEE transactions on pattern analysis and machine intelligence
- Publication Type :
- Academic Journal
- Accession number :
- 37030720
- Full Text :
- https://doi.org/10.1109/TPAMI.2023.3257839