Back to Search
Start Over
A Generative Method for a Laryngeal Biosignal.
- Source :
-
Journal of voice : official journal of the Voice Foundation [J Voice] 2024 Feb 22. Date of Electronic Publication: 2024 Feb 22. - Publication Year :
- 2024
- Publisher :
- Ahead of Print
-
Abstract
- The Glottal Area Waveform (GAW) is an important component in quantitative clinical voice assessment, providing valuable insights into vocal fold function. In this study, we introduce a novel method employing Variational Autoencoders (VAEs) to generate synthetic GAWs. Our approach enables the creation of synthetic GAWs that closely replicate real-world data, offering a versatile tool for researchers and clinicians. We elucidate the process of manipulating the VAE latent space using the Glottal Opening Vector (GlOVe). The GlOVe allows precise control over the synthetic closure and opening of the vocal folds. By utilizing the GlOVe, we generate synthetic laryngeal biosignals. These biosignals accurately reflect vocal fold behavior, allowing for the emulation of realistic glottal opening changes. This manipulation extends to the introduction of arbitrary oscillations in the vocal folds, closely resembling real vocal fold oscillations. The range of factor coefficient values enables the generation of diverse biosignals with varying frequencies and amplitudes. Our results demonstrate that this approach yields highly accurate laryngeal biosignals, with the Normalized Mean Absolute Error values for various frequencies ranging from 9.6 ⋅ 10 <superscript>-3</superscript> to 1.20 ⋅ 10 <superscript>-2</superscript> for different experimented frequencies, alongside a remarkable training effectiveness, reflected in reductions of up to approximately 89.52% in key loss components. This proposed method may have implications for downstream speech synthesis and phonetics research, offering the potential for advanced and natural-sounding speech technologies.<br />Competing Interests: Declaration of Competing Interest The authors (Mahdi Darvish and Andreas M Kist) declare that no conflict of interest exists.<br /> (Copyright © 2024 The Authors. Published by Elsevier Inc. All rights reserved.)
Details
- Language :
- English
- ISSN :
- 1873-4588
- Database :
- MEDLINE
- Journal :
- Journal of voice : official journal of the Voice Foundation
- Publication Type :
- Academic Journal
- Accession number :
- 38395653
- Full Text :
- https://doi.org/10.1016/j.jvoice.2024.01.016