Investigating Effective Speaker Property Privacy Protection in Federated Learning for Speech Emotion Recognition

Authors :: Tan, Chao
Li, Sheng
Cao, Yang
Ren, Zhao
Schultz, Tanja
Publication Year :: 2024
Abstract: Federated Learning (FL) is a privacy-preserving approach that allows servers to aggregate distributed models transmitted from local clients rather than training on user data. More recently, FL has been applied to Speech Emotion Recognition (SER) for secure human-computer interaction applications. Recent research has found that FL is still vulnerable to inference attacks. To this end, this paper focuses on investigating the security of FL for SER concerning property inference attacks. We propose a novel method to protect the property information in speech data by decomposing various properties in the sound and adding perturbations to these properties. Our experiments show that the proposed method offers better privacy-utility trade-offs than existing methods. The trade-offs enable more effective attack prevention while maintaining similar FL utility levels. This work can guide future work on privacy protection methods in speech processing.

Subjects :: Electrical Engineering and Systems Science - Audio and Speech Processing
Computer Science - Sound

Tools