1. Exploring bat song syllable representations in self-supervised audio encoders
- Author
-
Kloots, Marianne de Heer and Knörnschild, Mirjam
- Subjects
Computer Science - Sound ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning ,Electrical Engineering and Systems Science - Audio and Speech Processing - Abstract
How well can deep learning models trained on human-generated sounds distinguish between another species' vocalization types? We analyze the encoding of bat song syllables in several self-supervised audio encoders, and find that models pre-trained on human speech generate the most distinctive representations of different syllable types. These findings form first steps towards the application of cross-species transfer learning in bat bioacoustics, as well as an improved understanding of out-of-distribution signal processing in audio encoder models., Comment: Presented at VIHAR-2024; see https://vihar-2024.vihar.org/
- Published
- 2024