Back to Search Start Over

Active few-shot learning for rare bioacoustic feature annotation

Authors :
Ben McEwen
Kaspar Soltero
Stefanie Gutschmidt
Andrew Bainbridge-Smith
James Atlas
Richard Green
Source :
Ecological Informatics, Vol 82, Iss , Pp 102734- (2024)
Publication Year :
2024
Publisher :
Elsevier, 2024.

Abstract

The collection and annotation of bioacoustic data present several challenges to researchers. Bioacoustic monitoring of rare (sparse) or cryptic species generally encounter two main issues. The cost of collecting and processing field data and a lack of labelled datasets for the target species. The detection of invasive species incursions and probability of absence testing is especially challenging due to these species having population densities at or close to zero. We present a methodology specifically designed to aid in the analysis of rare acoustic events within long-term field recordings. This approach combines a wavelet-based segmentation method that automatically extracts transient features from within-field recordings. A few-shot active learning recommender system in a human-in-the-loop process prioritises the annotation of low-certainty samples. This process combines the accuracy of human classification and the speed of computational tools to greatly reduce the presence of non-target features in field recordings. We evaluate this approach using an invasive species identification case study. This methodology achieves a test accuracy of 98.4% as well as 81.2% test accuracy using 2-shot, 2-way prototypical learning without fine-tuning, demonstrating high performance at varying data availability contexts. Active learning using low-certainty samples achieves >90% test accuracy using only 20 training samples compared to 80 samples without active learning. This approach allows users to train custom audio classification models for any application with rare features. The model can be easily exported for use in the field making real-time bioacoustic monitoring of less-vocal species a possibility. All code and data are available at https://github.com/Listening-Lab/Annotator.

Details

Language :
English
ISSN :
15749541
Volume :
82
Issue :
102734-
Database :
Directory of Open Access Journals
Journal :
Ecological Informatics
Publication Type :
Academic Journal
Accession number :
edsdoj.3fc5080494f243f99900c181b7811a8b
Document Type :
article
Full Text :
https://doi.org/10.1016/j.ecoinf.2024.102734