Back to Search Start Over

MULTI-LABEL BIRD SPECIES CLASSIFICATION USING SEQUENTIAL AGGREGATION STRATEGY FROM AUDIO RECORDINGS.

Authors :
ABDUL KAREEM, Noumida
RAJAN, Rajeev
Source :
Computing & Informatics; 2023, Vol. 42 Issue 5, p1255-1280, 26p
Publication Year :
2023

Abstract

Birds are excellent bioindicators, playing a vital role in maintaining the delicate balance of ecosystems. Identifying species from bird vocalization is arduous but has high research gain. The paper focuses on the detection of multiple bird vocalizations from recordings. The proposed work uses a deep convolutional neural network (DCNN) and a recurrent neural network (RNN) architecture to learn the bird's vocalization from mel-spectrogram and mel-frequency cepstral coefficient (MFCC), respectively. We adopted a sequential aggregation strategy to make a decision on an audio file. We normalized the aggregated sigmoid probabilities and considered the nodes with the highest scores to be the target species. We evaluated the proposed methods on the Xeno-canto bird sound database, which comprises ten species. We compared the performance of our approach to that of transfer learning and Vanilla-DNN methods. Notably, the proposed DCNN and VGG-16 models achieved average F1 metrics of 0.75 and 0.65, respectively, outperforming the acoustic cue-based Vanilla-DNN approach. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
13359150
Volume :
42
Issue :
5
Database :
Supplemental Index
Journal :
Computing & Informatics
Publication Type :
Academic Journal
Accession number :
176169550
Full Text :
https://doi.org/10.31577/cai_2023_5_1255