Back to Search Start Over

A Scalogram-Based CNN Approach for Audio Classification in Construction Sites

Authors :
Michele Scarpiniti
Raffaele Parisi
Yong-Cheol Lee
Source :
Applied Sciences, Vol 14, Iss 1, p 90 (2023)
Publication Year :
2023
Publisher :
MDPI AG, 2023.

Abstract

The automatic monitoring of activities in construction sites through the proper use of acoustic signals is a recent field of research that is currently in continuous evolution. In particular, the use of techniques based on Convolutional Neural Networks (CNNs) working on the spectrogram of the signal or its mel-scale variants was demonstrated to be quite successful. Nevertheless, the spectrogram has some limitations, which are due to the intrinsic trade-off between temporal and spectral resolutions. In order to overcome these limitations, in this paper, we propose employing the scalogramas a proper time–frequency representation of the audio signal. The scalogram is defined as the square modulus of the Continuous Wavelet Transform (CWT) and is known as a powerful tool for analyzing real-world signals. Experimental results, obtained on real-world sounds recorded in construction sites, have demonstrated the effectiveness of the proposed approach, which is able to clearly outperform most state-of-the-art solutions.

Details

Language :
English
ISSN :
20763417
Volume :
14
Issue :
1
Database :
Directory of Open Access Journals
Journal :
Applied Sciences
Publication Type :
Academic Journal
Accession number :
edsdoj.9d144b8f02cb4cf5b551152a594415b4
Document Type :
article
Full Text :
https://doi.org/10.3390/app14010090