Back to Search
Start Over
Acoustic Scene Clustering Using Joint Optimization of Deep Embedding Learning and Clustering Iteration
- Publication Year :
- 2023
-
Abstract
- Recent efforts have been made on acoustic scene classification in the audio signal processing community. In contrast, few studies have been conducted on acoustic scene clustering, which is a newly emerging problem. Acoustic scene clustering aims at merging the audio recordings of the same class of acoustic scene into a single cluster without using prior information and training classifiers. In this study, we propose a method for acoustic scene clustering that jointly optimizes the procedures of feature learning and clustering iteration. In the proposed method, the learned feature is a deep embedding that is extracted from a deep convolutional neural network (CNN), while the clustering algorithm is the agglomerative hierarchical clustering (AHC). We formulate a unified loss function for integrating and optimizing these two procedures. Various features and methods are compared. The experimental results demonstrate that the proposed method outperforms other unsupervised methods in terms of the normalized mutual information and the clustering accuracy. In addition, the deep embedding outperforms many state-of-the-art features.<br />9 pages, 6 figures, 11 tables. Accepted for publication in IEEE TMM
- Subjects :
- FOS: Computer and information sciences
Sound (cs.SD)
Computer science
Feature extraction
02 engineering and technology
Convolutional neural network
Computer Science - Sound
Audio and Speech Processing (eess.AS)
0202 electrical engineering, electronic engineering, information engineering
Media Technology
FOS: Electrical engineering, electronic engineering, information engineering
Electrical and Electronic Engineering
Cluster analysis
business.industry
Pattern recognition
Computer Science Applications
Statistical classification
ComputingMethodologies_PATTERNRECOGNITION
Feature (computer vision)
Signal Processing
Embedding
020201 artificial intelligence & image processing
Artificial intelligence
business
Feature learning
Electrical Engineering and Systems Science - Audio and Speech Processing
Subjects
Details
- Language :
- English
- Database :
- OpenAIRE
- Accession number :
- edsair.doi.dedup.....5d4c1d23b22a4ddac81492b98477e5dc