Back to Search Start Over

Visual exploration of latent space for traditional Chinese music

Authors :
Han-Wei Shen
Jingyi Shen
Runqi Wang
Source :
Visual Informatics, Vol 4, Iss 2, Pp 99-108 (2020)
Publication Year :
2020
Publisher :
Elsevier, 2020.

Abstract

Generating compact and effective numerical representations of data is a fundamental step for many machine learning tasks. Traditionally, handcrafted features are used but as deep learning starts to show its potential, using deep learning models to extract compact representations becomes a new trend. Among them, adopting vectors from the model’s latent space is the most popular. There are several studies focused on visual analysis of latent space in NLP and computer vision. However, relatively little work has been done for music information retrieval (MIR) especially incorporating visualization. To bridge this gap, we propose a visual analysis system utilizing Autoencoders to facilitate analysis and exploration of traditional Chinese music. Due to the lack of proper traditional Chinese music data, we construct a labeled dataset from a collection of pre-recorded audios and then convert them into spectrograms. Our system takes music features learned from two deep learning models (a fully-connected Autoencoder and a Long Short-Term Memory (LSTM) Autoencoder) as input. Through interactive selection, similarity calculation, clustering and listening, we show that the latent representations of the encoded data allow our system to identify essential music elements, which lay the foundation for further analysis and retrieval of Chinese music in the future.

Details

Language :
English
Volume :
4
Issue :
2
Database :
OpenAIRE
Journal :
Visual Informatics
Accession number :
edsair.doi.dedup.....f24ba6fe41865d78c5a51aecb5cc5e62