Back to Search Start Over

A Framework for Content-Based Search in Large Music Collections

Authors :
Tiange Zhu
Raphaël Fournier-S’niehotta
Philippe Rigaux
Nicolas Travers
Source :
Big Data and Cognitive Computing, Vol 6, Iss 1, p 23 (2022)
Publication Year :
2022
Publisher :
MDPI AG, 2022.

Abstract

We address the problem of scalable content-based search in large collections of music documents. Music content is highly complex and versatile and presents multiple facets that can be considered independently or in combination. Moreover, music documents can be digitally encoded in many ways. We propose a general framework for building a scalable search engine, based on (i) a music description language that represents music content independently from a specific encoding, (ii) an extendible list of feature-extraction functions, and (iii) indexing, searching, and ranking procedures designed to be integrated into the standard architecture of a text-oriented search engine. As a proof of concept, we also detail an actual implementation of the framework for searching in large collections of XML-encoded music scores, based on the popular ElasticSearch system. It is released as open-source in GitHub, and available as a ready-to-use Docker image for communities that manage large collections of digitized music documents.

Details

Language :
English
ISSN :
25042289
Volume :
6
Issue :
1
Database :
Directory of Open Access Journals
Journal :
Big Data and Cognitive Computing
Publication Type :
Academic Journal
Accession number :
edsdoj.47f9ee4b36794a4090fda661b41fdaa1
Document Type :
article
Full Text :
https://doi.org/10.3390/bdcc6010023