Back to Search Start Over

Data Augmentation for End-to-End Optical Music Recognition

Authors :
Jose J. Valero-Mas
Juan C. López-Gutiérrez
Jorge Calvo-Zaragoza
Francisco J. Castellanos
Source :
Document Analysis and Recognition – ICDAR 2021 Workshops ISBN: 9783030861971, ICDAR Workshops (1)
Publication Year :
2021
Publisher :
Springer International Publishing, 2021.

Abstract

Optical Music Recognition (OMR) is the research area that studies how to transcribe the content from music documents into a structured digital format. Within this field, techniques based on Deep Learning represent the current state of the art. Nevertheless, their use is constrained by the large amount of labeled data required, which constitutes a relevant issue when dealing with historical manuscripts. This drawback can be palliated by means of Data Augmentation (DA), which encompasses a series of strategies to increase data without the need of manual labeling new images. This work studies the applicability of specific DA techniques in the context of end-to-end staff-level OMR methods. More precisely, considering two corpora of historical music manuscripts, we applied different types of distortions to the music scores and assessed their contribution in an end-to-end system. Our results show that some transformations are much more appropriate than others, leading up to a \(34.5\%\) of relative improvement with respect to scenario without DA.

Details

ISBN :
978-3-030-86197-1
ISBNs :
9783030861971
Database :
OpenAIRE
Journal :
Document Analysis and Recognition – ICDAR 2021 Workshops ISBN: 9783030861971, ICDAR Workshops (1)
Accession number :
edsair.doi...........9d33f899ccee613370df4faf25b7fb0f
Full Text :
https://doi.org/10.1007/978-3-030-86198-8_5