Back to Search
Start Over
Data Augmentation for End-to-End Optical Music Recognition
- Source :
- Document Analysis and Recognition – ICDAR 2021 Workshops ISBN: 9783030861971, ICDAR Workshops (1)
- Publication Year :
- 2021
- Publisher :
- Springer International Publishing, 2021.
-
Abstract
- Optical Music Recognition (OMR) is the research area that studies how to transcribe the content from music documents into a structured digital format. Within this field, techniques based on Deep Learning represent the current state of the art. Nevertheless, their use is constrained by the large amount of labeled data required, which constitutes a relevant issue when dealing with historical manuscripts. This drawback can be palliated by means of Data Augmentation (DA), which encompasses a series of strategies to increase data without the need of manual labeling new images. This work studies the applicability of specific DA techniques in the context of end-to-end staff-level OMR methods. More precisely, considering two corpora of historical music manuscripts, we applied different types of distortions to the music scores and assessed their contribution in an end-to-end system. Our results show that some transformations are much more appropriate than others, leading up to a \(34.5\%\) of relative improvement with respect to scenario without DA.
- Subjects :
- Optical music recognition
business.industry
Computer science
Deep learning
Context (language use)
computer.software_genre
Field (computer science)
End-to-end principle
Content (measure theory)
Artificial intelligence
State (computer science)
business
computer
Natural language processing
Drawback
Subjects
Details
- ISBN :
- 978-3-030-86197-1
- ISBNs :
- 9783030861971
- Database :
- OpenAIRE
- Journal :
- Document Analysis and Recognition – ICDAR 2021 Workshops ISBN: 9783030861971, ICDAR Workshops (1)
- Accession number :
- edsair.doi...........9d33f899ccee613370df4faf25b7fb0f
- Full Text :
- https://doi.org/10.1007/978-3-030-86198-8_5