Back to Search Start Over

Representative Image Selection for Data Efficient Word Spotting

Authors :
Westphal, Florian
Grahn, Håkan
Lavesson, Niklas
Westphal, Florian
Grahn, Håkan
Lavesson, Niklas
Publication Year :
2020

Abstract

This paper compares three different word image representations as base for label free sample selection for word spotting in historical handwritten documents. These representations are a temporal pyramid representation based on pixel counts, a graph based representation, and a pyramidal histogram of characters (PHOC) representation predicted by a PHOCNet trained on synthetic data. We show that the PHOC representation can help to reduce the amount of required training samples by up to 69% depending on the dataset, if it is learned iteratively in an active learning like fashion. While this works for larger datasets containing about 1 700 images, for smaller datasets with 100 images, we find that the temporal pyramid and the graph representation perform better.<br />open access

Details

Database :
OAIster
Notes :
application/pdf, English
Publication Type :
Electronic Resource
Accession number :
edsoai.on1178731414
Document Type :
Electronic Resource
Full Text :
https://doi.org/10.1007.978-3-030-57058-3_27