Back to Search Start Over

Layout based document image retrieval by means of XY tree reduction

Authors :
Emanuele Marino
Giovanni Soda
Simone Marinai
Source :
ICDAR
Publication Year :
2005
Publisher :
IEEE, 2005.

Abstract

We analyze a system for the retrieval of document images on the basis of layout similarity. Layout objects are extracted and represented with the XY tree. Page similarity is computed with a tree-edit distance algorithm. The peculiarity of the approach is the use of tree grammars to model the variations in the tree, which are due to segmentation algorithms or to structural differences between documents with similar layout. A few class-independent grammatical rules are used to modify each tree and obtain a reduced tree that is supposed to preserve the most relevant features of the page.

Details

Database :
OpenAIRE
Journal :
Eighth International Conference on Document Analysis and Recognition (ICDAR'05)
Accession number :
edsair.doi...........b2afda1ecba242e2bc8a0bc8d556df7b
Full Text :
https://doi.org/10.1109/icdar.2005.150