Back to Search Start Over

HaFT: A handwritten Farsi text database

Authors :
Golnaz Ghiasi
Ali Reza Ghanbarian
Reza Safabaksh
Source :
2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP).
Publication Year :
2013
Publisher :
IEEE, 2013.

Abstract

Standard databases provide for evaluation and comparison of various pattern recognition techniques by different researchers; thus they are essential for the advance of research. There are different handwritten databases in various languages, but there is not a large standard database of handwritten text for the evaluation of different algorithms for writer identification and verification in Farsi. This paper introduces a large handwritten Farsi text database called HaFT. The database contains 1800 gray scale images of unconstrained text written by 600 writers. Each participant gave three separate eight-line samples of his handwriting, each of which was written at a different time on a separate sheet. HaFT is presented in several versions each including different lengths of text and using identical or different writing instruments. A new measure, called CVM, is defined which effectively reflects the size of handwriting and thus the content volume of a given text image. This database is designed for training and testing Farsi writer identification and verification using handwritten text. In addition, the database can also be used in training and testing handwritten Farsi text segmentation and recognition algorithms. HaFT is available for research use.

Details

Database :
OpenAIRE
Journal :
2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP)
Accession number :
edsair.doi...........e6bcb9b4da7dcc603b643b44c4999fad
Full Text :
https://doi.org/10.1109/iranianmvip.2013.6779956