1. A novel segmentation technique for online handwritten Bangla words
- Author
-
Mridul Mitra, Shibaprasad Sen, Ram Sarkar, Friedhelm Schwenker, Kaushik Roy, and Shubham Chowdhury
- Subjects
Scheme (programming language) ,Computer science ,02 engineering and technology ,01 natural sciences ,Set (abstract data type) ,Artificial Intelligence ,0103 physical sciences ,0202 electrical engineering, electronic engineering, information engineering ,Segmentation ,010306 general physics ,computer.programming_language ,Ground truth ,business.industry ,Text segmentation ,Pattern recognition ,language.human_language ,Bengali ,Signal Processing ,Word recognition ,language ,020201 artificial intelligence & image processing ,Computer Vision and Pattern Recognition ,Artificial intelligence ,business ,computer ,Software ,Word (computer architecture) - Abstract
In the present work, we have proposed a novel Bangla word segmentation technique that is based on stroke-level busy zone formation procedure. In an unconstrained domain, people often write text where strokes may be poorly aligned (due to multi-directional skewness) and varied combination of strokes with various types of joining between them are possible while forming the words. Hence, a segmentation approach for stroke extraction is pertinent for any stroke-based word recognition system. The presence of a large volume of symbols set (58 basic symbols with more than 280 compound characters) in Bangla script makes the task more challenging. In the current experiment, our stroke-level segmentation approach effectively handles such type of Bangla words. A sub-zoning scheme within busy zone followed by a modified Down->Up->Down (DUD) concept within these sub-zones has been used to find valid segmentation points. This scheme avoids over and under-segmentation issues caused by either inherent writing pattern or due to writing style variations up to certain extent. The proposed segmentation approach has been tested on 6500 online handwritten Bangla word samples with 98.45% correct segmentation accuracy (compared with manually generated ground truth of the same database).
- Published
- 2020
- Full Text
- View/download PDF