A. Das, University, Shibpur, India
Text, graphics and half-tones are the major constituents of any document page. While half-tone can be characterised by its inherent intensity variation, text and graphics share common characteristics except difference in spatial distri- bution. The success of document image analysis systems depends on the proper segmentation of text and graphics as text is further subdivided into other classes such as heading, table and math-zones. Segmentation of graphics is essential for better OCR performance and vectorization in computer vision applications. Graphics segmentation from text is par- ticularly difficult in the context of graphics made of small components (dashed or dotted lines etc.) which have many features similar to texts. Here we propose a robust tech- nique for segmenting all sorts of graphics and texts in any orientation from document pages.
Citation:
S. Chowdhury, S. Mandal, A. Das, B. Chanda, "Segmentation of Text and Graphics from Document Images," icdar, vol. 2, pp.619-623, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007) Vol 2, 2007