S. Mandal, CST Dept., B. E. & Science University, India
A. K. Das, CST Dept., B. E. & Science University, India
Identification and segmentation of the table of contents (TOC) and index pages is an obvious task in the development of a digital library. A digital document library is created to provide a non-labour-intensive, cheap, and flexible way to store, represent, and manage paper documents in electronic form, and to facilitate indexing, viewing, printing, and extracting the intended portions. Using document image analysis techniques, information from the TOC and index pages may be extracted and used in a document database for effective retrieval of the required pieces of information. In this paper, we present fully automatic identification and segmentation of TOC and index pages from scanned documents.
Citation:
S. Mandal, S. P. Chowdhury, A. K. Das, Bhabatosh Chanda, "Detection and Segmentation of Table of Contents and Index Pages from Document Images," Second International Conference on Document Image Analysis for Libraries (DIAL'06), pp. 70-81, 2006.