This paper describes a new system for extracting and classifying bibliography regions from the color image of a book cover. The system consists of three major components: preprocessing, color space segmentation, and text regions extraction and classification.Preprocessing extracts the edge lines of the book, gets the basic information, and geometrically corrects and segments the input image, into the parts of front cover, spine, and back cover.The same as all color image processing researches, the segmentation of color space is an essential and important step here. Instead of RGB color space, HSI color space is used in this system. The color space is segmented into achromatic and chromatic regions first; and both the achromatic and chromatic regions are segmented further to complete color space segmentation.Then text regions extraction and classification follows. After detecting fundamental features (stroke width and local label width) text regions are determined. By comparing the text regions on front cover with those on spine, all extracted text regions are classified into suitable bibliography categories: author, title, publisher, and other information, without applying OCR.
Citation:
Hua Yang, Norikazu Onda, Masaaki Kashimura, Shinji Ozawa, "Extraction of Bibliography Information Based on Image of Book Cover," iciap, pp.921, 10th International Conference on Image Analysis and Processing (ICIAP'99), 1999