Utpal Garain, Indian Statistical Institute, 203, B.T. Road, Kolkata 700108, India
M. P. Chakraborty, Indian Statistical Institute, 203, B.T. Road, Kolkata 700108, India
Bhabatosh Chanda, Indian Statistical Institute, 203, B.T. Road, Kolkata 700108, India
This paper presents a method for lossless compression of Indian language textual images. The study is an extension of the previously developed pattern matching and substitution (PM&S)-based method for lossy compression of similar images. Here an efficient method for residue coding is proposed and its performance is compared with CCITT Gr-IV and JBIG. A set of 20 text images for two most popular Indic scripts, namely Devanagari (Hindi) and Bengali, is used in the experiment. It is noted that the best results is achieved by PM&S-based approach followed by LZW-based residue coding. This combined scheme gives lossless compression ratio1 of about 37.9.
Citation:
Utpal Garain, M. P. Chakraborty, Bhabatosh Chanda, "Lossless Compression of Textual Images: A Study on Indic Script Documents," icpr, vol. 3, pp.806-809, 18th International Conference on Pattern Recognition (ICPR'06) Volume 3, 2006