We describe a low-complexity scheme for lossless compression of short text messages. The method uses arithmetic coding and a specific statistical context model for prediction of single symbols. Our particular contribution is a simple yet effective approach for storing highly complex statistics in a succinct yet effective data model that can easily be trained by text data. The proposed model already gives good compression rates with a RAM memory size of 128 kByte, thus making lossless data compression with statistical context modeling readily applicable to small devices like wireless sensors or mobile phones.
Citation:
Stephan Rein, Clemens Guhmann,, Frank H.P. Fitzek, "Low-Complexity Compression of Short Messages," dcc, pp.123-132, Data Compression Conference (DCC'06), 2006