Motifs in Ziv-Lempel-Welch Clef
Data Compression Conference (DCC '04)
Snowbird, Utah, March 23-25, 2004
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/DCC.2004.1281452
Alberto Apostolico, Univ. of Padova & Purdue Univ.
Matteo Comin, Univ. of Padova
Laxmi Parida, IBM T. J. Watson Center
We present variants of the classical data compression paradigms of Ziv, Lempel, and Welch in which the phrases used in compression are selected from among suitably chosen motifs, defined here as strings of intermittently solid and wild characters that recur more or less frequently in the source textstring. This notion emerged primarily in the analysis of biological sequences and molecules. Whereas the number of motifs in a sequence or family may be exponential in the size of the input, a linear-sized basis of irredundant motifs may be defined such that any other motif can be obtained as the union of a suitable subset of the basis. Previous studies have exposed the advantages of using irredundant motifs in lossy as well as lossless off-line compression. In the present paper, we examine adaptations and extensions of the classical incremental ZL and ZLW paradigms. First, hybrid schemata are proposed in which motifs may be discovered and selected off-line, while the parse and encoding are still conducted on-line. The performance thus obtained improves, on the one hand, over previous off-line implementations of motif-based compression and, on the other, over the traditionally best implementations of ZLW. Building on this, both lossy and lossless motif-based schemata that follow the ZL and ZLW paradigms more closely are introduced and tested.
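To make the two ingredients named in the abstract concrete, the following is a minimal Python sketch, not the authors' method. It shows (a) how a motif with wild characters can be matched against a text and (b) a textbook LZW parse of the kind the paper adapts. The wildcard symbol `.` and the function names `matches_at`, `occurrences`, and `lzw_encode` are assumptions made for illustration; the abstract does not specify a notation, and the motif-selection and encoding steps of the paper are not reproduced here.

```python
WILD = "."  # assumed don't-care symbol; the abstract does not fix a notation


def matches_at(motif, text, i):
    """Return True if motif occurs in text at index i.

    Solid characters must match exactly; WILD matches any single character.
    """
    if i < 0 or i + len(motif) > len(text):
        return False
    window = text[i:i + len(motif)]
    return all(m == WILD or m == c for m, c in zip(motif, window))


def occurrences(motif, text):
    """List every start index at which motif occurs in text."""
    return [i for i in range(len(text) - len(motif) + 1)
            if matches_at(motif, text, i)]


def lzw_encode(text):
    """Textbook LZW parse (no motifs involved).

    The dictionary is seeded with the distinct characters of text in sorted
    order; each emitted code is the index of the longest known phrase.
    Returns the code list and the seed alphabet.
    """
    alphabet = sorted(set(text))
    table = {ch: k for k, ch in enumerate(alphabet)}
    codes = []
    phrase = ""
    for ch in text:
        if phrase + ch in table:
            phrase += ch              # keep extending the current phrase
        else:
            codes.append(table[phrase])
            table[phrase + ch] = len(table)  # learn the new phrase
            phrase = ch
    if phrase:
        codes.append(table[phrase])   # flush the final phrase
    return codes, alphabet


if __name__ == "__main__":
    print(occurrences("a.c", "abcaxcab"))
    print(lzw_encode("abababab")[0])
```

In this sketch the motif `a.c` covers both `abc` and `axc`, which is the sense in which a single motif can stand in for several distinct solid phrases; how such motifs are chosen and fed into the dictionary is the subject of the paper and is left out here.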
Citation:
Alberto Apostolico, Matteo Comin, Laxmi Parida, "Motifs in Ziv-Lempel-Welch Clef," dcc, pp.72, Data Compression Conference (DCC '04), 2004