Modeling Delta Encoding of Compressed Files
Snowbird, Utah March 28-March 30
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/DCC.2006.47
Data Compression Conference (DCC'06)
S.T. Klein, Bar Ilan University, Israel
T.C. Serebro, Bar Ilan University, Israel
D. Shapira, Ashkelon Acad. Colleges, Israel
We introduce a new model of differencing encoding, that of Compressed Differencing. Given two files of which at least one is in compressed form, the goal is to create a third file which is the delta file of the two original files, in time proportional to the size of the input, that is, without decompressing the compressed files. If both files, S and T, are compressed using the same static Huffman code, generating the differencing file can be done in the traditional way (using a sliding window) directly on the compressed files. The delta encoding is at least as efficient as the delta encoding generated on the original files S and T. Common substrings of S and T are still common substrings of the compressed versions of S and T. However, the reverse is not necessarily true, since the common substrings can extend beyond codeword boundaries.
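The substring property stated in the abstract can be illustrated with a minimal Python sketch. It is not the authors' algorithm. The code table `CODE`, the sample strings `S` and `T`, and the helper names `encode` and `longest_common_substring` are all hypothetical, chosen only for illustration, and a plain dynamic-programming search stands in for the sliding window.

```python
# Hypothetical static prefix code shared by both files (an assumption for
# illustration; any static Huffman code built for the alphabet would do).
CODE = {"a": "0", "b": "10", "c": "110", "d": "111"}


def encode(text):
    """Concatenate the codewords of the characters of text into a bit string."""
    return "".join(CODE[ch] for ch in text)


def longest_common_substring(x, y):
    """Return one longest common substring of x and y (O(len(x) * len(y)))."""
    best_len, best_end = 0, 0
    prev = [0] * (len(y) + 1)
    for i in range(1, len(x) + 1):
        cur = [0] * (len(y) + 1)
        for j in range(1, len(y) + 1):
            if x[i - 1] == y[j - 1]:
                cur[j] = prev[j - 1] + 1
                if cur[j] > best_len:
                    best_len, best_end = cur[j], i
        prev = cur
    return x[best_end - best_len:best_end]


# Hypothetical sample files.
S = "abacabad"
T = "dcabadab"

# Longest match on the original text.
common = longest_common_substring(S, T)

# Longest match found directly on the compressed bit strings.
es, et = encode(S), encode(T)
common_bits = longest_common_substring(es, et)

print("common substring of S and T:", common)
print("its encoding:               ", encode(common))
print("longest common bit string:  ", common_bits)
```

Because both files use the same static code, the encoding of any common substring of S and T occurs in both bit strings, so a match found on the compressed data is never shorter in bits than the encoded match on the originals. The bit-level match may also start or end in the middle of a codeword, which is why the reverse direction does not hold in general.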
Citation:
S.T. Klein, T.C. Serebro, D. Shapira, "Modeling Delta Encoding of Compressed Files," dcc, pp.457, Data Compression Conference (DCC'06), 2006