loading...
Improving Index Compression Using Cluster Information
Hong Kong, China December 18-December 22
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/WI.2006.962006 IEEE/WIC/ACM International Confe ...
 This Article 
 
PDF
HTML
 
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Jinlin Chen, CUNY, USA
Ping Zhong, CUNY, USA
Terry Cook, CUNY, USA
The clustering property of document collections in Web search engines provides valuable information for improving index compression. By clustering d-gaps of an inverted list and then encoding clustered and nonclustered d-gaps using different codes, we can tailor to the specific properties of different d-gaps and achieve better compression ratio. Further improvement on index compression can be achieved by adaptively adjusting the cluster threshold for inverted lists. Based on these ideas, in this paper we propose adaptive cluster based mixed codes for inverted file index compression. Experiment results show that codes using adaptive cluster based mixed approach have better performance in terms of compression ratio and lower complexity comparing to interpolative code which is considered as one of the most efficient bitwise codes at present.
Citation:
Jinlin Chen, Ping Zhong, Terry Cook, "Improving Index Compression Using Cluster Information," wi, pp.188-194, 2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI'06), 2006
Usage of this product signifies your acceptance of the Terms of Use.