loading...
Efficient Query Evaluation on Large Textual Collections in a Peer-to-Peer Environment
Konstanz, Germany August 31-September 02
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/P2P.2005.7Fifth IEEE International Conference o ...
 This Article 
 
PDF
HTML
 
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Jiangong Zhang, Polytechnic University
Torsten Suel, Polytechnic University
We study the problem of evaluating ranked (top-k) queries on textual collections ranging from multiple giga-bytes to terabytes in size. We focus on the case of a global index organization in a highly distributed environment, and consider a class of ranking functions that includes common variants of the Cosine and Okapi measures. The main bottleneck in such a scenario is the amount of communication required during query evaluation. We propose several efficient query evaluation schemes and evaluate their performance. Our results on real search engine query traces and over 120 million web pages show that after careful optimization such queries can be evaluated at a reasonable cost, while challenges remain for even larger collections and more general classes of ranking functions.
Citation:
Jiangong Zhang, Torsten Suel, "Efficient Query Evaluation on Large Textual Collections in a Peer-to-Peer Environment," p2p, pp.225-233, Fifth IEEE International Conference on Peer-to-Peer Computing (P2P'05), 2005
Usage of this product signifies your acceptance of the Terms of Use.