loading...
A Probabilistic Approach to Metasearching with Adaptive Probing
Boston, Massachusetts March 30-April 02
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICDE.2004.132002620th International Conference on Data ...
 This Article 
 
PDF
HTML
 
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Zhenyu Liu, University of California, Los Angeles
Chang Luo, University of California, Los Angeles
Junghoo Cho, University of California, Los Angeles
Wesley W. Chu, University of California, Los Angeles
An ever-increasing amount of valuable information is stored in Web databases, "hidden" behind search interfaces. To save the user's effort in manually exploring each database, metasearchers automatically select the most relevant databases to a user's query. In this paper, we focus on one of the technical challenges in metasearching, namely database selection. Past research uses a pre-collected summary of each database to estimate its "relevancy" to the query, and in many cases make incorrect database selection. In this paper, we propose two techniques: probabilistic relevancy modelling and adaptive probing. First, we model the relevancy of each database to a given query as a probabilistic distribution, derived by sampling that database. Using the probabilistic model, the user can explicitly specify a desired level of certainty for database selection. The adaptive probing technique decides which and how many databases to contact in order to satisfy the user's requirement. Our experiments on real Hidden-Web databases indicate that our approach significantly improves the accuracy of database selection at the cost of a small number of database probing.
Citation:
Zhenyu Liu, Chang Luo, Junghoo Cho, Wesley W. Chu, "A Probabilistic Approach to Metasearching with Adaptive Probing," icde, pp.547, 20th International Conference on Data Engineering (ICDE'04), 2004
Usage of this product signifies your acceptance of the Terms of Use.


Suggestions