Longzhuang Li, Dept. of Comput. Eng. & Comput. Sci., Missouri Univ., Columbia, MO, USA
Yi Shang, Dept. of Comput. Eng. & Comput. Sci., Missouri Univ., Columbia, MO, USA
Abstract: We present a new statistical method for evaluating search engines' precision performance based on sample queries. The method consists of relevance evaluation and statistical comparison. In relevance evaluation, we present two scoring algorithms: one is a term-based algorithm based on the vector space model, and the other is a new three-level algorithm modeled after manual methods commonly used in information retrieval studies. In statistical comparison, we apply a statistical metric probability of win, in ranking the search engines. Based on a set of sample queries, our method evaluates the relevance of the pages returned by the search engines and compares them statistically In the experiment, our method was applied to three search engines, AltaVista, Google, and InfoSeek, using two query sets derived from the domain of parallel and distributed processing. Our results show that the three-level scoring algorithm with a typical set of parameters obtained results consistent with those obtained using the manual method, whereas the term-based algorithm did not.
Index Terms:
search engines; information retrieval system evaluation; relevance feedback; statistical method; search engines; precision performance evaluation; sample queries; relevance evaluation; statistical comparison; scoring algorithms; term-based algorithm; vector space model; three-level algorithm; information retrieval; statistical metric win probability; AltaVista; Google; InfoSeek; distributed processing; parallel processing
Citation:
Longzhuang Li, Yi Shang, "A new statistical method for performance evaluation of search engines," ictai, pp.0208, 12th IEEE International Conference on Tools with Artificial Intelligence (ICTAI'00), 2000