Using Genetic Algorithm in Building Domain-Specific Collections: An Experiment in the Nanotechnology Domain
Big Island, Hawaii, January 3-6, 2005
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/HICSS.2005.659
Proceedings of the 38th Annual Hawaii ...
Jialun Qin, University of Arizona
Hsinchun Chen, University of Arizona
Focused crawling, the key technique for building domain-specific search engines, has drawn considerable attention from researchers over the past decade. However, advances in Web structure analysis have revealed several problems in traditional focused crawler design that can result in low-quality domain-specific collections. In this work, we studied the problems in focused crawling that are caused by the use of local search algorithms, and we proposed using a global search algorithm, the Genetic Algorithm, to address them. We conducted evaluation experiments to examine the effectiveness of our approach. The results showed that it could build domain-specific collections of higher quality than traditional focused crawling techniques. Furthermore, we used the concept of Web communities to evaluate how comprehensively focused crawlers traverse the Web search space, which could be a good complement to traditional focused crawler evaluation methods.
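The contrast between local and global search described in the abstract can be illustrated with a toy example. The sketch below is not the authors' implementation: the link graph, the relevance scores, the threshold, and the function names `local_crawl` and `ga_crawl` are all hypothetical, and the genetic operators are deliberately simplified (deterministic selection by score, "crossover" as following outlinks of surviving pages, "mutation" as injecting a page from an assumed external pool such as search-engine results).

```python
# Toy link graph (hypothetical): page -> outlinks. Two relevant communities
# ("a*" and "b*") are connected only through the irrelevant page "x1".
LINKS = {
    "a1": ["a2", "a3"],
    "a2": ["a3", "x1"],
    "a3": ["a1"],
    "x1": ["b1"],
    "b1": ["b2"],
    "b2": ["b1"],
}
# Hypothetical relevance scores in [0, 1]; pages below THRESHOLD are off-topic.
RELEVANCE = {"a1": 0.9, "a2": 0.8, "a3": 0.7, "x1": 0.1, "b1": 0.9, "b2": 0.8}
THRESHOLD = 0.5


def local_crawl(seeds):
    """Best-first crawl that never follows links out of irrelevant pages."""
    frontier, collected, seen = list(seeds), [], set(seeds)
    while frontier:
        frontier.sort(key=RELEVANCE.get, reverse=True)
        page = frontier.pop(0)
        if RELEVANCE[page] < THRESHOLD:
            continue  # irrelevant page: discard it and ignore its outlinks
        collected.append(page)
        for nxt in LINKS.get(page, []):
            if nxt not in seen:
                seen.add(nxt)
                frontier.append(nxt)
    return collected


def ga_crawl(seeds, mutation_pool, generations=4, pop_size=10):
    """Simplified GA-style crawl (an assumption, not the paper's algorithm)."""
    population, collected, seen = list(seeds), set(), set(seeds)
    for _ in range(generations):
        # Selection: keep the fittest relevant pages of this generation.
        relevant = [p for p in population if RELEVANCE[p] >= THRESHOLD]
        fit = sorted(relevant, key=RELEVANCE.get, reverse=True)[:pop_size]
        collected.update(fit)
        # "Crossover" (simplified): unseen outlinks of the survivors.
        children = []
        for page in fit:
            for nxt in LINKS.get(page, []):
                if nxt not in seen:
                    seen.add(nxt)
                    children.append(nxt)
        # "Mutation": inject one unseen page from the external pool, which
        # lets the crawl jump to pages not reachable through relevant links.
        mutants = [p for p in mutation_pool if p not in seen][:1]
        seen.update(mutants)
        population = children + mutants
    return collected


local_pages = set(local_crawl(["a1"]))
ga_pages = ga_crawl(["a1"], mutation_pool=["b1"])
print(sorted(local_pages))
print(sorted(ga_pages))
```

In this toy graph the local crawl stays inside the first community because the only path to the second one passes through an irrelevant page, while the mutation step gives the GA-style crawl an entry point into the second community.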
Citation:
Jialun Qin, Hsinchun Chen, "Using Genetic Algorithm in Building Domain-Specific Collections: An Experiment in the Nanotechnology Domain," hicss, vol. 4, p. 102b, Proceedings of the 38th Annual Hawaii International Conference on System Sciences (HICSS'05) - Track 4, 2005