loading...
Automatic Web Page Categorization using Principal Component Analysis
Big Island, Hawaii January 03-January 06
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/HICSS.2007.9840th Annual Hawaii International Conf ...
 This Article 
 
PURCHASE ARTICLE: $0
HTML
 
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Richong Zhang, Dalhousie University
Michael Shepherd, Dalhousie University
Jack Duffy, Dalhousie University
Carolyn Watters, Dalhousie University
Today?s search engines retrieve tens of thousands of web pages in response to fairly simple query articulations. These pages are retrieved on the basis of the query terms occurring in the web pages and the popularity of the web pages as per the link structure of the web. However, these search engines do not take into account the broader information need of the user, such as the task in which the user is involved. This research investigates the automatic categorization of web pages using Principal Component Analysis. The research focuses on user tasks that involve searching for web pages containing health information, education information or shopping information. Initial results are encouraging with recall and precision values slightly in excess of 80%.
Citation:
Richong Zhang, Michael Shepherd, Jack Duffy, Carolyn Watters, "Automatic Web Page Categorization using Principal Component Analysis," hicss, pp.73a, 40th Annual Hawaii International Conference on System Sciences (HICSS'07), 2007
Usage of this product signifies your acceptance of the Terms of Use.