HITS is Principal Components Analysis
Compiègne University of Technology, France, September 19-September 22
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/WI.2005.712005 IEEE/WIC/ACM International Confe ...
Marco Saerens, Université Catholique de Louvain
Francois Fouss, Université Catholique de Louvain
In this work, we show that Kleinberg's hubs and authorities model (HITS) is simply Principal Components Analysis (PCA; perhaps the most widely used multivariate statistical analysis method), albeit without centering, applied to the adjacency matrix of the graph of web pages. We further show that a variant of HITS, SALSA, is closely related to correspondence analysis, another standard multivariate statistical analysis method. In addition to providing a clear statistical interpretation of HITS, this result suggests relying on existing work in the multivariate statistical analysis literature (extensions of PCA or correspondence analysis) to analyse or design new web page scoring procedures.
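The equivalence stated in the abstract can be checked numerically on a small example. The sketch below is an illustration, not the authors' code: it assumes NumPy, uses a made-up four-page graph, and the function names `hits` and `uncentered_pca_first_axis` are hypothetical. It runs the usual HITS power iteration and compares the result with the first pair of singular vectors of the raw (uncentered) adjacency matrix, which is what PCA without centering computes as its first axis.

```python
import numpy as np


def hits(adjacency, iterations=200):
    """HITS power iteration; returns (hubs, authorities) with unit L2 norm."""
    n_rows, n_cols = adjacency.shape
    hubs = np.ones(n_rows)
    auths = np.ones(n_cols)
    for _ in range(iterations):
        # A page is a good authority if good hubs point to it.
        auths = adjacency.T @ hubs
        auths /= np.linalg.norm(auths)
        # A page is a good hub if it points to good authorities.
        hubs = adjacency @ auths
        hubs /= np.linalg.norm(hubs)
    return hubs, auths


def uncentered_pca_first_axis(adjacency):
    """First left/right singular vectors of the uncentered matrix."""
    u, _, vt = np.linalg.svd(adjacency)
    left, right = u[:, 0], vt[0]
    # Singular vectors are defined up to a joint sign; make them non-negative.
    if right.sum() < 0:
        left, right = -left, -right
    return left, right


# Hypothetical toy graph: entry [i, j] = 1 means page i links to page j.
A = np.array([
    [0, 1, 1, 0],
    [0, 0, 1, 0],
    [1, 0, 0, 1],
    [0, 0, 1, 0],
], dtype=float)

hubs, auths = hits(A)
left, right = uncentered_pca_first_axis(A)

print("hub scores match first left singular vector: ", np.allclose(hubs, left))
print("authority scores match first right singular vector:", np.allclose(auths, right))
```

The comparison relies on the largest singular value of the adjacency matrix being strictly larger than the second; when it is not, the power iteration depends on its starting vector and the two computations need not agree.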
Citation:
Marco Saerens, Francois Fouss, "HITS is Principal Components Analysis," 2005 IEEE/WIC/ACM International Conference on Web Intelligence (WI'05), pp. 782-785, 2005