loading...
Investigating the Effect of Multiple Communities on Kernel-Based Citation Analysis
Atlanta, Georgia April 03-April 07
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICDEW.2006.7022nd International Conference on Data ...
 This Article 
 
PDF
HTML
 
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Takahiko Ito, Nara Institute of Science and Technology, Japan
Masashi Shimbo, Nara Institute of Science and Technology, Japan
Daich Mochihashi, ATR Spoken Language Communication Research Laboratories, Japan
Yuji Matsumoto, Nara Institute of Science and Technology, Japan
In this paper, we discuss issues raised by applying Kandola et al.'s Neumann kernels to large citation graphs that have multiple communities. Neumann kernels can identify not only documents related a given document but also the most important documents in a citation graph. However, when Neumann kernels are biased towards importance, topranked documents are uniformly documents in the dominant community of the citation graph irrespective of the communities where the target document is cited.

To solve this problem, we model a generation process of citations by probabilistic Latent Semantic Indexing, and then construct a weighted graph (hidden topic graph) for each community (topic). Applying Neumann kernels to each hidden topic graph, we can rank documents on the basis of the communities in which they appear.

Citation:
Takahiko Ito, Masashi Shimbo, Daich Mochihashi, Yuji Matsumoto, "Investigating the Effect of Multiple Communities on Kernel-Based Citation Analysis," icdew, pp.x113, 22nd International Conference on Data Engineering Workshops (ICDEW'06), 2006
Usage of this product signifies your acceptance of the Terms of Use.


Suggestions