loading...
Finding Conceptual Document Clusters with Improved Top-N Formal Concept Search
Hong Kong, China December 18-December 22
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/WI.2006.812006 IEEE/WIC/ACM International Confe ...
 This Article 
 
PDF
HTML
 
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Yoshiaki Okubo, Hokkaido University, Japan
Makoto Haraguchi, Hokkaido University, Japan
In this paper, we discuss a method for conceptual clustering of documents. Our cluster is defined with the notion of Formal Concept Analysis which can provide a conceptual meaning for each document cluster. Our clustering is formalized as a Top-N \delta-valid formal concept problem. We improve our previous clique search-based algorithm for the problem so that it can be applied to larger scale datasets. For more efficient computation, we present some pruning rules based on theoretical properties of formal concepts. A depth-first branch-and-bound algorithm with the prunings is designed. Our experimental results show valuable clusters can be extracted from a collection of web documents. Moreover, the algorithm outperforms some fast algorithms for mining closed itemsets equivalent to formal concepts.
Citation:
Yoshiaki Okubo, Makoto Haraguchi, "Finding Conceptual Document Clusters with Improved Top-N Formal Concept Search," wi, pp.347-351, 2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI'06), 2006
Usage of this product signifies your acceptance of the Terms of Use.