loading...
Novelty-based Incremental Document Clustering for On-line Documents
Atlanta, Georgia April 03-April 07
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICDEW.2006.10022nd International Conference on Data ...
 This Article 
 
PDF
HTML
 
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Sophoin Khy, University of Tsukuba, Japan
Yoshiharu Ishikawa, University of Tsukuba, Japan
Hiroyuki Kitagawa, University of Tsukuba, Japan
Document clustering has been used as a core technique in managing vast amount of data and providing needed information. In on-line environments, generally new information gains more interests than old one. Traditional clustering focuses on grouping similar documents into clusters by treating each document with equal weight. We proposed a novelty-based incremental clustering method for on-line documents that has biases on recent documents. In the clustering method, the notion of ?novelty? is incorporated into a similarity function and a clustering method, a variant of the K-means method, is proposed. We examine the efficiency and behaviors of the method by experiments.
Citation:
Sophoin Khy, Yoshiharu Ishikawa, Hiroyuki Kitagawa, "Novelty-based Incremental Document Clustering for On-line Documents," icdew, pp.40, 22nd International Conference on Data Engineering Workshops (ICDEW'06), 2006
Usage of this product signifies your acceptance of the Terms of Use.