Improving the Results and Performance of Clustering Bit-encoded XML Documents
Hong Kong, China, December 18-22, 2006
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICDMW.2006.97
Sixth IEEE International Conference o ...
Michal Kozielski, Silesian University of Technology, Gliwice, Poland
Clustering XML documents according to their structure is one of the techniques that may improve the effectiveness of XML document storage and retrieval. One existing approach to this problem is to encode the structure of each XML document as a string of bits and to cluster the resulting feature vectors. The weaknesses of this method are the high dimensionality and sparseness of the feature vectors. The paper presents four methods for reducing the dimensionality of the bit feature vectors. Two of these methods are novel: they are dedicated to XML documents and are applied during the encoding process. The results show that these inner-encoding methods are efficient and that, in some cases, they improve the clustering results. The methods presented in the paper are tested on two datasets of XML documents with different characteristics.
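To make the idea of bit-encoding concrete, the following is a minimal sketch and not the encoding used in the paper. It assumes, purely for illustration, that each bit marks the presence of one root-to-element tag path in a document; the function names `tag_paths`, `encode`, and `hamming` and the sample documents are hypothetical. It shows why such vectors tend to become long and sparse as the collection grows, and it does not implement any of the four reduction methods.

```python
import xml.etree.ElementTree as ET


def tag_paths(xml_text):
    """Collect every root-to-element tag path in one document (assumed feature)."""
    found = set()

    def walk(node, prefix):
        path = prefix + "/" + node.tag
        found.add(path)
        for child in node:
            walk(child, path)

    walk(ET.fromstring(xml_text), "")
    return found


def encode(docs):
    """Return the shared path vocabulary and one bit vector per document."""
    path_sets = [tag_paths(d) for d in docs]
    vocabulary = sorted(set().union(*path_sets))
    vectors = [[1 if p in s else 0 for p in vocabulary] for s in path_sets]
    return vocabulary, vectors


def hamming(a, b):
    """Number of positions where two bit vectors differ (a generic distance)."""
    return sum(x != y for x, y in zip(a, b))


# Hypothetical sample collection, not taken from the paper's datasets.
docs = [
    "<book><title/><author/></book>",
    "<book><title/><year/></book>",
    "<article><title/><author/></article>",
]
vocabulary, vectors = encode(docs)

# Fraction of zero bits: every new distinct path adds a column
# that is zero for most documents, so the vectors become sparse.
zeros = sum(v.count(0) for v in vectors)
sparsity = zeros / (len(vectors) * len(vocabulary))
```

In this sketch the two `book` documents differ in fewer bits than either does from the `article` document, which is the kind of structural similarity a clustering algorithm would exploit.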
Citation:
Michal Kozielski, "Improving the Results and Performance of Clustering Bit-encoded XML Documents," icdmw, pp.60-64, Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06), 2006