loading...
An Effective Support Vector Machines (SVMs) Performance Using Hierarchical Clustering
Boca Raton, Florida November 15-November 17
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICTAI.2004.2616th IEEE International Conference on ...
 This Article 
 
PDF
HTML
 
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Mamoun Awad, University of Texas at Dallas
Latifur Khan, University of Texas at Dallas
Farokh Bastani, University of Texas at Dallas
I-Ling Yen, University of Texas at Dallas
The training time for SVMs to compute the maximal marginal hyper-plane is at least O(N²) with the data set size N, which makes it non-favorable for large data sets. This paper presents a study for enhancing the training time of SVMs, specifically when dealing with large data sets, using hierarchical clustering analysis. We use the Dynamically Growing Self-Organizing Tree (DGSOT) Algorithm for clustering because it has proved to overcome the drawbacks of traditional hierarchical clustering algorithms. Clustering analysis helps find the boundary points, which are the most qualified data points to train SVMs, between two classes. We present a new approach of combination of SVMs and DGSOT, which starts with an initial training set and expands it gradually using the clustering structure produced by the DGSOT algorithm. We compare our approach with the Rocchio Bundling technique in terms of accuracy loss and training time gain using two benchmark real data sets.
Citation:
Mamoun Awad, Latifur Khan, Farokh Bastani, I-Ling Yen, "An Effective Support Vector Machines (SVMs) Performance Using Hierarchical Clustering," ictai, pp.663-667, 16th IEEE International Conference on Tools with Artificial Intelligence (ICTAI'04), 2004
Usage of this product signifies your acceptance of the Terms of Use.