loading...
Prototype Selection for Finding Efficient Representations of Dissimilarity Data
Quebec City, QC, Canada August 11-August 15
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICPR.2002.104778916th International Conference on Patt ...
 This Article 
 
PDF
HTML
 
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Elżbieta Pekalska, Delft University of Technology
Robert P. W. Duin, Delft University of Technology

The nearest neighbor (N N) ru le is a simple and intuitive method for solving classification problems. Originally, it uses distances to the complete training set. It performs well, however, it is sensitive to noisy objects, due to its operation on local neighborhoods only . A more global approach is possible by mapping the distance data on to a pseudo-Euclidean space, such that the distances are preserved as well as possible. Then, a classifier built in such a space can out perform the NN rule. However, again all objects from th e training set are used for a projection of new data.

This paper addresses the issue of reducing the training set while possibly preserving the original structure of the mapped data. Some criteria are introduced and evaluated against two problems, polygon recognition and digit recognition. Our experiments show that the representation mismatch criterion is beneficial for the applications considered. Moreover, the linear classifier built in the pseudo-Euclidean space, determined by 20% -25% of the training objects, outperforms the NN rule based on all of them.

Citation:
Elżbieta Pekalska, Robert P. W. Duin, "Prototype Selection for Finding Efficient Representations of Dissimilarity Data," icpr, vol. 3, pp.30037, 16th International Conference on Pattern Recognition (ICPR'02) - Volume 3, 2002
Usage of this product signifies your acceptance of the Terms of Use.