loading...
Estimation of False Negatives in Classification
Brighton, United Kingdom November 01-November 04
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICDM.2004.10048Fourth IEEE International Conference ...
 This Article 
 
PDF
HTML
 
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Sandeep Mane, University of Minnesota, Minneapolis
Jaideep Srivastava, University of Minnesota, Minneapolis
San-Yih Hwang, National Sun-Yat-Sen University, Kaohsiung, Taiwan
Jamshid Vayghan, IBM Corporation, Minneapolis
In many classification problems such as spam detection and network intrusion, a large number of unlabeled test instances are predicted negative by the classifier. However, the high costs as well as time constraints on an expert's time prevent further analysis of the "predicted false" class instances in order to segregate the false negatives from the true negatives. A systematic method is thus required to obtain an estimate of the number of false negatives. A capture-recapture based method can be used to obtain an ML-estimate of false negatives when two or more independent classifiers are available. In the case for which independence does not hold, we can apply log-linear models to obtain an estimate of false negatives. However, as shown in this paper, lesser the dependencies among the classifiers, better is the estimate obtained for false negatives. Thus, ideally independent classifiers should be used to estimate the false negatives in an unlabeled dataset. Experimental results on the spam dataset from the UCI Machine Learning Repository are presented.
Citation:
Sandeep Mane, Jaideep Srivastava, San-Yih Hwang, Jamshid Vayghan, "Estimation of False Negatives in Classification," icdm, pp.475-478, Fourth IEEE International Conference on Data Mining (ICDM'04), 2004
Usage of this product signifies your acceptance of the Terms of Use.


Suggestions