loading...
Adaptive Spam Filtering Using Dynamic Feature Space
Hong Kong, China November 14-November 16
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICTAI.2005.2817th IEEE International Conference on ...
 This Article 
 
PDF
HTML
 
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Yan Zhou, University of South Alabama
Madhuri S. Mulekar, University of South Alabama
Praveen Nerellapalli, University of South Alabama
Unsolicited bulk e-mail, also known as spam, has been an increasing problem for the e-mail society. This paper presents a new spam filtering strategy that 1) uses a practical entropy coding technique, Huffman coding, to dynamically encode the feature space of e-mail collections over time and, 2) applies an online algorithm to adaptively enhance the learned spam concept as new e-mail data becomes available. The contributions of this work include a highly efficient spam filtering algorithm in which the input space is radically reduced to a single-dimension input vector, and an adaptive learning technique that is robust to vocabulary change, concept drifting and skewed data distribution. We compare our technique to several existing off-line learning techniques including Support Vector Machine, Na??ve Bayes, -Nearest Neighbor, C4.5 decision tree, RBFNetwork, Boosted decision tree and Stacking, and demonstrate the effectiveness of our technique by presenting the experimental results on the e-mail data that is publicly available.
Citation:
Yan Zhou, Madhuri S. Mulekar, Praveen Nerellapalli, "Adaptive Spam Filtering Using Dynamic Feature Space," ictai, pp.302-309, 17th IEEE International Conference on Tools with Artificial Intelligence (ICTAI'05), 2005
Usage of this product signifies your acceptance of the Terms of Use.