loading...
Random-Walk Term Weighting for Improved Text Classification
Irvine, California September 17-September 19
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICSC.2007.56International Conference on Semantic ...
 This Article 
 
PDF
HTML
 
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Samer Hassan, University of North Texas, USA
Rada Mihalcea, University of North Texas, USA
Carmen Banea, University of North Texas, USA
This paper describes a new approach for estimating term weights in a document, and shows how the new weighting scheme can be used to improve the accuracy of a text classifier. The method uses term co-occurrence as a measure of dependency between word features. A random-walk model is applied on a graph encoding words and co-occurrence dependencies, resulting in scores that represent a quantification of how a particular word feature contributes to a given context. Experiments performed on three standard classification datasets show that the new random-walk based approach outperforms the traditional term frequency approach of feature weighting.
Citation:
Samer Hassan, Rada Mihalcea, Carmen Banea, "Random-Walk Term Weighting for Improved Text Classification," icsc, pp.242-249, International Conference on Semantic Computing (ICSC 2007), 2007
Usage of this product signifies your acceptance of the Terms of Use.