Many classification methods have been proposed to find patterns in text documents. However, according to Occam?s razor principle, "the explanation of any phenomenon should make as few assumptions as possible", short patterns usually have more explainable and meaningful for classifying text documents. In this paper, we propose a depth-first pattern generation algorithm, which can find out short patterns from text document more effectively, comparing with breadth-first algorithm.
Index Terms:
Document Categorization, rule generation, breadth-first, depth-first.
Citation:
Jiyuan An, Yi-Ping Phoebe Chen, "Finding Short Patterns to Classify Text Documents," wi, pp.293-296, 2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI'06), 2006