This paper proposes a novel Chinese customer address clustering algorithm by considering both the customers? postal code and the text clustering technique. Based on thousands of student hometown addresses in a university, an experiment is done and analyzed. The results show that the algorithm presents in this paper has the advantages of high effectiveness of computation and good clustering ability when comparing with the Standard K-means algorithm.
Citation:
Jian-Xun Liu, Yi-Ping Wen, Jie Liu, "NCAC: A Novel Chinese Address Clustering Algorithm," skg, pp.56, Second International Conference on Semantics, Knowledge, and Grid (SKG'06), 2006