The segmentation of handwritten Chinese text into Chinese characters is an important preprocessing step to the offline Chinese character recognition. It is also a very difficult task due to so many Chinese characters and their handwritten structures can be very complex. Many researchers have developed various algorithms during the past decade [1-6]. In this paper, we compare two of the existing algorithms. The first one is spatial shape-based algorithm proposed in [5], which segments the character strings into radicals, not dealing with stroke identification. The second algorithm is stroke-based [2-4], which traces each stroke and draws the stroke-bounding box, then merges the boxes by a set of rules. Based on the algorithms presented in [2-5], we wrote C++ programs for time complexity, accuracy performance comparisons using different handwritten Chinese character texts. Our experimental result shows that the spatial shape-based algorithm [5] is faster and more accurate.
Index Terms:
Segmentation, character recognition, computer vision, image processing
Citation:
Jian Yang, Han Zhang, Mark Dencler, Chao Lu, "Comparison of Shape-Based and Stroke-Based Methods for Segmenting Handwritten Chinese Characters," icis, pp.114-119, Fourth Annual ACIS International Conference on Computer and Information Science (ICIS'05), 2005