Y. Xia, Institute of Automation, Chinese Academy of Sciences
B.-H. Xiao, Institute of Automation, Chinese Academy of Sciences
C.-H. Wang, Institute of Automation, Chinese Academy of Sciences
R.-W. Dai, Institute of Automation, Chinese Academy of Sciences
This paper presents a general framework for integrating segmentation and recognition, together with a novel method for identifying the language attribute of characters in mixed Chinese/English text. The method has four main features. First, it treats a text line, rather than a character segment, as the processing unit. Second, it uses multiple features, based on multi-phase segmentation. Third, it applies two types of feedback throughout segmentation and recognition: feedback from character recognition and feedback from character feature statistics within a text line. Fourth, it adapts to the quality and genre of documents.
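As a rough illustration of feedback from feature statistics within a text line, the sketch below labels segments as Chinese or English by width and then re-thresholds using the median width of the tentatively Chinese segments in the same line. It is a minimal sketch, not the authors' method: the function name `classify_line`, the width-only feature, and the thresholds `ratio` and `feedback_ratio` are all assumptions made for this example.

```python
from statistics import median


def classify_line(segment_widths, line_height, ratio=0.75, feedback_ratio=0.8):
    """Label each segment of one text line as 'zh' or 'en' (hypothetical heuristic).

    Assumption: Chinese characters are roughly as wide as the line is tall,
    while English letters are noticeably narrower.
    """
    # First pass: tentative labels from width relative to the line height.
    labels = ["zh" if w >= ratio * line_height else "en" for w in segment_widths]

    zh_widths = [w for w, label in zip(segment_widths, labels) if label == "zh"]
    if not zh_widths:
        # No line-level statistic is available, so keep the first-pass labels.
        return labels

    # Feedback pass: re-threshold against the typical Chinese width in this line.
    typical = median(zh_widths)
    return ["zh" if w >= feedback_ratio * typical else "en" for w in segment_widths]
```

In this sketch, a segment of width 26 in a line of height 40 is first labeled English, then relabeled Chinese once the line's own median Chinese width (31.5) sets the threshold. A real system would combine several features and recognition confidence rather than width alone.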
Citation:
Y. Xia, B.-H. Xiao, C.-H. Wang, R.-W. Dai, "Integrated Segmentation and Recognition of Mixed Chinese/English Document," Ninth International Conference on Document Analysis and Recognition (ICDAR 2007), vol. 2, pp. 704-708, 2007.