Performance Improvements to the BBN Byblos OCR System
Seoul, Korea August 31-September 01
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICDAR.2005.189
Eighth International Conference on Do ...
Michael Decerbo, BBN Technologies, MA, USA
Premkumar Natarajan, BBN Technologies, MA, USA
Rohit Prasad, BBN Technologies, MA, USA
Ehry MacRostie, BBN Technologies, MA, USA
In this paper, we describe four recent enhancements to the BBN Byblos OCR System, a multilingual HMM-based character recognition system that has been demonstrated on a variety of languages, including English, Arabic, Chinese, and Japanese. These enhancements are implemented as optional extensions to the system and provide improved performance for certain scripts or domains. Projection-based reestimation of line boundaries reduces instability in the presence of some types of noise. An alternate modeling strategy used in the first of two recognition search passes substantially increases speed on languages with a large number of characters. Another speed improvement comes from automatic discovery and modeling of sub-characters. The use of Heteroscedastic Linear Discriminant Analysis (HLDA) makes modeling more tractable by reducing feature-space dimensionality.
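The abstract does not say how the projection-based reestimation of line boundaries is carried out. As a rough illustration of the general idea behind projection profiles only, and not the method used in the Byblos system, the sketch below sums the ink pixels in each row of a binary image and treats runs of rows above a threshold as text lines. The function names, the `min_ink` threshold, and the toy `page` image are all assumptions made for this example.

```python
def row_projection(image):
    """Count ink pixels in each row of a binary image (1 = ink, 0 = background)."""
    return [sum(row) for row in image]


def find_line_boundaries(image, min_ink=1):
    """Return inclusive (top, bottom) row indices for each run of rows
    whose ink count is at least min_ink (a hypothetical threshold)."""
    profile = row_projection(image)
    lines = []
    start = None
    for y, ink in enumerate(profile):
        if ink >= min_ink and start is None:
            start = y                      # a text line begins here
        elif ink < min_ink and start is not None:
            lines.append((start, y - 1))   # the line ended on the previous row
            start = None
    if start is not None:                  # a line runs to the bottom edge
        lines.append((start, len(profile) - 1))
    return lines


# Toy 8x6 "page": two text lines with a single noise pixel between them.
page = [
    [0, 0, 0, 0, 0, 0],
    [0, 1, 1, 0, 1, 0],
    [0, 1, 0, 1, 1, 0],
    [0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 1, 0],  # isolated speck of noise
    [0, 0, 0, 0, 0, 0],
    [1, 1, 0, 1, 1, 1],
    [0, 1, 1, 1, 0, 0],
]

print(find_line_boundaries(page))             # noise row is reported as a line
print(find_line_boundaries(page, min_ink=2))  # noise row is suppressed
```

With the default threshold the noise speck is reported as its own line; raising `min_ink` to 2 suppresses it. This is one simple way a projection profile can make boundary estimates less sensitive to sparse noise.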
Citation:
Michael Decerbo, Premkumar Natarajan, Rohit Prasad, Ehry MacRostie, "Performance Improvements to the BBN Byblos OCR System," Eighth International Conference on Document Analysis and Recognition (ICDAR'05), pp. 411-415, 2005