This paper presents a scheme based upon XML based label-ing for managing a large multilingual OCR project. Managing a large multi-lingual OCR project involving multiple research groups, develop-ing script specific and script independent technologies in a collaborative fashion is a challenging problem.
During scanning of documents the image may get skewed because of improper alignment of paper on the scanner, which results in wrong alignment of text on the document image. In some cases the image may even have double skew both at the page level and at word level due to curl near the binding of the book or in old typed/printed documents.Therefore skew detection and correction becomes an indispensable pre-processing task before the recognition of the text.
Added on July 14, 2010
Product Type : Research Paper
License Type : Freeware
System Requirement :
Author : Dharam Veer Sharma,Gurpreet Singh Lehal Department