•    Freeware
  •    Shareware
  •    Research
  •    Localization Tools 20
  •    Publications 696
  •    Validators 2
  •    Mobile Apps 22
  •    Fonts 31
  •    Guidelines/ Draft Standards 3
  •    Documents 13
  •    General Tools 38
  •    NLP Tools 105
  •    Linguistic Resources 251

Search Results | Total Results found :   1181

You refine search by : All Results
  Catalogue
Performance of an OCR system is badly affected due to presence of hand-drawn annotation lines in various forms, such as underlines, circular lines, and other text-surrounding curves. Such annotation lines are drawn by a reader usually in free hand in order to summarize some text or to mark the keywords within a document page. In this paper, we propose a generalized scheme for detection and removal of these hand-drawn annotations from a scanned document page. An underline drawn by hand is roughly horizontal or has a tolerable undulation, whereas for a hand-drawn curved line, the slope usually changes at a gradual pace.

Added on August 27, 2018

2

  More Details
  • Contributed by : OCR Consortium
  • Product Type : Research Paper
  • License Type : Freeware
  • System Requirement : Not Applicable
  • Author : Sanjoy Pratihar, Partha Bhowmick, Shamik Sural, Jayanta Mukhopadhyay

Rubber stamps on document pages often overlap and obscure the text very badly, thereby impairing its readability and deteriorating the performance of an optical character recognition system. Removal of rubber stamps from a document image is, therefore, essential for successfully converting a document image into an editable electronic form. We propose here an effective technique for rubber stamp removal from scanned document images. It is based on the novel idea of a single feature obtained by projecting the pixel colors of the image foreground along the eigenvector corresponding to the first principal component in HSV color space.

Added on August 27, 2018

2

  More Details
  • Contributed by : Consortium
  • Product Type : Research Paper
  • License Type : Freeware
  • System Requirement : Not Applicable
  • Author : Soumyadeep Dey, Jayanta Mukherjee,Shamik Sural

English words are frequently encountered in Gurmukhi texts. A monolingual Gurmukhi OCR will recognize such words as garbage. It becomes necessary to add bilingual capability to the Gurmukhi OCR to recognize English text too. But adding bilingual capability reduces the recognition accuracy for monolingual texts due to errors in script identification. Even a system with 99% script identification accuracy results in reduction of 1% recognition accuracy on monolingual text. In this paper, we present a bilingual OCR, which recognizes both English and Gurmukhi scripts without any significant reduction in recognition accuracy as compared to the monolingual Gurmukhi OCR when recognizing monolingual Gurmukhi text.

Added on August 17, 2018

5

  More Details
  • Contributed by : Consortium
  • Product Type : Publications
  • License Type : Freeware
  • System Requirement : Not Applicable
  • Author : Gurpreet Singh Lehal

e-Aksharayan is a Desktop software for converting scanned printed Indian Language documents into a fully editable text format in Unicode encoding. Works on Windows 7,8, and 10.

Input and output specifications
• Works on Windows 7,8, and 10.
• The Software supports BMP,TIFF & PNG formats.
• Output formats supported are RTF,TXT,DOC.
• Gray level and black ’n’ white images can be given as input.
• Image dimensions up to 3500 × 3500 pixels.
• Minimum scanning resolution supported 300dpi.
• Maximum input skew supported 15degrees.
• Equipped with Unicode typing tool for typing in Indian Language
• Sakal Bharati font (11 Indian Language scripts in a Single font) is also provided.
• The system recognizes up to 5 pages at a time.

Added on August 1, 2018

222

  More Details
  • Contributed by : OCR Consortium
  • Product Type : General Tools
  • License Type : Freeware
  • System Requirement : Windows

e-Aksharayan is a Desktop software for converting scanned printed Indian Language documents into a fully editable text format in Unicode encoding. Works on Windows 7,8, and 10.

Input and output specifications
• Works on Windows 7,8, and 10.
• The Software supports BMP,TIFF & PNG formats.
• Output formats supported are RTF,TXT,DOC.
• Gray level and black ’n’ white images can be given as input.
• Image dimensions up to 3500 × 3500 pixels.
• Minimum scanning resolution supported 300dpi.
• Maximum input skew supported 15degrees.
• Equipped with Unicode typing tool for typing in Indian Language
• Sakal Bharati font (11 Indian Language scripts in a Single font) is also provided.
• The system recognizes up to 5 pages at a time.


Added on August 1, 2018

114

  More Details
  • Contributed by : OCR Consortium
  • Product Type : General Tools
  • License Type : Freeware
  • System Requirement : Windows