Paradigm Table consists of part of speech categories to cater for GNP information and post position to the root word (with appropriate modification). The categorization follows norms led down by AnglaBharati requirement.
The system uses the IITK-Punjabi Roman notations for the translation process. When it comes to the user interface side these notations have to be converted to some standard codes like Punjabi Unicode. Similarly the lexical data which is in entered in Punjabi Unicode has to be converted to IITK-Punjabi Roman notation for internal processing. The system output is displayed in Devanagari and INSROT also,to enable a non native person to read any target language.
A post-processor is an integral part of any OCR system. This paper proposes a method for detection and correction of errors in recognition results of handwritten and machine printed Gurmukhi OCR. Based on the shape similarity of characters, the consonants of Gurmukhi Script are divided into different sets.
The problem of orientation detection of all major printed Indian scripts is considered. For a large skew, the corrected document may be oriented in 0° or 180°. The OCR system will make fatal error if the document is in 180° orientation.
The paper describes an initial attempt to use Levins verb classes for the task of preposition sense selection in English to Indian language machine translation. Two language pairs have been selected to describe the approach, English-Hindi and English-Telugu. We exploit the correspondence of verb classs semantics vis--vis some prepositions it takes.
Added on August 4, 2010
Product Type : Research Paper
License Type : Freeware
System Requirement :
Author : Samar Husain, Phani Chaitanya, Ganeshwar Rao Dulam, Tariq Khan, Dipti M Sharma