This paper presents a universal Part-of-Speech (POS) tag set, built on the W3C XML framework, that covers the major Indian languages. The work attempts to develop a common national framework for POS tag sets for Indian languages, enabling a reusable and extensible architecture for the development of Web-based Indian language technologies such as Machine Translation, Cross-lingual Information Access and other Natural Language Processing technologies. The POS tag schema has been developed for 13 Indian languages and is being extended to all 22 constitutionally recognized Indian languages. The schema has been developed using international standards, e.g. metadata as per ISO 12620:1999 and schema as per W3C XML internationalization guidelines, with a one-to-one mapping of the labels used in the 13 Indian languages.

Added on April 19, 2012


  More Details
  • Product Type : Research Paper
  • License Type : Freeware
  • System Requirement : Not Applicable
  • Author : Swaran Lata, Somnath Chandra, Prashant Verma, Swati Arora