Under the Indradhanush WordNet project, initiated by MeitY, Govt. of India, the IndoWordNet API provides a simple way to access and manipulate the WordNet resource independently of the underlying storage technology. The functionality is exposed through a set of well-defined objects that a developer can create and manipulate according to their processing requirements. Although the current implementation expects the data to be available in a relational database, a two-layered architecture separates the functionality offered to the user from the data access functions, so that a different data storage technology or design can be supported in future without changing the API provided to the developer.
A new relational database design has been implemented for this purpose. This design (the IndoWordNet database) supports storage of WordNets for multiple languages. An effort has been made to optimise the design to reduce redundancy. Data common to all languages, i.e. ontology information, semantic relationships, etc., is stored in a separate master database, while data specific to a language, i.e. synsets, lexical relationships, etc., is stored separately in the database of the respective language.
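The separation of the two layers, and the split between shared and language-specific data, can be illustrated with a minimal Python sketch. It is not the IndoWordNet API itself: every class, method and field name below (Synset, DataAccessLayer, InMemoryBackend, WordNetAPI) is hypothetical, the sample data is invented, and an in-memory backend stands in for the relational database so that the example runs on its own. The sketch also assumes that ontology entries are keyed by synset identifier.

```python
from abc import ABC, abstractmethod
from dataclasses import dataclass


@dataclass(frozen=True)
class Synset:
    """Language-specific record (hypothetical shape)."""
    synset_id: int
    language: str
    words: tuple
    gloss: str


class DataAccessLayer(ABC):
    """Lower layer: the only code that knows how the data is stored."""

    @abstractmethod
    def fetch_synsets(self, language, word):
        ...

    @abstractmethod
    def fetch_ontology_node(self, synset_id):
        ...


class InMemoryBackend(DataAccessLayer):
    """Stand-in for a relational backend, used only to keep the sketch runnable."""

    def __init__(self, master, per_language):
        self._master = master              # shared data: synset_id -> ontology node
        self._per_language = per_language  # language -> list of Synset

    def fetch_synsets(self, language, word):
        return [s for s in self._per_language.get(language, []) if word in s.words]

    def fetch_ontology_node(self, synset_id):
        return self._master.get(synset_id)


class WordNetAPI:
    """Upper layer: what the developer calls; it never touches storage directly."""

    def __init__(self, backend):
        self._backend = backend

    def synsets(self, language, word):
        return self._backend.fetch_synsets(language, word)

    def ontology(self, synset):
        return self._backend.fetch_ontology_node(synset.synset_id)


# Invented sample data: one shared ontology entry, two language stores.
backend = InMemoryBackend(
    master={101: "Natural Object"},
    per_language={
        "hindi": [Synset(101, "hindi", ("surya", "ravi"), "placeholder gloss")],
        "marathi": [Synset(101, "marathi", ("surya",), "placeholder gloss")],
    },
)
api = WordNetAPI(backend)
hits = api.synsets("hindi", "ravi")
```

Because WordNetAPI depends only on the abstract DataAccessLayer, a backend using a different storage technology could be substituted without changing the calls made by the developer, which is the property the two-layered architecture is meant to provide.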
Parsing is the process of assigning grammatical labels to each chunk or constituent in a sentence. The Full Parser analyses a sentence in terms of syntactic dependency relations. Identifying the grammatical labels (karaka and non-karaka relations) for each word of the sentence helps many applications, such as word sense disambiguation (WSD) and named entity recognition (NER). Several approaches are used for parsing, including rule-based, statistics-based and transformation-based approaches; this module uses a statistics-based approach.
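As an illustration of what an analysis in terms of dependency relations looks like, the following Python sketch stores labelled arcs for a toy transliterated sentence, "raam ne seb khaayaa" ("Ram ate an apple"). The arcs are written by hand and are not output of the Full Parser. The labels k1 (karta) and k2 (karma) are assumed to follow the commonly used Paninian tag scheme, the "root" label is a placeholder, and the DependencyArc type and dependents_of helper are hypothetical names introduced only for this example.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class DependencyArc:
    dependent: str          # the word (or chunk head) being labelled
    head: Optional[str]     # the word it depends on; None for the sentence root
    label: str              # karaka or non-karaka relation


# Hand-written analysis of a toy sentence (assumed labels, not parser output).
arcs = [
    DependencyArc("raam", "khaayaa", "k1"),   # karta: the doer of the action
    DependencyArc("seb", "khaayaa", "k2"),    # karma: the object of the action
    DependencyArc("khaayaa", None, "root"),   # placeholder label for the main verb
]


def dependents_of(head, arcs):
    """Map each relation label to the dependent attached to `head`."""
    return {a.label: a.dependent for a in arcs if a.head == head}


roles = dependents_of("khaayaa", arcs)
```

A downstream application such as WSD or NER could then look up, for a given verb, which word fills each relation instead of working from word order alone.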