Research Areas

The Technology Development for Indian Languages (TDIL) Programme of the Ministry of Electronics and Information Technology (MeitY), Government of India, works on the following research areas. Its aim is to enable the masses to build a knowledge society, to ensure communication without language barriers, and to move up the knowledge chain. Apart from the TDIL-DC infrastructure, these research outcomes are also hosted on the NIC Meghraj Cloud.

Development of corpora of texts in machine readable form
This area provides authentic data on the contemporary use of Indian languages to both computer scientists and linguists for their academic, research, and development activities.

Machine Translation System
Machine translation converts text in one natural language into another using computational applications, without real-time human intervention or with minimal human effort. The software developed under the Machine Translation project is as follows:


Development of English to Indian Languages Machine Translation (MT) System
Development of English to Indian Languages Machine Translation (MT) System with Angla-Bharti Technology
Development of Indian Language to Indian Language Machine Translation System


Development of Cross-lingual Information Access
Cross-Lingual Information Access (CLIA) systems make it possible for users to directly access sources of information that may be available in languages other than the language of the query.
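
As a rough illustration of the cross-lingual idea, the sketch below translates an English query into Hindi terms with a small lookup table and then matches those terms against Hindi documents. It is a toy example under stated assumptions, not the CLIA system itself: the lexicon, the documents, and the function names `translate_query` and `search` are all hypothetical.

```python
from typing import Dict, List

# Hypothetical bilingual lexicon: English query terms -> Hindi terms.
LEXICON: Dict[str, List[str]] = {
    "rain": ["बारिश", "वर्षा"],
    "farmer": ["किसान"],
}

# Hypothetical document collection written in the target language.
DOCUMENTS: Dict[str, str] = {
    "doc1": "किसान बारिश का इंतजार कर रहे हैं",
    "doc2": "आज मौसम अच्छा है",
}


def translate_query(query: str) -> List[str]:
    """Map each query word to its target-language terms.

    Words missing from the lexicon are passed through unchanged.
    """
    terms: List[str] = []
    for word in query.lower().split():
        terms.extend(LEXICON.get(word, [word]))
    return terms


def search(query: str) -> List[str]:
    """Return ids of documents that contain at least one translated term,
    ordered by the number of matching terms (highest first)."""
    terms = translate_query(query)
    scores: Dict[str, int] = {}
    for doc_id, text in DOCUMENTS.items():
        tokens = text.split()
        score = sum(1 for term in terms if term in tokens)
        if score:
            scores[doc_id] = score
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


if __name__ == "__main__":
    print(search("farmer rain"))
```

A real system would replace the lookup table with a proper translation or transliteration component and the word matching with a ranked retrieval engine; the sketch only shows where query translation sits relative to retrieval.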

Development of Robust Document Analysis & Recognition System for Indian Languages (OCR)
Optical Character Recognition (OCR) is a utility for digitizing content and is essential for developing knowledge networks such as digital libraries. OCR technology makes it possible to scan and store printed text. It has three basic elements: scanning, recognition, and reading the text.

Development of On-line handwriting recognition system (OHWR)
An on-line handwriting recognition (OHWR) system is a useful tool that converts an individual's written strokes into editable text, bypassing the need for a keyboard for text entry.

Development of Text to Speech System for Indian Languages (TTS)
A text-to-speech system converts normal language text into speech. The objective of this project is to develop and deploy a text-to-speech system for visually challenged persons, thus contributing to a social cause.

Development of Sanskrit Hindi Machine Translation System (SHMT)
There have been several efforts in India to develop computational tools for Sanskrit. The objective of this project is to develop Sanskrit to Hindi machine translation technology using these Sanskrit computational tools.