
Health Text Corpora-Malayalam
Under the Indian Languages Corpora Initiative (ILCI) project initiated by the DIT, Govt. of India, Jawaharlal Nehru University, New Delhi collected a corpus with Hindi as the source language, which was then translated into 11 languages by 9 universities across India. This resource contains 25,000 Health-domain sentences in Malayalam. Each sentence has a unique ID. The translated sentences have been POS-tagged according to the BIS (Bureau of Indian Standards) tagset.
Contributed by: JNU
Product Type: Linguistic Resources
License Type: Research
System Requirement: Not Applicable
Full Download
Supporting Document