• Hindi Annotated  Text Corpus - IIIT Hyderabad
Hindi Annotated Text Corpus - IIIT Hyderabad

Available Under License: CC BY-NC-SA 4.0  

Sample Download | size: 10.6KB | type: zip
Added on : 17 Mar 2021

Hindi Annotated corpus developed Under NLTM Pilot by IIIT-Hyderabad (Part1). Domains of the Corpus are Chemistry, Law, News & General,

HealthCare, Education Others, open education books.


Text Corpus Attributes
Language Hindi
Parallel or Monolingual Annotated
Annotation Annotated, POS Tagged
Domain Chemistry, Law, News & General, HealthCare, Open Education books.
No. of Sentences 100000 Sentences
Validated Yes
File Format Text File
Encoding UTF-8
Conformance to Standards/Best Practices Human Verified
File Size 15.5 MB (Compressed)
Updated Date 17 June 2021

Write a review

Please login or register to review

Tags: NLTM Pilot, Hindi, Telugu, Hindi–Telugu, Annotated, Text Corpus, IIIT-Hyderabad

Disclaimer: The information provided on this page has been procured through different sources. Please write back to us at nplt_support[at]cdac[dot]in in case you would like to suggest an update.