Your cart is empty!
0 reviews / Write a review
Available Under License: Commercial Research
Under the Indian Languages Corpora Initiative phase –II (ILCI Phase-II) project, initiated by the MeitY, Govt. of India, Jawaharlal Nehru University, New Delhi had collected monolingual corpus in English. This is the final outcome of the project and there are 30,000 sentences of general domain. The translated sentences have been Chunked tagged according to BIS (Bureau of Indian Standards) tagset. This corpus has following features: unique ID, UTF-8 encoding, and text file format.
Tags: English, Monolingual, Chunked Tagged, Text Corpus, ILCI