Tamil is one of the longest-surviving classical languages in the world. It belongs to the Dravidian language family.
The Tamil Text Corpus is encoded in machine-readable form and stored in a standard format. The main encoding used is Unicode, and the data is stored in XML format with embedded metadata. The corpus was created from contemporary text, collected both by typing and by crawling. LDC-IL Tamil Text Corpus details:
Tags: Tamil, Raw Text Corpus