- Contributor: EILMT Consortia
- Product Code: EILMT-ENG-URD-TEXT-0733
Available Under License: Commercial Research
English-Urdu Parallel Tourism Text corpus is developed in Unicode under English to Indian Language Machine Translation (EILMT) consortium. The core vocabulary of this corpus consist of various names, destinations, visiting places, vocabularies from art & architecture, culture and civilization. By and large, the corpus contains basically descriptions and information pertaining to tourist destinations and related matters of tourist interests. This corpus is created in XML & XLS formats and size of the corpus is approx.5300 sentences.
Text Corpus Attributes | |
Language | English - Urdu |
Parallel or Monolingual | Parallel |
Annotation | Not Annotated |
No. of Sentences | 5300 sentences |
Word-Count | 118903 words |
File Format | XLS formats |
Encoding | UTF-8 |
File Size | 3.06 MB |
Tags: English-Urdu, Parallel, Tourism, Text corpus