Indian Language to Indian Language Machine Translation Punjabi - Hindi parallel corpus produced by human translators. The corpus is drawn from web covering tourism/travel and other in the ratio of 3:1. There are total 1000 sentences comprised of very simple, simple, complex and compound sentence structures.
The sentence structures include relative clauses, complement clauses, finite and non-finite conjunctions. Text encoding is UTF- 8.