Under the Indian Languages Corpora Initiative (ILCI) project initiated by the DeitY, Govt. of India, Jawaharlal Nehru University, New Delhi had collected corpus in Hindi as source language and translated it in Gujarati as target language. There are 25,000 sentences of Tourism domain in this corpus. Each sentence has a unique ID. The translated sentences have been POS tagged according to BIS (Bureau of Indian Standards) tagset.