Under the Indian Languages Corpora Initiative (ILCI) project initiated by the DeitY, Govt. of India, Jawaharlal Nehru University, New Delhi had collected corpus in Hindi as source language and translated it in Urdu as target language. There are 25,000 sentences of Health domain. Each sentence has a unique ID. The translated sentences have been POS tagged according to BIS (Bureau of Indian Standards) tagset.