Your cart is empty!
Resources owned and managed by Startups/ MNCs/ Private Companies working in NLP/ Language Technology/ Localization etc. domains.
This Hindi-Magahi parallel data set, having total 1000 sentences (500 dev, 500 test) has been release under license: CC BY-NC-SA 4.0 by Panlingua Lang..
This Hindi-Bhojpuri parallel data set, having total 1000 sentences (500 dev, 500 test) has been release under license: CC BY-NC-SA 4.0 by Panlingua La..
Available Under License: CC BY-NC-SA 4.0
This Hindi monolingual data set, having 473605 sentences and total word count of 7092870, has been release under license: CC BY-NC-SA 4.0 by Panlingua..
This Magahi monolingual data set, having 148606 sentences and total word count of 2178424, has been release under license: CC BY-NC-SA 4.0 by Panlingu..
This Bhojpuri monolingual data set, having 91131 sentences and total word count of 1562465, has been release under license: CC BY-NC-SA 4.0 by Panling..