Available Under License: Research
The data set comprises Hindi read speech data along with the corresponding transcriptions. The text data was crawled from newspapers, and volunteers were then asked to read it aloud. It covers genres such as politics, sports, and entertainment. A lexicon, baseline models, results, and recipes to replicate the baseline experiments are also made available.
The following data sets are released for this challenge:
Train set - 40 hours
Development set - 5 hours
Evaluation set - 5 hours
Tags: Hindi, ASR Challenge Data, ASR, Speech Data, NLTM Pilot