Your cart is empty!
0 reviews / Write a review
Available Under License: Research
The data set comprises of English read and conversational speech data along with the corresponding transcriptions. This speech data was collected by Speech Lab IITM and several startups. The text data was crawled from newspapers, and then volunteers were asked to read them. The following data sets are released for this challenge:
Train set - 179.5 hours
Development set - 5.4 hours
Evaluation set - 5.4 hours
Tags: Indian English, ASR Challenge Data, ASR Speech Data, NLTM Pilot, Speech Corpus, Speech, Corpus