Indian English 3rd ASR Challenge Data

Indian English ASR Challenge Data (ASR Speech Data released under 3rd Challenge) - NLTMP

Contributor: ASR Consortia
Product Code: NLTMP-ASR-3CHALLENGE-ENG-004

Available Under License: Research

Added on : 26 Jul 2021

The data set comprises of English read and conversational speech data along with the corresponding transcriptions. This speech data was collected by Speech Lab IITM and several startups. The text data was crawled from newspapers, and then volunteers were asked to read them. The following data sets are released for this challenge:

Train set - 179.5 hours

Development set - 5.4 hours

Evaluation set - 5.4 hours

Speech Data Attributes
Language	Indian Accent English
Transcription	Yes, Available
Duration	190.3 hours
Speaker Gender	Both Male & Female

Tags: Indian English, ASR Challenge Data, ASR Speech Data, NLTM Pilot, Speech Corpus, Speech, Corpus

Write a review