speech data

NLTM Pilot TTS Data for Indian Languages — Hindi, Punjabi, Tamil, and Indian English.

TTS data for Indian languages — Hindi, Punjabi, Tamil, and Indian English. Text and corresponding speech data record in studio environment...

Available Under License:
CC BY-SA 2.0

Sample Download | size: 0B | type: zip

Added on : 16 Aug 2021

Tags: TTS Data Speech Data Hindi TTS Data Punjabi TTS Data Tamil TTS Data Indian English TTS Data IITM

Indian English ASR Challenge Data (ASR Speech Data released under 3rd Challenge) - NLTMP

The data set comprises of English read and conversational speech data along with the corresponding transcriptions. This speech data was collected by S..

Available Under License:
Research

Added on : 26 Jul 2021

Tags: Indian English ASR Challenge Data ASR Speech Data NLTM Pilot Speech Corpus Speech Corpus

Hindi ASR Challenge Data (ASR Speech Data released under 3rd Challenge) - NLTMP

The data set comprises of Hindi read and conversational speech data along with the corresponding transcriptions. This speech data was collected by Spe..

Available Under License:
Research

Added on : 26 Jul 2021

Tags: Hindi ASR Challenge Data ASR Speech Data NLTM Pilot Speech Corpus Speech Corpus

Hindi ASR Challenge Data (ASR Speech Data released under 1st Challenge) - NLTMP

The data set comprises of Hindi read speech data along with the corresponding transcriptions. The text data was crawled from newspapers, and then volu..

Available Under License:
Research

Sample Download | size: 0B | type: zip

Added on : 10 Jun 2021

Tags: Hindi ASR Challenge Data ASR Speech Data NLTM Pilot

Tamil ASR Challenge Data (ASR Speech Data released under 3rd Challenge) - NLTMP

The data set comprises of Tamil read and conversational speech data along with the corresponding transcriptions. This speech data was collected by Spe..

Available Under License:
Research

Added on : 26 Jul 2021

Tags: Tamil ASR Challenge Data ASR Speech Data NLTM Pilot Speech Corpus Speech Corpus

Indian English ASR Challenge Data (ASR Speech Data) - NLTM Pilot

The data set comprises of Indian English read speech and lecture speech data along with the corresponding transcriptions. The read speech covers genre..

Available Under License:
Research

Sample Download | size: 0B | type: tar

Added on : 10 Jun 2021

Tags: Indian English ASR Challenge Data ASR Speech Data NLTM Pilot Speech Corpus Speech Corpus

Telugu Speech Data- ASR

This corpus contains the 6019 audio files of Telugu language of approx. 1000 native speakers. This data was prepared for Agricultural Commo..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 0B | type: zip

Added on : 21 Jan 2021

Tags: ASR Telugu Speech Data

BIHARI SPEECH DATA - ASR

This corpus contains the 54866 audio files of Bihari language of approx. 1000 native speakers. This corpus also contains word and its correspond..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 0B | type: zip

Added on : 21 Jan 2021

Tags: ASR Bihari Speech Data

Bengali Speech Data – ASR

This corpus contains the more than 43134 audio files of Bengali language of approx. 1000 native speakers. This corpus also contains word and its corre..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 0B | type: zip

Added on : 12 Jan 2021

Tags: ASR Bengali Speech Data

HINDI Speech Data – ASR

This corpus contains the more than 194714 audio files of HINDI language of approx. 1000 native speakers. This corpus also contains word and its c..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 0B | type: zip

Added on : 12 Jan 2021

Tags: ASR HINDI Speech Data

Marathi Speech Data - ASR

This corpus contains the more than 44521 audio files of Marathi language of 1500 speakers, dic file which contains word and its corresponding phonetic..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 0B | type: zip

Added on : 11 Dec 2020

Tags: ASR Marathi Speech Data

Tamil Speech Data- ASR

This corpus contains the more than 88175 audio files of Tamil language of approx. 1000 native speakers. This corpus contains word and its correspondin..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 0B | type: zip

Added on : 04 Dec 2020

Tags: ASR Tamil Speech Data

Odia Speech Data – ASR

This corpus contains the more than 11940 audio files of Odia language of approx. 1000 native speakers. This corpus contains word and its corresponding..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 0B | type: zip

Added on : 04 Dec 2020

Tags: ASR Odia Speech Data

Kannada Speech Data – ASR

This corpus contains the more than 93803 audio files of Kannada language of 1000 native speakers, Callflow1.dic file which contains word and its corre..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 0B | type: zip

Added on : 04 Dec 2020

Tags: ASR Kannada Speech Data

HINDI (JHARKHAND) Speech Data – ASR

This corpus contains the more than 36694 audio files of HINDI (JHARKHAND) language of approx. 1000 native speakers. This corpus also contains wo..

Available Under License:
CC BY-SA 2.0

Sample Download | size: 0B | type: zip

Added on : 03 Dec 2020

Tags: ASR HINDI (JHARKHAND) Speech Data