National Platform for Language Technology
  • Skip to Main Content
  • Announcement 1
  • Sign up
    • Register
    • Login
  • Save for later (0)
  • Feedback
    • Your cart is empty!

Highlights / Announcement

New Services Added on Portal
  • About
    • NLTM
    • NPLT
    • NLTM Advisors
    • NLTM Consortium
  • Resources
    • Text Corpus
    • Tools
    • Speech Corpus
    • WordNet
    • Treebank
    • PLS
    • Other Repositories
    • By Private Players
    Show All Resources
  • Services
    • Machine Translation
    • Speech Recognizer
    • Text to Speech
    • Transliteration
    • OCR
    • Govt. Services
    • Startups Services
    • Third Party Services
    Show All Services
  • Demonstration
  • Startups
    • Startup Wall
    • Mentor Wall
  • LeaderBoard
  • Dashboard
  • Marketplace
    • Data Marketplace
    • Translation Marketplace
Localization Logo
TDIL
Meity Startup
Startup Wall
Dashboard
C-DAC : Transliteration
  • Search

Search

Products meeting the search criteria

Product Compare (0)
Indian English ASR Challenge Data (ASR Speech Data released under 3rd Challenge) - NLTMP

Indian English ASR Challenge Data (ASR Speech Data released under 3rd Challenge) - NLTMP

The data set comprises of English read and conversational speech data along with the corresponding transcriptions. This speech data was collected by Speech Lab IITM and several startups. The text data...

Contributor:  ASR Consortia
Tags:  Indian English, ASR Challenge Data, ASR Speech Data, NLTM Pilot, Speech Corpus, Speech, Corpus
Redirect to external website
click here
Hindi ASR Challenge Data (ASR Speech Data released under 3rd Challenge) - NLTMP

Hindi ASR Challenge Data (ASR Speech Data released under 3rd Challenge) - NLTMP

The data set comprises of Hindi read and conversational speech data along with the corresponding transcriptions. This speech data was collected by Speech Lab IITM and several startups. The text data w...

Contributor:  ASR Consortia
Tags:  Hindi, ASR Challenge Data, ASR Speech Data, NLTM Pilot, Speech Corpus, Speech, Corpus
Redirect to external website
click here
Hindi ASR Challenge Data (ASR Speech Data released under 1st Challenge) - NLTMP

Hindi ASR Challenge Data (ASR Speech Data released under 1st Challenge) - NLTMP

The data set comprises of Hindi read speech data along with the corresponding transcriptions. The text data was crawled from newspapers, and then volunteers were asked to read them. It covers genres l...

Contributor:  ASR Consortia
Tags:  Hindi, ASR Challenge Data, ASR, Speech Data, NLTM Pilot
Redirect to external website
click here
Tamil ASR Challenge Data (ASR Speech Data released under 3rd Challenge) - NLTMP

Tamil ASR Challenge Data (ASR Speech Data released under 3rd Challenge) - NLTMP

The data set comprises of Tamil read and conversational speech data along with the corresponding transcriptions. This speech data was collected by Speech Lab IITM and several startups. The text data w...

Contributor:  ASR Consortia
Tags:  Tamil, ASR Challenge Data, ASR Speech Data, NLTM Pilot, Speech Corpus, Speech, Corpus
Redirect to external website
click here
Indian English ASR Challenge Data (ASR Speech Data) - NLTM Pilot

Indian English ASR Challenge Data (ASR Speech Data) - NLTM Pilot

The data set comprises of Indian English read speech and lecture speech data along with the corresponding transcriptions. The read speech covers genres like politics sports, entertainment, etc. It was...

Contributor:  ASR Consortia
Tags:  Indian English, ASR Challenge Data, ASR Speech Data, NLTM Pilot, Speech Corpus, Speech, Corpus
Redirect to external website
click here
Telugu Speech Data- ASR

Telugu Speech Data- ASR

This corpus contains the 6019 audio files of Telugu language of approx. 1000 native speakers.  This data was prepared for Agricultural Commodity and Size of this corpus is 5.73 GB. ...

Contributor:  ASR Consortia
Tags:  ASR, Telugu, Speech Data
Redirect to external website
click here
BIHARI SPEECH DATA - ASR

BIHARI SPEECH DATA - ASR

This corpus contains the 54866 audio files of Bihari language of approx. 1000 native speakers. This corpus also  contains word and its corresponding phonetic representation and transcription text...

Contributor:  ASR Consortia
Tags:  ASR, Bihari, Speech Data
Redirect to external website
click here
Bengali Speech Data – ASR

Bengali Speech Data – ASR

This corpus contains the more than 43134 audio files of Bengali language of approx. 1000 native speakers. This corpus also contains word and its corresponding phonetic representation and transcription...

Contributor:  ASR Consortia
Tags:  ASR, Bengali, Speech Data
Redirect to external website
click here
HINDI Speech Data – ASR

HINDI Speech Data – ASR

This corpus contains the more than 194714 audio files of HINDI language of approx. 1000 native speakers. This corpus also contains word and its corresponding phonetic representation and transcrip...

Contributor:  ASR Consortia
Tags:  ASR, HINDI, Speech Data
Redirect to external website
click here
Marathi Speech Data - ASR

Marathi Speech Data - ASR

This corpus contains the more than 44521 audio files of Marathi language of 1500 speakers, dic file which contains word and its corresponding phonetic representation and transcription text file listin...

Contributor:  ASR Consortia
Tags:   ASR, Marathi, Speech Data
Redirect to external website
click here
Tamil Speech Data- ASR

Tamil Speech Data- ASR

This corpus contains the more than 88175 audio files of Tamil language of approx. 1000 native speakers. This corpus contains word and its corresponding phonetic representation and transcription text f...

Contributor:  ASR Consortia
Tags:  ASR, Tamil, Speech Data
Redirect to external website
click here
Odia Speech Data – ASR

Odia Speech Data – ASR

This corpus contains the more than 11940 audio files of Odia language of approx. 1000 native speakers. This corpus contains word and its corresponding phonetic representation and transcription text fi...

Contributor:  ASR Consortia
Tags:  ASR, Odia, Speech Data
Redirect to external website
click here
Kannada Speech Data – ASR

Kannada Speech Data – ASR

This corpus contains the more than 93803 audio files of Kannada language of 1000 native speakers, Callflow1.dic file which contains word and its corresponding phonetic representation and transcription...

Contributor:  ASR Consortia
Tags:  ASR, Kannada, Speech Data
Redirect to external website
click here
HINDI (JHARKHAND) Speech Data – ASR

HINDI (JHARKHAND) Speech Data – ASR

This corpus contains the more than 36694 audio files of HINDI (JHARKHAND)  language of approx. 1000 native speakers. This corpus also contains word and its corresponding phonetic representation a...

Contributor:  ASR Consortia
Tags:  ASR, HINDI (JHARKHAND), Speech Data
Redirect to external website
click here
Assamese Speech Data-ASR

Assamese Speech Data-ASR

This corpus contains the 57975 audio files of Assamese language of approx. 1000 native speakers. This corpus also  contains word and its corresponding phonetic representation and transcription te...

Contributor:  ASR Consortia
Tags:  ASR, ASSAMESE, SPEECH DATA
Redirect to external website
click here
Information
  • About NPLT
  • Privacy Policy
  • Return Policy
  • Terms & Conditions
  • MeitY Linguistic Resource Sharing Policy
Customer Service
  • Contact Us
  • Website Survey
  • Feedback
  • FAQs
  • Site Map
Imp Links
  • National Portal of India
  • MeitY
  • TDIL Programme
  • TDIL-DC
  • Language Technology Players
My Account
  • My Account
  • Order History
  • Save for Later
  • Newsletter
National Portal link
MeitY Website link
Digital India Website link
TDIL logo
CDAC logo

Copyright @ All Rights Reserved
National Platform for Language Technology © 2025