National Platform for Language Technology
  • Skip to Main Content
  • Announcement 1
  • Sign up
    • Register
    • Login
  • Save for later (0)
  • Feedback
    • Your cart is empty!

Highlights / Announcement

New Services Added on Portal
  • About
    • NLTM
    • NPLT
    • NLTM Advisors
    • NLTM Consortium
  • Resources
    • Text Corpus
    • Tools
    • Speech Corpus
    • WordNet
    • Treebank
    • PLS
    • Other Repositories
    • By Private Players
    Show All Resources
  • Services
    • Machine Translation
    • Speech Recognizer
    • Text to Speech
    • Transliteration
    • OCR
    • Govt. Services
    • Startups Services
    • Third Party Services
    Show All Services
  • Demonstration
  • Startups
    • Startup Wall
    • Mentor Wall
  • LeaderBoard
  • Dashboard
  • Marketplace
    • Data Marketplace
    • Translation Marketplace
Localization Logo
TDIL
Meity Startup
Startup Wall
Dashboard
C-DAC : Transliteration
  • Search

Search

Products meeting the search criteria

Product Compare (0)
NLTM Pilot TTS Data for Indian Languages — Hindi, Punjabi, Tamil, and Indian English.

NLTM Pilot TTS Data for Indian Languages — Hindi, Punjabi, Tamil, and Indian English.

TTS data for Indian languages — Hindi, Punjabi, Tamil, and Indian English. Text and corresponding speech data record in studio environment....

Contributor:  TTS Consortia
Tags:  TTS Data,Speech Data, Hindi TTS Data, Punjabi TTS Data, Tamil TTS Data, Indian English TTS Data, IITM
Redirect to external website
click here
English-Hindi ,Tamil-Telugu Parallel  Data Developed Under PSA Pilot

English-Hindi ,Tamil-Telugu Parallel Data Developed Under PSA Pilot

English-Hindi , Tamil-Telugu Parallel Data Developed Under PSA Pilot on  SSMT, lead by IIIT-Hyderabad...

Contributor:  NLTM IIIT-Hyderabad
Tags:  English-Hindi , Tamil-Telugu , Parallel Data, IIIT-Hyderabad,NLTM Pilot
Redirect to external website
click here
Tamil Raw Speech Corpus

Tamil Raw Speech Corpus

Dataset Description139:11:41 Hours | 86 GB speech data | 452 Speakers | 60,287 Audio segments | 48 kHz | 16 bit wav. Tamil is one of the longest-surviving classical languages in the world. &nbs...

Contributor:  CIIL Mysore
Tags:  Tamil, Raw Speech Corpus, Speech Corpus
Redirect to external website
click here
Tamil ASR Challenge Data (ASR Speech Data released under 3rd Challenge) - NLTMP

Tamil ASR Challenge Data (ASR Speech Data released under 3rd Challenge) - NLTMP

The data set comprises of Tamil read and conversational speech data along with the corresponding transcriptions. This speech data was collected by Speech Lab IITM and several startups. The text data w...

Contributor:  ASR Consortia
Tags:  Tamil, ASR Challenge Data, ASR Speech Data, NLTM Pilot, Speech Corpus, Speech, Corpus
Redirect to external website
click here
Tamil Speech Data- ASR

Tamil Speech Data- ASR

This corpus contains the more than 88175 audio files of Tamil language of approx. 1000 native speakers. This corpus contains word and its corresponding phonetic representation and transcription text f...

Contributor:  ASR Consortia
Tags:  ASR, Tamil, Speech Data
Redirect to external website
click here
English-Tamil Tourism Set - II Parallel Text corpus-EILMT

English-Tamil Tourism Set - II Parallel Text corpus-EILMT

English-Tamil Parallel Tourism Text corpus is developed in Unicode under English to Indian Language Machine Translation (EILMT) consortium. The core vocabulary of this corpus consist of various names,...

Contributor:  EILMT Consortia
Tags:  English-Tamil, Parallel, Tourism, Text corpus
Redirect to external website
click here
English-Tamil Health Parallel Text corpus-EILMT

English-Tamil Health Parallel Text corpus-EILMT

English-Tamil Parallel Health Text corpus is developed in Unicode under English to Indian Language Machine Translation (EILMT) Consortium. This corpus is created in excel format and size of the corpus...

Contributor:  EILMT Consortia
Tags:  English-Tamil, Parallel, Health, Text corpus
Redirect to external website
click here
English-Tamil Agriculture Parallel Text corpus-EILMT

English-Tamil Agriculture Parallel Text corpus-EILMT

English-Tamil Parallel Agriculture Text corpus is developed in Unicode, under English to Indian Language Machine Translation (EILMT) Consortium. This corpus is created in excel format and size of the ...

Contributor:  EILMT Consortia
Tags:  English-Tamil, Parallel, Agriculture, Text corpus
Redirect to external website
click here
Tamil SakalBharati Unicode Font

Tamil SakalBharati Unicode Font

Linguistically correct Unicode based Open type font specific to Tamil language conforming to the highest standards of quality, aesthetics and elegance as well as functionality. It can be deployed in a...

Contributor:  C-DAC GIST
Tags:  Tamil, Font
Redirect to external website
click here
Tamil Voice Data Male - ILTTS

Tamil Voice Data Male - ILTTS

It is a voice data collected for building HTS based statistical speech synthesis for Tamil language under the project developing text-to-speech (TTS) synthesis systems for Indian languages.This is a c...

Contributor:  TTS Consortia
Tags:  Tamil, Voice Data, Male voice, TTS, text to speech
Redirect to external website
click here
Tamil Voice Data Female - ILTTS

Tamil Voice Data Female - ILTTS

It is a voice data collected for building HTS based statistical speech synthesis for Tamil language under the project developing text-to-speech (TTS) synthesis systems for Indian languages.This is a c...

Contributor:  TTS Consortia
Tags:  Tamil, voice data, tts, text to speech
Redirect to external website
click here
e-Aksharayan – Tamil OCR

e-Aksharayan – Tamil OCR

e-Aksharayan is a Desktop software for converting scanned printed Indian Language documents into a fully editable text format in Unicode encoding. Works on Windows 7,8, and 10. Input and output speci...

Contributor:  OCR Consortia
Tags:  e-Aksharayan, Tamil OCR, OCR, Tamil
Redirect to external website
click here
A Gold Standard Tamil Raw Text Corpus

A Gold Standard Tamil Raw Text Corpus

Tamil is one of the longest-surviving Classical Languages in the world. It is a Dravidian Language Family.Tamil Text Corpus encoded in a machine readable form and stored in a standard format. The majo...

Contributor:  CIIL Mysore
Tags:  Tamil, Raw Text Corpus
Redirect to external website
click here
Text to Speech Chrome Browser Plugin ILTTS

Text to Speech Chrome Browser Plugin ILTTS

Text to speech Indian Language Chrome browser plug-in gives the power of speech to browser. During net surfing user selects some text on the browser and press a particular command. System then starts ...

Contributor:  TTS Consortia
Tags:  Hindi Bengali, Marathi, Tamil, Telugu, Malayalam, Gujarati, Kannada, Chrome, browser
Redirect to external website
click here
Hindi - Tamil Parallel POS Tagged Text Corpus

Hindi - Tamil Parallel POS Tagged Text Corpus

Under the Indian Languages Corpora Initiative (ILCI) project, ILCI Consortia led by Jawaharlal Nehru University, New Delhi has created parallel corpus, Hindi as source language and translated in Tamil...

Contributor:  ILCI Consortia
Tags:  Hindi, Tamil, Text Corpus, POS tag, Parallel text corpus
Redirect to external website
click here
Information
  • About NPLT
  • Privacy Policy
  • Return Policy
  • Terms & Conditions
  • MeitY Linguistic Resource Sharing Policy
Customer Service
  • Contact Us
  • Website Survey
  • Feedback
  • FAQs
  • Site Map
Imp Links
  • National Portal of India
  • MeitY
  • TDIL Programme
  • TDIL-DC
  • Language Technology Players
My Account
  • My Account
  • Order History
  • Save for Later
  • Newsletter
National Portal link
MeitY Website link
Digital India Website link
TDIL logo
CDAC logo

Copyright @ All Rights Reserved
National Platform for Language Technology © 2025