National Platform for Language Technology

Products meeting the search criteria

Hindi Monolingual Data Set

This Hindi monolingual data set, with 473605 sentences and a total word count of 7092870, has been released under the CC BY-NC-SA 4.0 license by Panlingua Language Processing LLP, New Delhi, India....

Contributor:  Panlingua Language Processing LLP
Tags:  Hindi, Monolingual, Text Corpus
Magahi monolingual data set

This Magahi monolingual data set, with 148606 sentences and a total word count of 2178424, has been released under the CC BY-NC-SA 4.0 license by Panlingua Language Processing LLP, New Delhi, India....

Contributor:  Panlingua Language Processing LLP
Tags:  Magahi, Monolingual, Text Corpus
Bhojpuri monolingual data set

This Bhojpuri monolingual data set, with 91131 sentences and a total word count of 1562465, has been released under the CC BY-NC-SA 4.0 license by Panlingua Language Processing LLP, New Delhi, India....

Contributor:  Panlingua Language Processing LLP
Tags:  Bhojpuri, Monolingual, Text Corpus
Urdu Monolingual Chunked Tagged Text Corpus ILCI

Under the Indian Languages Corpora Initiative Phase II (ILCI Phase-II) project, initiated by MeitY, Govt. of India, Jawaharlal Nehru University, New Delhi, collected a monolingual corpus in Urdu...

Contributor:  ILCI Consortia
Tags:  Urdu, Monolingual, Chunked Tagged, Text Corpus
Nepali Monolingual Chunked Text Corpus ILCI

Under the Indian Languages Corpora Initiative Phase II (ILCI Phase-II) project, initiated by MeitY, Govt. of India, Jawaharlal Nehru University, New Delhi, collected a monolingual corpus in Nepa...

Contributor:  ILCI Consortia
Tags:  Nepali, Monolingual, Chunked Tagged, Text Corpus, ILCI
Kannada Monolingual Chunked Text Corpus ILCI

Under the Indian Languages Corpora Initiative Phase II (ILCI Phase-II) project, initiated by MeitY, Govt. of India, Jawaharlal Nehru University, New Delhi, collected a monolingual corpus in Kann...

Contributor:  ILCI Consortia
Tags:  Kannada, Monolingual, Chunked Tagged, Text Corpus
Hindi Monolingual Chunked Text Corpus ILCI

Under the Indian Languages Corpora Initiative Phase II (ILCI Phase-II) project, initiated by MeitY, Govt. of India, Jawaharlal Nehru University, New Delhi, collected a monolingual corpus in Hind...

Contributor:  ILCI Consortia
Tags:  Hindi, Monolingual, Chunked Tagged, Text Corpus
Gujarati Monolingual Chunked Text Corpus ILCI

Under the Indian Languages Corpora Initiative Phase II (ILCI Phase-II) project, initiated by MeitY, Govt. of India, Jawaharlal Nehru University, New Delhi, collected a monolingual corpus in Guja...

Contributor:  ILCI Consortia
Tags:  Gujarati, Monolingual, Chunked Tagged, Text Corpus, ILCI
English Monolingual Chunked Text Corpus ILCI

Under the Indian Languages Corpora Initiative Phase II (ILCI Phase-II) project, initiated by MeitY, Govt. of India, Jawaharlal Nehru University, New Delhi, collected a monolingual corpus in Engl...

Contributor:  ILCI Consortia
Tags:  English, Monolingual, Chunked Tagged, Text Corpus, ILCI
Punjabi Monolingual Chunked Text Corpus ILCI

Under the Indian Languages Corpora Initiative Phase II (ILCI Phase-II) project, initiated by MeitY, Govt. of India, Jawaharlal Nehru University, New Delhi, collected a monolingual corpus in Punj...

Tags:  Punjabi, Monolingual, Chunked Tagged, Text Corpus
Assamese Monolingual Chunked Text Corpus ILCI

Under the Indian Languages Corpora Initiative Phase II (ILCI Phase-II) project, initiated by MeitY, Govt. of India, Jawaharlal Nehru University, New Delhi, collected a monolingual corpus in Assa...

Contributor:  ILCI Consortia
Tags:  Assamese, Monolingual, Chunked, Text Corpus, ILCI
Urdu Monolingual PoS Tagged Text Corpus ILCI

Under the Indian Languages Corpora Initiative Phase II (ILCI Phase-II) project, initiated by MeitY, Govt. of India, Jawaharlal Nehru University, New Delhi, collected a monolingual corpus in Urdu...

Contributor:  ILCI Consortia
Tags:  Urdu, Monolingual, PoS Tagged, Text Corpus
Telugu Monolingual PoS Tagged Text Corpus ILCI

Under the Indian Languages Corpora Initiative Phase II (ILCI Phase-II) project, initiated by MeitY, Govt. of India, Jawaharlal Nehru University, New Delhi, collected a monolingual corpus in Telu...

Contributor:  ILCI Consortia
Tags:  Telugu, Monolingual, PoS Tagged, Text Corpus
Punjabi Monolingual PoS Tagged Text Corpus ILCI

Under the Indian Languages Corpora Initiative Phase II (ILCI Phase-II) project, initiated by MeitY, Govt. of India, Jawaharlal Nehru University, New Delhi, collected a monolingual corpus in Punj...

Contributor:  ILCI Consortia
Tags:  Punjabi, Monolingual, PoS Tagged, Text Corpus
Nepali Monolingual PoS Tagged Text Corpus ILCI

Under the Indian Languages Corpora Initiative Phase II (ILCI Phase-II) project, initiated by MeitY, Govt. of India, Jawaharlal Nehru University, New Delhi, collected a monolingual corpus in Nepa...

Contributor:  ILCI Consortia
Tags:  Nepali, Monolingual, PoS Tagged, Text Corpus
National Platform for Language Technology © 2025