• Kannada Raw Speech Corpus
Kannada Raw Speech Corpus
  • Contributor: CIIL Mysore
  • Product Code: CIIL-KAN-RAW-Speech-122
Sample Download | size: 1.5MB | type: zip
Added on : 29 Jul 2019

179:32:52 hours of 115 Gigabytes speech data | 656 Speakers | 99109 Audio segments | 48 kHz | 16 bit wav

Kannada is one of the Ancient Indian languages which belong to the Dravidian family. It has its own script. The language in a region is influenced by other languages of the region, the mother tongue of the speaker, etc. The reading speed, loudness, frequency etc also differ depending on certain factors like age, gender etc. Linguistic data consortium identified four regional dialects and collected the speech corpus through fieldwork. This read data is collected from various age groups, of male and female native speakers in equal number. This data includes Texts, Sentences, Date Formats, and different wordlists.

 

The available Speech Corpus details are as follows.

    •          Total Speakers - 656 (328 Female and 328 Male)
    •          Contemporary Text (News) - 600 Audio Segments - 66:06:09 Hours
    •          Creative Text - 600 Audio Segments - 33:09:20 Hours
    •          Sentence - 14887 Audio Segments - 13:58:15 Hours
    •          Date Format - 1200 Audio Segments - 1:16:22 Hours
    •          Command and Control Words - 17988 Audio Segments - 12:31:43 Hours
    •          Person Name - 12009 Audio Segments - 13:04:49 Hours
    •          Place Noun - 6032 Audio Segments - 4:48:42 Hours
    •          Most Frequent Word-Part - 18065 Audio Segments - 12:21:24 Hours
    •          Most Frequent Word-Full Set - 8000 Audio Segments - 6:45:56 Hours
    •          Phonetically Balanced - 9360 Audio Segments - 6:47:23 Hours
    •          Form and Function- Word - 10368 Audio Segments - 8:42:49 Hours
Speech Data Attributes
Annotation Raw Speech Corpus
Language Kannada
Duration 179:32:52
Speaker Type Native
File Size 115 GB
No. of Audio Segment 99109
Speaker Gender Male and Female

Write a review

Please login or register to review

Tags: Kannada, Raw Speech Corpus

Disclaimer: The information provided on this page has been procured through different sources. Please write back to us at nplt_support[at]cdac[dot]in in case you would like to suggest an update.