176:53:28 hours of speech data (113 GB) | 456 speakers | 77443 audio segments | 48 kHz | 16-bit WAV
Bodo, one of the scheduled languages of India, is a tonal language. It has two clearly distinguishable tones, known as Low and High. The language belongs to the Tibeto-Burman linguistic family. It is the language of the Bodos, a major tribe of the Indian state of Assam.
The LDC-IL speech data was collected in the Chirang, Baksa, Sonitpur, Udalguri, Kamrup, Barpeta, and Kokrajhar districts of the state of Assam, India, and covers the Bwrdwnari, Eastern, and Standard dialects. The data was collected from speakers of both genders and of different age groups.
The LDC-IL Bodo speech data set consists of several types of recorded material: word lists, sentences, running texts, and date formats.
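The listing describes the audio as 48 kHz, 16-bit WAV. As a minimal sketch of how one might confirm that an individual segment matches that description, the following uses Python's standard `wave` module to read a file's header. The function names `describe_wav` and `matches_listing` are hypothetical and are not part of any LDC-IL tooling. The channel count is reported but not checked, because the listing does not state it.

```python
import wave


def describe_wav(path):
    """Read the header of a PCM WAV file and return its basic properties."""
    with wave.open(path, "rb") as wav:
        frames = wav.getnframes()
        rate = wav.getframerate()
        return {
            "sample_rate_hz": rate,
            "bits_per_sample": wav.getsampwidth() * 8,  # sample width is in bytes
            "channels": wav.getnchannels(),
            "duration_s": frames / rate if rate else 0.0,
        }


def matches_listing(info, sample_rate_hz=48000, bits_per_sample=16):
    """Check header fields against the format stated in the listing.

    The defaults (48 kHz, 16 bit) come from the listing; everything else
    about the files is an assumption and is deliberately not checked.
    """
    return (
        info["sample_rate_hz"] == sample_rate_hz
        and info["bits_per_sample"] == bits_per_sample
    )
```

Calling `matches_listing(describe_wav("segment.wav"))` on a file (the file name here is a placeholder) would return `True` only if the header reports 48000 Hz and 16 bits per sample.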
The available Speech Corpus details:
Tags: Boro, Bodo, Raw Speech Corpus