Your cart is empty!
0 reviews / Write a review
Kashmiri language is one of the 22 scheduled languages of India and is a part of the Eighth Schedule in the constitution of Jammu and Kashmir.
Kashmiri text has been typed in Unicode by using the In Script Keyboard in XML files. Metadata information has also been provided along with the data. The corpus has been developed from the available contemporary text. Kashmiri Text Corpus in LDC-IL comprises of 466,054 Words and character count is 2646948, drawn from books, newspapers and magazines. The representations of the two major domains are Aesthetics and Social Sciences etc.
Tags: Kashmiri, Raw Text Corpus