... corpus has the following features: unique ID, UTF-8 encoding, and text file format. ...
Text Corpora
License Type: Research
... corpus has the following features: unique ID, UTF-8 encoding, and text file format. ...
Text Corpora
License Type: Research
... corpus has the following features: unique ID, UTF-8 encoding, and text file format. ...
Text Corpora
License Type: Research
... corpus has the following features: unique ID, UTF-8 encoding, and text file format. ...
Text Corpora
License Type: Research
Under the Indian Languages Corpora Initiative (ILCI) project initiated by the DeitY, Govt. of India, Jawaharlal Nehru University, New Delhi, collected a corpus in Hindi as the source language and translated ...
Text Corpora
License Type: Research
... corpus has the following features: unique ID, UTF-8 encoding, and text file format. ...
Text Corpora
License Type: Research
... corpus has the following features: unique ID, UTF-8 encoding, and text file format. ...
Text Corpora
License Type: Research
... corpus has the following features: unique ID, UTF-8 encoding, and text file format. ...
Text Corpora
License Type: Research
... corpus has the following features: unique ID, UTF-8 encoding, and text file format. ...
Text Corpora
License Type: Research
... corpus has the following features: unique ID, UTF-8 encoding, and text file format. ...
Text Corpora
License Type: Research
This paper presents an HMM-based chunk tagger for Hindi. Various tagging schemes for marking chunk boundaries are discussed along with their results. Contextual information is incorporated into the chunk ...
License Type: Freeware
Research Paper
Paper Type: Conference Papers
Author(s): Akshay Singh, S M Bendre, Rajeev Sangal
There has been growing interest in using speech technology in rural areas. In this context, this paper describes the development of speech corpora in Indian languages (viz., Gujarati and Marathi from remote ...
License Type: Freeware
Research Paper
Paper Type: Conference Papers
Author(s): Kewal D. Malde, Bhavik B. Vachhani, Maulik C. Madhavi, Nirav H. Chhayani, Hemant A. Patil
Abstract:
The task of Word Sense Disambiguation (WSD) incorporates in its definition the role of ‘context’. We present our work on the development of a tool which allows for automatic acquisition and ...
License Type: Freeware
Research Paper
Paper Type: Conference Papers
Author(s): Pushpak Bhattacharyya, Diptesh Kanojia, Raj Dabre, Siddhartha Gunti, Manish Shrivastava
... Thus grammar correction can be considered a translation problem from incorrect text to correct text. Over the years, grammar correction data in the electronic form (i.e., parallel corpora of incorrect ...
License Type: Freeware
Research Paper
Paper Type: Conference Papers
Author(s): Bibek Behera, Pushpak Bhattacharyya
In this paper we introduce Translation Difficulty Index (TDI), a measure of difficulty in text translation. We first define and quantify translation difficulty in terms of TDI. We realize that any measure ...
License Type: Freeware
Research Paper
Paper Type: Conference Papers
Author(s): Abhijit Mishra, Pushpak Bhattacharyya, Michael Carl
... the results in KeyWord-In-Context (KWIC) format. We also present the notation used for querying and transformation, which is comparable to but different from the Corpus Query Language (CQL).
License Type: Freeware
Research Paper
Paper Type: Conference Papers
Author(s): Anil Kumar Singh, Bharat Ambati
... poor on verbs with accuracy level at 25-38%. We suggest a modification to this formulation, using context and semantic relatedness of neighboring words. An improvement of 17%-35% in the accuracy ...
License Type: Freeware
Research Paper
Paper Type: Conference Papers
Author(s): Sudha Bhingardive, Samiulla Shaikh, Pushpak Bhattacharyya
... to incorporate greater contextual and linguistic information), which leads to an effective training of these models. This model is then used by the standard state-of-the-art Moses decoder (Koehn et al., 2007) ...
License Type: Freeware
Research Paper
Paper Type: Conference Papers
Author(s): Prasanth Kolachina, V Sriram, Srinivas Bangalore, Sudheer Kolachina, Avinesh PVS
We propose a lightweight method for using discourse relations for polarity detection of tweets. This method is targeted towards the web-based applications that deal with noisy, unstructured text, like ...
License Type: Freeware
Research Paper
Paper Type: Conference Papers
Author(s): Subhabrata Mukherjee, Pushpak Bhattacharyya
Cross-Lingual Sentiment Analysis (CLSA) is the task of predicting the polarity of the opinion expressed in a text in a language Ltest using a classifier trained on the corpus of another language Ltrain. ...
License Type: Freeware
Research Paper
Paper Type: Conference Papers
Author(s): Balamurali A R, Aditya Joshi, Pushpak Bhattacharyya