Telugu is the official language of Andhra Pradesh state and one of the widely spoken languages in the world. However, there is no standard input method, which has a widespread use among Telugu users on computers. In this paper, we describe the design of Telugu soft keyboards, which are based on a set of design principles. We also evaluate these designs along with existing designs and compare their performances.
In this paper we have proposed an approach for automatic language and subject identification for the books of digital library. The important characteristics of function words is explored for language identification. The heuristic search approach is explored for subject identification by matching title words with the keywords of the subjects. The language identification system is developed for five languages namely English, French, German, Italian and Spanish.
In this paper, we describe a sentence position based summarizer that is built based on a sentence position policy, created from the evaluation testbed of recent summarization tasks at Document Understanding Conferences (DUC).We show that the summarizer thus built is able to outperform most systems participating in task focused summarization evaluations at Text Analysis Conferences (TAC) 2008.
Retrieving answer containing passages is a challenging task in Question Answering. In this paper we describe a novel query expansion method which aims to rank the answer containing passages better. It uses content and structured information (link structure and category information) of Wikipedia to generate a set of terms semantically related to the question.
Document Retrieval assumes that a document is independent of its relevance, and non-relevance. Previous works showed that the same assumption is being considered for passage retrieval in the context of Question Answering. In this paper, we relax this assumption and describe a method for estimating the prior of a passage being relevant, and non-relevant to a question. These prior probabilities are used in the process of ranking passages.