•    Freeware
  •    Shareware
  •    Research
  •    Localization Tools 20
  •    Publications 696
  •    Validators 2
  •    Mobile Apps 22
  •    Fonts 31
  •    Guidelines/ Draft Standards 3
  •    Documents 13
  •    General Tools 38
  •    NLP Tools 105
  •    Linguistic Resources 251

Search Results | Total Results found :   1181

You refine search by : All Results
  Catalogue
Kannada tree bank data is in Shakti Standard Format (SSF). SSF is a common representation for data. SSF allows information in a sentence to be represented in the form of one or more trees together with a set of attribute-value pairs with nodes of the trees. The attribute-value pairs allow features or properties to be specified with every node. Sentence level SSF is used to store the analysis of a sentence. It occurs as part of text level SSF. The analysis of a sentence may mark any or all of the following kinds of information as appropriate: part of speech of the words in the sentence; morphological analysis of the words including properties such as root, gender, number, person, tense, aspect, modality; phrase-structure or dependency structure of the sentence; and properties of units such as chunks, phrases, local word groups, tags, etc. SSF is theory neutral and allows both phrase structure as well as dependency structure to be coded, and even mixed in well defined ways. The SSF representation for a sentence consists of a sequence of trees. Each tree is made up of one or more related nodes. Total size of the Kannada tree bank corpus is 19550 sentence ids and approx. 215 verb frames. Following supporting documents are provided:
1. Dependency_Guidelines_Kannada.pdf
2. morph_kannada.pdf
3. pos_chunk_guidelines_kannada.pdf
4. SSF format of tree bank.pdf

Tags:TreeBank, Kannada treebank, Treebank Corpus, Tree bank Data

Added on February 27, 2018

1
9

  More Details
  • Contributed by : IL Dependency Treebank, IIIT Hyd
  • Product Type : Text Corpora
  • License Type : Research
  • System Requirement : Not Applicable

Marathi treebank data is in Shakti Standard Format (SSF). SSF is a common representation for data. SSF allows information in a sentence to be represented in the form of one or more trees together with a set of attribute-value pairs with nodes of the trees. The attribute-value pairs allow features or properties to be specified with every node. Sentence level SSF is used to store the analysis of a sentence. It occurs as part of text level SSF. The analysis of a sentence may mark any or all of the following kinds of information as appropriate: part of speech of the words in the sentence; morphological analysis of the words including properties such as root, gender, number, person, tense, aspect, modality; phrase-structure or dependency structure of the sentence; and properties of units such as chunks, phrases, local word groups, tags, etc. SSF is theory neutral and allows both phrase structure as well as dependency structure to be coded, and even mixed in well defined ways. The SSF representation for a sentence consists of a sequence of trees. Each tree is made up of one or more related nodes. Total size of the Marathi tree bank corpus is 10852
monolingual sentence ids ,3450 parallel sentence ids and approx. 380 verb frames.
Following supporting documents are provided:
1. Marathi POS Tag set
2. Marathi_Morph Guidelines
3. Guidelines for Marathi Verb frames
4.Dependency -Marathi

Tags: TreeBank, Marathi treebank, Treebank Corpus, Tree bank Data

Added on February 27, 2018

1
7

  More Details
  • Contributed by : IL dependency tree bank, IIIT Hyderabad
  • Product Type : Text Corpora
  • License Type : Research
  • System Requirement : Not Applicable

Hindi tree bank data is in Shakti Standard Format (SSF). SSF is a common representation for data. SSF allows information in a sentence to be represented in the form of one or more trees together with a set of attribute-value pairs with nodes of the trees. The attribute-value pairs allow features or properties to be specified with every node. Sentence level SSF is used to store the analysis of a sentence. It occurs as part of text level SSF. The analysis of a sentence may mark any or all of the following kinds of information as appropriate: part of speech of the words in the sentence; morphological analysis of the words including properties such as root, gender, number, person, tense, aspect, modality; phrase-structure or dependency structure of the sentence; and properties of units such as chunks, phrases, local word groups, tags, etc. Note that SSF is theory neutral and allows both phrase structure as well as dependency structure to be coded, and even mixed in well defined ways. The SSF representation for a sentence consists of a sequence of trees. Each tree is made up of one or more related nodes.Total size of the Hindi tree bank corpus is 3000 sentence ids and approx. 860 verb frames. Following supporting documents are provided:
1. Chunk guidelines.pdf
2. DS-guidelines-june-18-2014.pdf
3. Hindi verb frames guidelines.pdf
4. pos-standard-doc-modified-nov-12-2011-v1.0_BIS_Tagset.pdf
5. SSF format of tree bank.pdf

Tags: TreeBank, Hindi treebank, Treebank Corpus, Tree bank Data

Added on February 27, 2018

3
22

  More Details
  • Contributed by : IL Dependency Tree bank_IIIT Hyd
  • Product Type : Text Corpora
  • License Type : Research
  • System Requirement : Not Applicable

Bengali tree bank data is in Shakti Standard Format (SSF). SSF is a common representation for data. SSF allows information in a sentence to be represented in the form of one or more trees together with a set of attribute-value pairs with nodes of the trees. The attribute-value pairs allow features or properties to be specified with every node. Sentence level SSF is used to store the analysis of a sentence. It occurs as part of text level SSF. The analysis of a sentence may mark any or all of the following kinds of information as appropriate: part of speech of the words in the sentence; morphological analysis of the words including properties such as root, gender, number, person, tense, aspect, modality; phrase-structure or dependency structure of the sentence; and properties of units such as chunks, phrases, local word groups, tags, etc. SSF is theory neutral and allows both phrase structure as well as dependency structure to be coded, and even mixed in well defined ways. The SSF representation for a sentence consists of a sequence of trees. Each tree is made up of one or more related nodes.Total size of the Bengali tree bank corpus is 15725
monolingual sentence ids and approx. 425 verb frames.
Following supporting documents are provided:
1. Bengali sentences with relation following Hindi.pdf
2. POS TAG guideline draft.pdf
3. SSF format of tree bank.pdf

Tags: TreeBank, Bengali treebank, Treebank Corpus, Tree bank Data, Bangla

Added on February 27, 2018

1
17

  More Details
  • Contributed by : IL dependency tree bank, IIIT Hyd
  • Product Type : Text Corpora
  • License Type : Research
  • System Requirement : Not Applicable

A necessary step for the recognition of scanned documents is binarization, which is essentially the segmentation of the document. In order to binarize a scanned document, we can find several algorithms in the literature. What is the best binarization result for a given document image? To answer this question, a user needs to check different binarization algorithms for suitability, since different algorithms may work better for different type of documents. Manually choosing the best from a set of binarized documents is time consuming. To automate the selection of the best segmented document, either we need to use ground-truth of the document or propose an evaluation metric.

Added on December 19, 2017

127

  More Details
  • Contributed by : Consortium
  • Product Type : Research Paper
  • License Type : Freeware
  • System Requirement : Not Applicable
  • Author : Deepak Kumar, M. N. Anil Prasad, A.G. Ramakrishnan
Author Community Profile :