The Malayalam treebank data is in Shakti Standard Format (SSF), a common representation for data. SSF represents the information in a sentence as one or more trees, together with a set of attribute-value pairs attached to the nodes of the trees. The attribute-value pairs allow features or properties to be specified for every node. Sentence-level SSF stores the analysis of a sentence and occurs as part of text-level SSF. The analysis of a sentence may mark any or all of the following kinds of information, as appropriate: the part of speech of each word; morphological analysis of the words, including properties such as root, gender, number, person, tense, aspect, and modality; the phrase structure or dependency structure of the sentence; and properties of units such as chunks, phrases, local word groups, and tags. SSF is theory-neutral: it allows both phrase structure and dependency structure to be coded, and even mixed in well-defined ways. The SSF representation of a sentence consists of a sequence of trees, each made up of one or more related nodes.
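The description above (a sentence as a sequence of trees whose nodes carry attribute-value pairs) can be pictured with a small in-memory sketch. This is only an illustration of the data model, not the SSF file syntax or any official parser: the class names `Node` and `Sentence`, the placeholder tokens `w1` to `w3`, and the tag and feature values are all assumptions made for the example. See the supplied SSF Format document for the actual encoding.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """One tree node: a word or a unit such as a chunk (hypothetical class)."""
    label: str                                     # word form, or a tag for a chunk
    features: dict = field(default_factory=dict)   # attribute-value pairs on this node
    children: list = field(default_factory=list)   # related child nodes, if any

    def leaves(self):
        """Return the leaf nodes under this node, in left-to-right order."""
        if not self.children:
            return [self]
        found = []
        for child in self.children:
            found.extend(child.leaves())
        return found


@dataclass
class Sentence:
    """Sentence-level analysis: a sequence of trees (hypothetical class)."""
    sentence_id: str
    trees: list = field(default_factory=list)

    def words(self):
        """Return the word labels of the sentence in order."""
        return [leaf.label for tree in self.trees for leaf in tree.leaves()]


# Placeholder tokens and illustrative tag/feature values, not corpus data.
noun_chunk = Node("NP", {"name": "NP1"}, [
    Node("w1", {"pos": "N_NN", "root": "r1", "number": "sg"}),
    Node("w2", {"pos": "N_NN", "root": "r2", "number": "sg"}),
])
verb_chunk = Node("VGF", {"name": "VGF1"}, [
    Node("w3", {"pos": "V_VM", "root": "r3", "tense": "past"}),
])
sentence = Sentence("s1", [noun_chunk, verb_chunk])

print(sentence.words())
print(verb_chunk.leaves()[0].features["tense"])
```

Keeping the features in a plain dictionary on every node mirrors the idea that any node, whether a word or a larger unit, can carry its own properties; a dependency relation could be stored the same way, as one more attribute naming the parent node.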
The Malayalam treebank corpus comprises 9512 monolingual sentence IDs, 6010 parallel sentence IDs, and approximately 251 verb frames. The following supporting documents are provided:
1. BIS Tag set
2. Chunk Guidelines
3. Dependency guidelines_Malayalam
5. morph guidelines final
6. pos guidelines
7. SSF Format

Tags: TreeBank, Malayalam treebank, Malayalam Treebank Corpus, Tree Bank Data

Added on February 27, 2018


  More Details
  • Contributed by : IL dependency tree bank, IIIT Hyd
  • Product Type : Text Corpora
  • License Type : Research
  • System Requirement : Not Applicable