Package nltk :: Package corpus :: Package reader :: Module bracket_parse :: Class AlpinoCorpusReader
[hide private]
[frames] | no frames]

Class AlpinoCorpusReader

source code

         object --+            
                  |            
   api.CorpusReader --+        
                      |        
util.SyntaxCorpusReader --+    
                          |    
   BracketParseCorpusReader --+
                              |
                             AlpinoCorpusReader

Reader for the Alpino Dutch Treebank.

Instance Methods [hide private]
 
__init__(self, root, encoding=None, tag_mapping_function=None) source code
 
_normalize(self, t) source code

Inherited from BracketParseCorpusReader (private): _parse, _read_block, _tag, _word

Inherited from util.SyntaxCorpusReader: parsed_sents, raw, sents, tagged_sents, tagged_words, words

Inherited from api.CorpusReader: __repr__, abspath, abspaths, encoding, files, open

Inherited from api.CorpusReader (private): _get_root

Inherited from object: __delattr__, __getattribute__, __hash__, __new__, __reduce__, __reduce_ex__, __setattr__, __str__

    Block Readers
    Deprecated since 0.8

Inherited from util.SyntaxCorpusReader: parsed, read, tagged, tokenized

    Deprecated since 0.9.1

Inherited from api.CorpusReader: filenames

Inherited from api.CorpusReader (private): _get_items

Instance Variables [hide private]

Inherited from api.CorpusReader (private): _encoding, _files, _root

Properties [hide private]

Inherited from api.CorpusReader: root

Inherited from object: __class__

    Deprecated since 0.9.1

Inherited from api.CorpusReader: items

Method Details [hide private]

__init__(self, root, encoding=None, tag_mapping_function=None)
(Constructor)

source code 
Parameters:
  • root - The root directory for this corpus.
  • files - A list or regexp specifying the files in this corpus.
  • comment_char - The character which can appear at the start of a line to indicate that the rest of the line is a comment.
  • detect_blocks - The method that is used to find blocks in the corpus; can be 'unindented_paren' (every unindented parenthesis starts a new parse) or 'sexpr' (brackets are matched).
Overrides: BracketParseCorpusReader.__init__
(inherited documentation)

_normalize(self, t)

source code 
Overrides: BracketParseCorpusReader._normalize