Home | Trees | Indices | Help |
|
---|
|
|
|||
Corpus View | |||
---|---|---|---|
StreamBackedCorpusView A 'view' of a corpus file, which acts like a sequence of tokens: it can be accessed by index, iterated over, etc. |
|||
ConcatenatedCorpusView A 'view' of a corpus file that joins together one or more StreamBackedCorpusViews. |
|||
Corpus View for Pickled Sequences | |||
PickleCorpusView A stream backed corpus view for corpus files that consist of sequences of serialized Python objects (serialized using pickle.dump ).
|
|||
Treebank readers | |||
SyntaxCorpusReader An abstract base class for reading corpora consisting of syntactically parsed text. |
|
|||
Corpus View | |||
---|---|---|---|
|
|||
Block Readers | |||
|
|||
|
|||
|
|||
|
|||
|
|||
|
|||
|
|||
|
|||
Finding Corpus Items | |||
|
|||
|
|||
Paragraph structure in Treebank files | |||
|
|
Concatenate together the contents of multiple documents from a single corpus, using an appropriate concatenation function. This utility function is used by corpus readers when the user requests more than one document at a time. |
Read a sequence of tokens from a stream, where tokens begin with lines
that match |
Read a sequence of s-expressions from the stream, and leave the stream's file position at the end the last complete s-expression read. This function will always return at least one s-expression, unless there are no more s-expressions in the file. If the file ends in in the middle of an s-expression, then that incomplete s-expression is returned when the end of the file is reached.
|
Home | Trees | Indices | Help |
|
---|
Generated by Epydoc 3.0beta1 on Wed Aug 27 15:08:50 2008 | http://epydoc.sourceforge.net |