Home | Trees | Indices | Help |
|
---|
|
object --+ | api.CorpusReader --+ | TaggedCorpusReader --+ | MacMorphoCorpusReader
A corpus reader for the MAC_MORPHO corpus. Each line contains a single tagged word, using '_' as a separator. Sentence boundaries are based on the end-sentence tag ('_.'). Paragraph information is not included in the corpus, so each paragraph returned by self.paras() and self.tagged_paras() contains a single sentence.
|
|||
|
|||
|
|||
Inherited from Inherited from Inherited from Inherited from |
|||
Deprecated since 0.8 | |||
---|---|---|---|
Inherited from |
|||
Deprecated since 0.9.1 | |||
Inherited from Inherited from |
|
|||
Inherited from |
|
|||
Inherited from Inherited from |
|||
Deprecated since 0.9.1 | |||
---|---|---|---|
Inherited from |
|
Construct a new Tagged Corpus reader for a set of documents located at the given root directory. Example usage: >>> root = '/...path to corpus.../' >>> reader = TaggedCorpusReader(root, '.*', '.txt')
|
Home | Trees | Indices | Help |
|
---|
Generated by Epydoc 3.0beta1 on Wed Aug 27 15:08:53 2008 | http://epydoc.sourceforge.net |