object --+
         |
        _PunktBaseClass

Includes common components of PunktTrainer and PunktSentenceTokenizer.

_Token: The token definition that should be used by this class.

Word tokenization

Annotation Procedures

Helper Functions

_params: The collection of parameters that determines the behavior of the punkt tokenizer.

x.__init__(...) initializes x; see x.__class__.__doc__ for signature

Yields pairs of tokens from the given iterator such that each input token will appear as the first element in a yielded tuple. The last pair will have None as its second element.
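
The pairing behavior described above can be sketched as a small generator. This is an illustrative sketch only, not necessarily how the library implements it; the name `iter_pairs` is hypothetical and chosen for this example.

```python
def iter_pairs(iterable):
    """Yield (token, next_token) pairs; the final pair is (last_token, None).

    Hypothetical helper written for illustration; the name and details
    are assumptions, not taken from the documented class.
    """
    it = iter(iterable)
    try:
        prev = next(it)
    except StopIteration:
        # Empty input: there is no token to appear as a first element.
        return
    for current in it:
        yield (prev, current)
        prev = current
    # The last token is paired with None because nothing follows it.
    yield (prev, None)


if __name__ == "__main__":
    print(list(iter_pairs(["Mr.", "Smith", "left."])))
    # [('Mr.', 'Smith'), ('Smith', 'left.'), ('left.', None)]
```

Pairing each token with its successor lets a caller inspect the following token while deciding how to annotate the current one, without indexing into a list or buffering the whole input.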

Perform the first pass of annotation, which makes decisions based purely on the word type of each word. The annotations are returned as a tuple of three sets.

Generated by Epydoc 3.0beta1 on Wed Aug 27 15:08:58 2008 | http://epydoc.sourceforge.net |