| Home | Trees | Indices | Help |
|
|---|
|
|
object --+
|
_PunktBaseClass
Includes common components of PunktTrainer and PunktSentenceTokenizer.
|
|||
|
_Token The token definition that should be used by this class. |
|||
|
|||
|
|||
|
Inherited from |
|||
| Word tokenization | |||
|---|---|---|---|
|
|||
| Annotation Procedures | |||
|
|||
|
|||
|
|||
| Helper Functions | |||
|---|---|---|---|
|
|||
|
|||
|
_params The collection of parameters that determines the behavior of the punkt tokenizer. |
|||
|
|||
|
Inherited from |
|||
|
|||
x.__init__(...) initializes x; see x.__class__.__doc__ for signature
|
Yields pairs of tokens from the given iterator such that each input token will appear as the first element in a yielded tuple. The last pair will have None as its second element. |
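The pairing behaviour described above can be illustrated with a minimal sketch. The name `pair_iter_sketch` is hypothetical and chosen for illustration only; this is not the module's own implementation, just one way to produce the same pairs from any iterable.

```python
def pair_iter_sketch(iterable):
    """Yield (current, next) pairs; the final pair has None as its second element.

    Hypothetical helper written for illustration, not taken from the punkt source.
    """
    it = iter(iterable)
    try:
        prev = next(it)
    except StopIteration:
        # Empty input: there are no tokens, so there are no pairs to yield.
        return
    for item in it:
        yield prev, item
        prev = item
    # The last input token is paired with None.
    yield prev, None


pairs = list(pair_iter_sketch(["Mr.", "Smith", "left", "."]))
```

Each input token appears exactly once as the first element of a pair, so a consumer can inspect a token together with its successor without indexing into a list.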
Perform the first pass of annotation, which makes decisions based purely on the word type of each word:
Return these annotations as a tuple of three sets:
|
| Generated by Epydoc 3.0beta1 on Wed Aug 27 15:08:58 2008 | http://epydoc.sourceforge.net |