| Home | Trees | Indices | Help |
|
|---|
|
|
object --+
|
api.TokenizerI --+
|
WhitespaceTokenizer
A tokenizer that divides a string into substrings by treating any
sequence of whitespace characters as a separator. Whitespace characters
are space (' '), tab ('\t'), and newline
('\n'). If you are performing the tokenization yourself
(rather than building a tokenizer to pass to some other piece of code),
consider using the string split() method instead:
>>> words = s.split()
|
|||
|
|||
|
Inherited from Inherited from |
|||
|
|||
|
Inherited from |
|||
|
|||
Divide the given string into a list of substrings.
|
| Home | Trees | Indices | Help |
|
|---|
| Generated by Epydoc 3.0beta1 on Wed Aug 27 15:08:58 2008 | http://epydoc.sourceforge.net |