Package nltk :: Package tokenize :: Module api :: Class TokenizerI
Class TokenizerI

object --+
         |
        TokenizerI

A processing interface for tokenizing a string, or dividing it into a list of substrings.

Subclasses must define either tokenize() or batch_tokenize() (or both):

tokenize(self, s)
    Divide the given string into a list of substrings. Returns: list of str
batch_tokenize(self, strings)
    Apply self.tokenize() to each element of strings. Returns: list of list of str
tokenize(self, s)

Divide the given string into a list of substrings.

Returns: list of str

batch_tokenize(self, strings)

Apply self.tokenize() to each element of strings. I.e.:

    return [self.tokenize(s) for s in strings]
Returns: list of list of str
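
The contract described above can be illustrated with a minimal sketch. The class names `TokenizerSketch` and `WhitespaceSketch` are hypothetical stand-ins written for this example, not NLTK's own classes. The sketch assumes a subclass overrides only tokenize() and inherits a batch_tokenize() that matches the list comprehension shown above.

```python
class TokenizerSketch(object):
    """Hypothetical stand-in for the interface described above (not NLTK's class)."""

    def tokenize(self, s):
        # Concrete subclasses are expected to override this method.
        raise NotImplementedError()

    def batch_tokenize(self, strings):
        # Apply self.tokenize() to each element of strings.
        return [self.tokenize(s) for s in strings]


class WhitespaceSketch(TokenizerSketch):
    """Illustrative subclass (an assumption): splits on runs of whitespace."""

    def tokenize(self, s):
        # str.split() with no arguments splits on any whitespace run.
        return s.split()


tokenizer = WhitespaceSketch()
tokens = tokenizer.tokenize("Good muffins cost $3.88")   # list of str
batches = tokenizer.batch_tokenize(["a b", "c d e"])     # list of list of str
```

Here `tokens` is a flat list of substrings, while `batches` holds one such list per input string, matching the two documented return types.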