Package nltk :: Module text :: Class Text
Class Text

object --+
         |
      list --+
             |
            Text

A text object, which can be loaded with a sequence of words, and which supports counting, concordancing, collocation discovery, etc. This class is intended to support initial exploration of texts. It is initialized with a list of words, e.g.:

>>> moby = Text(nltk.corpus.gutenberg.words('melville-moby_dick.txt'))

Many of the methods simply print their results, and are intended for use via the interactive console.
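Because the class derives from list, ordinary list operations such as len(), count() and slicing are available alongside the exploration methods. The following is a minimal sketch of that design and not the NLTK source: MiniText is a hypothetical stand-in, and the sample words are made up for illustration.

```python
class MiniText(list):
    """Hypothetical stand-in for a list-backed text object (not the NLTK class)."""

    def __init__(self, text, name=None):
        # Store the words in the underlying list and remember an optional label.
        super().__init__(text)
        self.name = name


# Made-up sample data for illustration only.
words = "call me ishmael . some years ago never mind how long".split()
sample = MiniText(words, name="sample")

print(len(sample))         # total number of tokens
print(sample.count("me"))  # occurrences of a single word
print(sample[:3])          # slicing returns the first three tokens
```

Subclassing list keeps the object usable anywhere a plain sequence of words is expected.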

__init__(self, text, name=None)
Create a Text object.
concordance(self, word, width=80, lines=25)
Print a concordance for the word with the specified context window.
collocations(self, num=20)
Print collocations derived from the text.
readability(self, method)
generate(self, length=100)
Print random text, generated using a trigram language model.
similar(self, word, num=20)
Distributional similarity: find other words which appear in the same contexts as the specified word.
dispersion_plot(self, words)
zipf_plot(self, *args)
vocab(self)
__init__(self, text, name=None)

Create a Text object.

  • text (sequence of str) - The source text.
concordance(self, word, width=80, lines=25)

Print a concordance for the word with the specified context window.

  • word (str) - The target word
  • width (int) - The width of each line, in characters (default=80)
  • lines (int) - The number of lines to display (default=25)
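To make the width and lines parameters concrete, here is a rough sketch of how a keyword-in-context display can be built. It is not the NLTK implementation: concordance_lines is a hypothetical helper, the case-insensitive matching is an assumption, and it returns the lines instead of printing them so that they can be inspected.

```python
def concordance_lines(tokens, word, width=80, lines=25):
    """Return up to `lines` strings of at most `width` characters, centred on `word`."""
    # Characters of context available on each side of the target word.
    half = max(1, (width - len(word) - 2) // 2)
    target = word.lower()  # assumption: matching ignores case
    results = []
    for i, token in enumerate(tokens):
        if token.lower() != target:
            continue
        # Join neighbouring tokens, then trim to the available width.
        left = " ".join(tokens[max(0, i - half):i])[-half:]
        right = " ".join(tokens[i + 1:i + 1 + half])[:half]
        # Right-align the left context so the target words line up in a column.
        results.append(f"{left:>{half}} {token} {right}")
        if len(results) == lines:
            break
    return results


# Made-up sample data for illustration only.
tokens = "the whale swam and the Whale dived".split()
for line in concordance_lines(tokens, "whale", width=30):
    print(line)
```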

collocations(self, num=20)

Print collocations derived from the text.

  • num (int) - The number of collocations to produce.
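As an illustration of the idea, collocations can be approximated as adjacent word pairs that recur in the text. The sketch below ranks pairs by raw frequency, which is a simplification chosen here for brevity; the documentation above does not state which association measure or filters the method uses. top_collocations and min_count are hypothetical names.

```python
from collections import Counter


def top_collocations(tokens, num=20, min_count=2):
    """Return up to `num` adjacent word pairs that occur at least `min_count` times."""
    lowered = [t.lower() for t in tokens]
    # Count adjacent pairs, skipping any pair that contains punctuation or digits.
    pair_counts = Counter(
        (a, b) for a, b in zip(lowered, lowered[1:]) if a.isalpha() and b.isalpha()
    )
    frequent = [pair for pair, n in pair_counts.most_common() if n >= min_count]
    return [" ".join(pair) for pair in frequent[:num]]


# Made-up sample data for illustration only.
tokens = "New York is big . New York is old . I like New York".split()
print("; ".join(top_collocations(tokens)))
```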

generate(self, length=100)

Print random text, generated using a trigram language model.

  • length (int) - The length of text to generate (default=100)
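A trigram language model predicts each word from the two words that precede it. The sketch below shows the general technique under simplifying assumptions (no smoothing, uniform sampling over observed continuations, random restart at dead ends); it is not the NLTK implementation, and generate_text and its seed parameter are hypothetical.

```python
import random
from collections import defaultdict


def generate_text(tokens, length=100, seed=None):
    """Return a list of `length` words sampled from a simple trigram model."""
    rng = random.Random(seed)
    # Map each pair of consecutive words to the words observed after it.
    followers = defaultdict(list)
    for a, b, c in zip(tokens, tokens[1:], tokens[2:]):
        followers[(a, b)].append(c)
    if not followers:
        # Fewer than three tokens: nothing to model, so echo the input.
        return list(tokens[:length])
    starts = list(followers)
    output = list(rng.choice(starts))
    while len(output) < length:
        options = followers.get((output[-2], output[-1]))
        if not options:
            # Dead end (context seen only at the end of the text): restart.
            output.extend(rng.choice(starts))
            continue
        output.append(rng.choice(options))
    return output[:length]


# Made-up sample data for illustration only.
tokens = "a b c a b d a b c".split()
print(" ".join(generate_text(tokens, length=20, seed=0)))
```

Repeated words in a follower list make frequent continuations proportionally more likely to be chosen.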

similar(self, word, num=20)

Distributional similarity: find other words which appear in the same contexts as the specified word.

  • word (str) - The word used to seed the similarity search
  • num (int) - The number of words to generate (default=20)
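One simple way to realise "appears in the same contexts" is to treat the pair of neighbouring words (left, right) as a context and rank other words by how many contexts they share with the seed word. The sketch below follows that reading; the scoring rule, the lowercasing and the similar_words helper are assumptions for illustration, not the NLTK implementation.

```python
from collections import Counter, defaultdict


def similar_words(tokens, word, num=20):
    """Return up to `num` words that share (left, right) contexts with `word`."""
    lowered = [t.lower() for t in tokens]
    # Record the set of (left neighbour, right neighbour) contexts for each word.
    contexts = defaultdict(set)
    for left, middle, right in zip(lowered, lowered[1:], lowered[2:]):
        contexts[middle].add((left, right))
    seed = word.lower()
    target = contexts.get(seed, set())
    # Score every other word by the number of contexts it shares with the seed.
    scores = Counter()
    for other, seen in contexts.items():
        shared = len(target & seen)
        if other != seed and shared:
            scores[other] = shared
    return [w for w, _ in scores.most_common(num)]


# Made-up sample data for illustration only.
tokens = "the cat sat . the dog sat . the cat ran . the dog ran . a bird flew".split()
print(" ".join(similar_words(tokens, "cat")))
```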

Returns: string
A string representation of this Text.
