Package nltk :: Package corpus :: Package reader :: Module senseval
[hide private]
[frames] | no frames]

Module senseval

source code

Read from the Senseval 2 Corpus.

SENSEVAL [http://www.senseval.org/] Evaluation exercises for Word Sense Disambiguation. Organized by ACL-SIGLEX [http://www.siglex.org/]

Prepared by Ted Pedersen <[email protected]>, University of Minnesota, http://www.d.umn.edu/~tpederse/data.html Distributed with permission.

The NLTK version of the Senseval 2 files uses well-formed XML. Each instance of the ambiguous words "hard", "interest", "line", and "serve" is tagged with a sense identifier, and supplied with context.

Classes [hide private]
  SensevalInstance
  SensevalCorpusReader
  SensevalCorpusView
Functions [hide private]
 
_fixXML(text)
Fix the various issues with Senseval pseudo-XML.
source code