Module senseval
Read from the Senseval 2 Corpus.
SENSEVAL [http://www.senseval.org/]: evaluation exercises for Word
Sense Disambiguation, organized by ACL-SIGLEX
[http://www.siglex.org/].
Prepared by Ted Pedersen <[email protected]>, University of
Minnesota (http://www.d.umn.edu/~tpederse/data.html). Distributed with
permission.
The NLTK version of the Senseval 2 files uses well-formed XML. Each
instance of the ambiguous words "hard", "interest",
"line", and "serve" is tagged with a sense
identifier, and supplied with context.
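As a rough illustration of what "tagged with a sense identifier, and supplied with context" can look like in well-formed XML, the sketch below parses a made-up sample with the standard library. The element and attribute names (lexelt, instance, answer, senseid, context, head), the sample text, and the helper read_instances are assumptions for illustration only; they are not taken from this page and may differ from the actual corpus files.

```python
import xml.etree.ElementTree as ET

# Made-up sample; element and attribute names are assumptions for illustration.
SAMPLE = """
<corpus>
  <lexelt item="hard-a">
    <instance id="example-1">
      <answer instance="example-1" senseid="HARD1"/>
      <context>It was a <head>hard</head> decision to make.</context>
    </instance>
  </lexelt>
</corpus>
"""


def read_instances(xml_text):
    """Hypothetical helper: return (word, sense id, context) per instance."""
    root = ET.fromstring(xml_text)
    results = []
    for lexelt in root.iter("lexelt"):
        for inst in lexelt.iter("instance"):
            # The sense identifier is assumed to live on an <answer> element.
            sense = inst.find("answer").get("senseid")
            # Join all text inside <context>, including the <head> word.
            context = "".join(inst.find("context").itertext()).strip()
            results.append((lexelt.get("item"), sense, context))
    return results


instances = read_instances(SAMPLE)
```

Each tuple pairs the ambiguous word with its sense identifier and the surrounding context, which is the information the description above says every instance carries.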
_fixXML(text)
Fix the various issues with Senseval pseudo-XML.
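This page does not list which issues _fixXML repairs. The sketch below is not the NLTK implementation; it only illustrates the general technique of regex-based repair on two kinds of malformed markup that pseudo-XML commonly contains (a bare ampersand and an unquoted attribute value on an unclosed tag). The function name fix_pseudo_xml and the sample input are hypothetical.

```python
import re
import xml.etree.ElementTree as ET


def fix_pseudo_xml(text):
    """Hypothetical helper: repair two kinds of malformed markup."""
    # Escape a bare "&" that stands alone between whitespace characters.
    text = re.sub(r"(\s)&(\s)", r"\1&amp;\2", text)
    # Quote an unquoted attribute value and self-close the tag,
    # e.g. <s snum=12> becomes <s snum="12"/>.
    text = re.sub(r'<(\w+) (\w+)=([^">\s]+)>', r'<\1 \2="\3"/>', text)
    return text


# Made-up input that a strict XML parser would reject as-is.
raw = "<context><s snum=12> research & development </context>"
fixed = fix_pseudo_xml(raw)

# After the repairs, the standard-library parser accepts the text.
root = ET.fromstring(fixed)
```

Applying the substitutions before parsing keeps the reader code simple: once the text is well-formed, an ordinary XML parser can handle it.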