Class GISEncoding

                     object --+
                              |
     MaxentFeatureEncodingI --+
                              |
BinaryMaxentFeatureEncoding --+
                              |
                             GISEncoding

A binary feature encoding that adds one new joint-feature to those defined by BinaryMaxentFeatureEncoding: a correction feature, whose value is chosen so that the sparse vector always sums to a constant non-negative number. This new feature ensures two preconditions of the GIS training algorithm:

  • At least one feature vector index must be nonzero for every token.
  • The feature vector must sum to a constant non-negative number for every token.

Inherited from BinaryMaxentFeatureEncoding: labels

Inherited from BinaryMaxentFeatureEncoding: train


__init__(self, labels, mapping, unseen_features=False, alwayson_features=False, C=None)

  • C - The correction constant. The value of the correction feature is derived from it: for a given encoding, the correction feature's value is C - sum([v for (f,v) in encoding]).
Overrides: BinaryMaxentFeatureEncoding.__init__
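
As a rough illustration of the correction feature described above, the following sketch appends one extra (index, value) pair so that the values sum to C. It does not use NLTK; the function name add_correction_feature and the parameter base_length are hypothetical, and the check that the existing sum stays below C is an assumption made here so that the correction value is always positive.

```python
def add_correction_feature(encoding, base_length, C):
    """Append a correction feature so the sparse vector sums to C.

    `encoding` is a list of (index, value) pairs. `base_length` is the
    number of joint-features before the correction feature is added, so
    the correction feature takes that index. Both names are hypothetical.
    """
    total = sum(v for (f, v) in encoding)
    # Assumption for this sketch: C must exceed the current sum, otherwise
    # the correction feature could not be positive.
    if total >= C:
        raise ValueError("C must be larger than the sum of the feature values")
    return encoding + [(base_length, C - total)]


# Three active binary features out of ten, with correction constant C = 5.
sparse = [(0, 1), (3, 1), (7, 1)]
with_correction = add_correction_feature(sparse, base_length=10, C=5)
```

Here the three active features sum to 3, so the correction feature at index 10 receives the value 2 and the full vector sums to 5.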

encode(self, featureset, label)

Given a (featureset, label) pair, return the corresponding vector of joint-feature values. This vector is represented as a list of (index, value) tuples, specifying the value of each non-zero joint-feature.

Returns: list of (int, number)
Overrides: BinaryMaxentFeatureEncoding.encode
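
To make the sparse representation concrete, this sketch expands a list of (index, value) tuples into a dense list in which every unlisted joint-feature is zero. The helper to_dense and the example values are hypothetical and are not part of NLTK.

```python
def to_dense(sparse_vector, length):
    """Expand a list of (index, value) tuples into a dense list.

    `length` is the size of the fixed-length joint-feature vector;
    any index not named in `sparse_vector` is left at zero.
    """
    dense = [0] * length
    for index, value in sparse_vector:
        dense[index] = value
    return dense


# A sparse vector with three non-zero joint-features out of six.
dense_vector = to_dense([(0, 1), (3, 1), (5, 2)], length=6)
```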


length(self)

Returns: int
The size of the fixed-length joint-feature vectors that are generated by this encoding.
Overrides: BinaryMaxentFeatureEncoding.length

describe(self, f_id)

Returns: str
A string describing the value of the joint-feature whose index in the generated feature vectors is f_id.
Overrides: BinaryMaxentFeatureEncoding.describe

C

The non-negative constant that all encoded feature vectors will sum to.

