demo(num_sents=100,
max_rules=200,
min_score=3,
error_output='errors.out',
rule_output='rules.yaml',
randomize=False,
train=0.8,
trace=3)
| source code
|
Brill Tagger Demonstration
- Parameters:
num_sents (int) - how many sentences of training and testing data to use
max_rules (int) - maximum number of rule instances to create
min_score (int) - the minimum score for a rule in order for it to be considered
error_output (string) - the file where errors will be saved
rule_output (string) - the file where rules will be saved
randomize (boolean) - whether the training data should be a random subset of the corpus
train (float) - the fraction of the the corpus to be used for training (1=all)
trace (int) - the level of diagnostic tracing output to produce (0-4)
|