| Home | Trees | Indices | Help |
|
|---|
|
|
object --+
|
PunktParameters
Stores data used to perform sentence boundary detection with punkt.
|
|||
|
|||
|
|||
|
|||
|
|||
|
|||
|
|||
|
Inherited from |
|||
|
|||
|
abbrev_types A set of word types for known abbreviations. |
|||
|
collocations A set of word type tuples for known common collocations where the first word ends in a period. |
|||
|
sent_starters A set of word types for words that often appear at the beginning of sentences. |
|||
|
ortho_context A dictionary mapping word types to the set of orthographic contexts that word type appears in. |
|||
|
|||
|
Inherited from |
|||
|
|||
x.__init__(...) initializes x; see x.__class__.__doc__ for signature
|
|
|||
collocationsA set of word type tuples for known common collocations where the first word ends in a period. E.g., ('S.', 'Bach') is a common collocation in a text that discusses 'Johann S. Bach'. These count as negative evidence for sentence boundaries. |
ortho_contextA dictionary mapping word types to the set of orthographic contexts that word type appears in. Contexts are represented by adding orthographic context flags: ... |
| Home | Trees | Indices | Help |
|
|---|
| Generated by Epydoc 3.0beta1 on Wed Aug 27 15:08:58 2008 | http://epydoc.sourceforge.net |