1.1 KNB Corpus
|
>>> from nltk.corpus import knbc
|
|
Access the words: this should produce a list of strings:
|
>>> type(knbc.words()[0]) is not bytes
True
|
|
Access the sentences: this should produce a list of lists of strings:
|
>>> type(knbc.sents()[0][0]) is not bytes
True
|
|
Access the tagged words: this should produce a list of word, tag pairs:
|
>>> type(knbc.tagged_words()[0])
<... 'tuple'>
|
|
Access the tagged sentences: this should produce a list of lists of word, tag pairs:
|
>>> type(knbc.tagged_sents()[0][0])
<... 'tuple'>
|
|