To Do in nltk.corpus.reader.conll.ConllCorpusReader
- Add support for reading from corpora where different parallel files
contain different columns.
- Possibly add caching of the grid corpus view? This would allow the
same grid view to be used by different data access methods (eg words()
and parsed_sents() could both share the same grid corpus view object).
- Better support for -DOCSTART-. Currently, we just ignore it, but it
could be used to define methods that retrieve a document at a time (eg
parsed_documents()).
|