textnets.corpus

Implements the features relating to language.

Module attributes

LANGS

Mapping of language codes to spaCy language model names.

DocLike

Custom type for objects resembling documents (token sequences).
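
To illustrate the idea behind a mapping like LANGS, the sketch below uses a hypothetical dictionary, EXAMPLE_LANGS, and a hypothetical helper, model_name_for. Neither is part of textnets, and the two entries are only assumed to resemble what LANGS holds. The model names are real spaCy packages.

```python
# Hypothetical stand-in for LANGS; the real mapping's contents may differ.
EXAMPLE_LANGS = {
    "en": "en_core_web_sm",
    "de": "de_core_news_sm",
}


def model_name_for(lang, mapping=EXAMPLE_LANGS):
    """Return the model name for a language code.

    Codes missing from the mapping are returned unchanged, so a caller
    can also pass a full model name directly (an assumed convenience).
    """
    return mapping.get(lang, lang)


print(model_name_for("de"))
```

Returning unknown values unchanged is one possible design choice; a stricter variant could raise an error for unrecognized codes.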

Classes

Corpus

Corpus of labeled documents.

TidyText

Collection of tokens with per-document counts.
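
The notion of tokens with per-document counts can be sketched with the standard library alone. The function tidy_counts and the (label, term, n) row layout below are assumptions made for illustration; they are not the TidyText implementation.

```python
from collections import Counter


def tidy_counts(docs):
    """Return (label, term, n) rows from a mapping of label -> token list."""
    rows = []
    for label, tokens in docs.items():
        # Counter tallies how often each term occurs in this document.
        for term, n in Counter(tokens).items():
            rows.append((label, term, n))
    return rows


# Hypothetical, already tokenized documents keyed by label.
docs = {
    "doc_a": ["moon", "landing", "moon"],
    "doc_b": ["landing", "site"],
}
rows = tidy_counts(docs)
for row in rows:
    print(row)
```

Each row pairs one document label with one term, so a term that appears in several documents yields one row per document.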

Exceptions

NoDocumentColumnException

Raised if no suitable document column is specified or found.
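
The situation this exception describes can be sketched as follows. The class MissingDocumentColumn, the function pick_document_column, and the fallback rule (use the first column whose values are all strings) are hypothetical stand-ins, not the library's actual logic.

```python
class MissingDocumentColumn(Exception):
    """Hypothetical stand-in for NoDocumentColumnException."""


def pick_document_column(columns, doc_col=None):
    """Return the name of the column that holds document text.

    `columns` maps column names to lists of values.
    """
    if doc_col is not None:
        # An explicitly requested column must exist.
        if doc_col not in columns:
            raise MissingDocumentColumn(f"column {doc_col!r} not found")
        return doc_col
    # Assumed fallback: the first non-empty column containing only strings.
    for name, values in columns.items():
        if values and all(isinstance(v, str) for v in values):
            return name
    raise MissingDocumentColumn("no column of text found")


table = {"year": [1969, 1972], "text": ["one small step", "last mission"]}
print(pick_document_column(table))
```

Raising a dedicated exception type, rather than a generic ValueError, lets callers handle this specific failure separately from other input problems.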