SciPy

tethne.model.corpus package

Module contents

Corpus models describe latent topics (dimensions) that explain the distribution of features (eg words) among documents in a Corpus.

Tethne presently represents two corpus models:

ldamodel.LDAModel Represents a Latent Dirichlet Allocation (LDA) topic model.
dtmmodel.DTMModel Represents a Dynamic Topic Model (DTM).

Most model classes are subclasses of BaseModel. It is assumed that each model describes a set of items (eg Papers or authors), a set of dimensions that describe those items (eg topics), and a set of features that comprise those dimensions (eg words).