tethne.persistence.hdf5.util module¶
Helper classes and methods for tethne.persistence.hdf5.
TODO: move away from index table pattern, toward index array pattern.
- class tethne.persistence.hdf5.util.HDF5Axis(h5file, fgroup, name)[source]¶
Bases: dict
Organizes a single axis.
- class tethne.persistence.hdf5.util.HDF5FeatureSet(h5file, fgroup, name)[source]¶
Bases: dict
Stores data about the distribution of a specific feature-set, e.g. unigrams, across papers in the Corpus.
- class tethne.persistence.hdf5.util.HDF5FeatureValues(h5file, group, name, keyatom=StringAtom(itemsize=200, shape=(), dflt=''), indexatom=Int32Atom(shape=(), dflt=0), valueatom=Float64Atom(shape=(), dflt=0.0))[source]¶
Bases: dict
- class tethne.persistence.hdf5.util.HDF5Features(h5file)[source]¶
Bases: dict
Organizes feature-sets, each as a HDF5Feature.
- class tethne.persistence.hdf5.util.HDF5SparseValues(h5file, group, name, iatom, katom, indexatom)[source]¶
Bases: dict
Parameters: h5file
group
name
iatom
katom
indexatom
- tethne.persistence.hdf5.util.get_h5file(typename, datapath=None)[source]¶
Load or create an HDF5 data file.
- class tethne.persistence.hdf5.util.papers_table(h5file, index_by, name, citations=None, index_citation_by='ayjid')[source]¶
Bases: dict
Mimics the papers dict in Paper, providing HDF5 persistence.
Values should be set only once for a key.
Parameters: h5file : tables.file.File
A tables.file.File object.
index_by : str