Package org.terrier.structures.indexing
Provides the classes used for creating the data structures of the Terrier platform.
These include the builders of the inverted index, the direct index, the lexicon and the document index.
-
Interface Summary Interface Description LexiconBuilder.CollectionStatisticsCounter Counter of LexiconEntries -
Class Summary Class Description BaseMetaIndexBuilder Abstract base class for compressed and uncompressed metaindex building Properties: metaindex.compressed.max.data.in-mem.mb - maximum size that a meta index .zdata file will be kept in memory.BlockDocumentPostingList Represents the postings of one document, and saves block (term position) information.BlockFieldDocumentPostingList BlockFieldDocumentPostingList classCompressingMetaIndexBuilder Creates a metaindex structure that compresses all values using Deflate.CompressionFactory Configures the compression to be used when creating an IndexOnDisk.CompressionFactory.BitCompressionConfiguration CompressionFactory.BitIdOnlyCompressionConfiguration CompressionFactory.CompressionConfiguration A configuration object used by the disk indexers for configuring the compression for direct and inverted files.DiskIndexWriter Utility class for writing an existing index to diskDocumentIndexBuilder A builder for the document index.DocumentPostingList Represents the postings of one document.FieldDocumentPostingList FieldDocumentPostingList classFieldLexiconMap class FieldLexicanMapFSOMapFileLexiconUtilities Indexer Properties: termpipelines - the sequence ofTermPipeline
stages (e.g.LexiconBuilder Builds temporary lexicons during indexing a collection and merges them when the indexing of a collection has finished.LexiconBuilder.BasicLexiconCollectionStaticticsCounter counts global statistics in the non-fields caseLexiconBuilder.FieldLexiconCollectionStaticticsCounter counts global statistics in the fields caseLexiconBuilder.NullCollectionStatisticsCounter LexiconMap This class keeps track of the total counts of terms within a bundle of documents being indexed.LZ4MetaIndexBuilder Writes all metadata using Zstandard compression.MetaIndexBuilder Abstract class for writing document metadata to a disk index.UncompressedMetaIndexBuilder Writes all metadata as uncompressedZstdMetaIndexBuilder Writes all metadata using Zstandard compression.