Terrier IR Platform
2.2.1

Package uk.ac.gla.terrier.structures

Provides the classes that implement the data structures used for retrieval with the Terrier platform.

See:
          Description

Interface Summary
Closeable Marks an index structure class that can be closed.
IndexConfigurable Interface defining that an index structure wants access to the Index object it is associated with.
 

Class Summary
BlockDirectIndex Extended direct index that saves both block and field information about the terms that appear in a document.
BlockDirectIndexInputStream This class reads the block field direct index structure sequentially, as an input stream.
BlockDirectInvertedOutputStream Writes a block direct or block inverted index, when passed appropriate posting lists.
BlockInvertedIndex This class implements the block field inverted index for performing retrieval.
BlockInvertedIndexInputStream Reads a BlockInvertedIndex as a stream
BlockLexicon A lexicon class that saves the number of different blocks a term appears in.
BlockLexiconInputStream An input stream for accessing sequentially the entries of a block lexicon.
BlockLexiconOutputStream An output stream for writing the lexicon to a file sequentially.
CollectionStatistics This class provides basic statistics for the indexed collection of documents, such as the average length of documents, or the total number of documents in the collection.
DirectIndex A class that implements the direct index and saves information about whether a term appears in one of the specified fields.
DirectIndexInputStream This class reads the direct index structure, sequentially, as an input stream.
DirectInvertedOutputStream Writes a block direct or block inverted index, when passed appropriate posting lists.
DocIndexEncodedHash  
DocumentIndex This class provides an interface for accessing the document index file.
DocumentIndexEncoded A document index class which reads the .docid file and keeps its contents in a array of bytes in memory.
DocumentIndexInMemory This class extends DocumentIndex, but instead of accessing the disk file each time, the data are loaded into memory, in order to decrease access time.
DocumentIndexInputStream This class provides access to the document index file sequentially, as a stream.
DocumentIndexOutputStream A DocumentIndexOutputStream.
ExpansionTerms This class implements a data structure of terms in the top-retrieved documents.
ExpansionTerms.ExpansionTerm This class implements a data structure for a term in the top-retrieved documents.
FilePosition Models a position within a file as the offset in bytes and the offset in bits in that byte.
Index This class encapsulates all the Indexes at retrieval time.
InvertedIndex This class implements the inverted index for performing retrieval, with field information optionally.
InvertedIndexInputStream Reads an InvertedIndex as a stream
Lexicon The class that implements the lexicon structure.
LexiconEntry Contains all the information about one entry in the Lexicon.
LexiconInputStream This class implements an input stream for the lexicon structure.
LexiconOutputStream This class implements an output stream for the lexicon structure.
SingleLineTRECQuery This class can be used to extract batch queries from a simpler format than the regular SGML TREC format.
TRECQuery This class is used for reading the queries from TREC topic files.
UTFBlockLexicon A lexicon class that saves the number of different blocks a term appears in, using UTF encoding of Strings.
UTFBlockLexiconInputStream An input stream for accessing sequentially the entries of a block lexicon.
UTFBlockLexiconOutputStream An output stream for writing the lexicon to a file sequentially.
UTFLexicon The class that implements the lexicon structure.
UTFLexiconInputStream This class implements an input stream for the lexicon structure.
UTFLexiconOutputStream This class implements an output stream for the lexicon structure.
 

Package uk.ac.gla.terrier.structures Description

Provides the classes that implement the data structures used for retrieval with the Terrier platform.

These include the implementations of the inverted index, the direct index, the lexicon and the document index.


Terrier IR Platform
2.2.1

Terrier Information Retrieval Platform 2.2.1. Copyright 2004-2008 University of Glasgow