public class BlockSinglePassIndexer extends BasicSinglePassIndexer
BasicSinglePassIndexer
).
All normal block properties are supported. For more information, see BlockIndexer
.Modifier and Type | Class and Description |
---|---|
protected class |
BlockSinglePassIndexer.BasicTermProcessor
This class implements an end of a TermPipeline that adds the
term to the DocumentTree.
|
protected class |
BlockSinglePassIndexer.DelimFieldTermProcessor
This class behaves in a similar fashion to FieldTermProcessor except that
this one treats blocks bounded by delimiters instead of fixed-sized blocks.
|
protected class |
BlockSinglePassIndexer.DelimTermProcessor
This class behaves in a similar fashion to BasicTermProcessor except that
this one treats blocks bounded by delimiters instead of fixed-sized blocks.
|
protected class |
BlockSinglePassIndexer.FieldTermProcessor
This class implements an end of a TermPipeline that adds the
term to the DocumentTree.
|
Modifier and Type | Field and Description |
---|---|
protected int |
BLOCK_SIZE
The maximum number of terms allowed in a block
|
protected int |
blockId
The block number in the current document.
|
protected int |
MAX_BLOCKS
The maximum number allowed number of blocks in a document.
|
protected int |
numOfTokensInBlock
The number of tokens in the current block of the current document.
|
basicInvertedIndexPostingIteratorClass, currentFile, currentId, docsPerCheck, fieldInvertedIndexPostingIteratorClass, fileNames, invertedIndexClass, invertedIndexInputStreamClass, maxDocsPerFlush, maxMemory, memoryAfterFlush, memoryCheck, merger, mp, numberOfDocsSinceCheck, numberOfDocsSinceFlush, numberOfDocuments, numberOfPointers, numberOfTokens, numberOfUniqueTerms, runtime
compressionDirectConfig, compressionInvertedConfig, numOfTokensInDocument, termCodes, termFields, termsInDocument
BUILDER_BOUNDARY_DOCUMENTS, currentIndex, directIndexBuilder, docIndexBuilder, emptyDocIndexEntry, fieldNames, fileNameNoExtension, IndexEmptyDocuments, invertedIndexBuilder, lexiconBuilder, logger, MAX_DOCS_PER_BUILDER, MAX_TOKENS_IN_DOCUMENT, metaBuilder, numFields, path, pipeline_first, prefix, useFieldInformation
Constructor and Description |
---|
BlockSinglePassIndexer(String pathname,
String prefix)
Constructs an instance of this block indexer which uses the single-pass strategy
|
Modifier and Type | Method and Description |
---|---|
protected void |
createDocumentPostings()
Hook method that creates the right type of DocumentTree class.
|
protected void |
createFieldRunMerger(String[][] files)
Hook method that creates a FieldRunMerger instance
|
protected void |
createMemoryPostings()
Hook method that creates the right type of MemoryPostings class.
|
protected void |
createRunMerger(String[][] files)
Hook method that creates a RunsMerger instance
|
protected TermPipeline |
getEndOfPipeline()
Returns the object that is to be the end of the TermPipeline.
|
void |
performMultiWayMerge()
Uses the merger class to perform a k multiway merge
in a set of previously written runs.
|
checkFlush, createDirectIndex, createInvertedIndex, createInvertedIndex, finishMemoryPosting, forceFlush, getFileNames, indexDocument, load_indexer_properties
finishedInvertedIndexBuild
createMetaIndexBuilder, finishedDirectIndexBuild, index, indexEmpty, init, load_builder_boundary_documents, load_field_ids, load_pipeline, main, merge, merge, mergeTwoIndices, parseInts, useFieldInformation
protected int numOfTokensInBlock
protected int blockId
protected int BLOCK_SIZE
protected int MAX_BLOCKS
protected TermPipeline getEndOfPipeline()
getEndOfPipeline
in class BasicIndexer
protected void createFieldRunMerger(String[][] files) throws IOException
BasicSinglePassIndexer
createFieldRunMerger
in class BasicSinglePassIndexer
IOException
- if an I/O error occurs.protected void createRunMerger(String[][] files) throws Exception
BasicSinglePassIndexer
createRunMerger
in class BasicSinglePassIndexer
IOException
- if an I/O error occurs.Exception
protected void createMemoryPostings()
BasicSinglePassIndexer
createMemoryPostings
in class BasicSinglePassIndexer
protected void createDocumentPostings()
BasicIndexer
createDocumentPostings
in class BasicIndexer
public void performMultiWayMerge() throws IOException
BasicSinglePassIndexer
performMultiWayMerge
in class BasicSinglePassIndexer
IOException
Terrier Information Retrieval Platform 5.1. Copyright © 2004-2019, University of Glasgow