public class BlockSinglePassIndexer extends BasicSinglePassIndexer
BasicSinglePassIndexer).
All normal block properties are supported. For more information, see BlockIndexer.| Modifier and Type | Class and Description |
|---|---|
protected class |
BlockSinglePassIndexer.BasicTermProcessor
This class implements an end of a TermPipeline that adds the
term to the DocumentTree.
|
protected class |
BlockSinglePassIndexer.DelimFieldTermProcessor
This class behaves in a similar fashion to FieldTermProcessor except that
this one treats blocks bounded by delimiters instead of fixed-sized blocks.
|
protected class |
BlockSinglePassIndexer.DelimTermProcessor
This class behaves in a similar fashion to BasicTermProcessor except that
this one treats blocks bounded by delimiters instead of fixed-sized blocks.
|
protected class |
BlockSinglePassIndexer.FieldTermProcessor
This class implements an end of a TermPipeline that adds the
term to the DocumentTree.
|
| Modifier and Type | Field and Description |
|---|---|
protected int |
BLOCK_SIZE
The maximum number of terms allowed in a block
|
protected int |
blockId
The block number in the current document.
|
protected int |
MAX_BLOCKS
The maximum number allowed number of blocks in a document.
|
protected int |
numOfTokensInBlock
The number of tokens in the current block of the current document.
|
basicInvertedIndexPostingIteratorClass, currentFile, currentId, docsPerCheck, fieldInvertedIndexPostingIteratorClass, fileNames, invertedIndexClass, invertedIndexInputStreamClass, maxDocsPerFlush, maxMemory, memoryAfterFlush, memoryCheck, merger, mp, numberOfDocsSinceCheck, numberOfDocsSinceFlush, numberOfDocuments, numberOfPointers, numberOfTokens, numberOfUniqueTerms, runtimecompressionDirectConfig, compressionInvertedConfig, numOfTokensInDocument, termCodes, termFields, termsInDocumentblocks, BUILDER_BOUNDARY_DOCUMENTS, currentIndex, directIndexBuilder, docIndexBuilder, emptyDocIndexEntry, externalParalllism, fieldNames, fileNameNoExtension, IndexEmptyDocuments, invertedIndexBuilder, lexiconBuilder, logger, MAX_DOCS_PER_BUILDER, MAX_TOKENS_IN_DOCUMENT, metaBuilder, numFields, path, pipeline_first, prefix, useFieldInformation| Constructor and Description |
|---|
BlockSinglePassIndexer(String pathname,
String prefix)
Constructs an instance of this block indexer which uses the single-pass strategy
|
| Modifier and Type | Method and Description |
|---|---|
protected void |
createDocumentPostings()
Hook method that creates the right type of DocumentTree class.
|
protected void |
createFieldRunMerger(String[][] files)
Hook method that creates a FieldRunMerger instance
|
protected void |
createMemoryPostings()
Hook method that creates the right type of MemoryPostings class.
|
protected void |
createRunMerger(String[][] files)
Hook method that creates a RunsMerger instance
|
protected TermPipeline |
getEndOfPipeline()
Returns the object that is to be the end of the TermPipeline.
|
void |
performMultiWayMerge()
Uses the merger class to perform a k multiway merge
in a set of previously written runs.
|
checkFlush, createDirectIndex, createInvertedIndex, createInvertedIndex, finishMemoryPosting, forceFlush, getFileNames, indexDocument, load_indexer_propertiesfinishedInvertedIndexBuildcreateMetaIndexBuilder, finishedDirectIndexBuild, getExternalParalllism, index, indexEmpty, init, load_builder_boundary_documents, load_field_ids, load_pipeline, main, merge, merge, mergeTwoIndices, parseInts, setExternalParalllism, useFieldInformationprotected int numOfTokensInBlock
protected int blockId
protected int BLOCK_SIZE
protected int MAX_BLOCKS
protected TermPipeline getEndOfPipeline()
getEndOfPipeline in class BasicIndexerprotected void createFieldRunMerger(String[][] files) throws IOException
BasicSinglePassIndexercreateFieldRunMerger in class BasicSinglePassIndexerIOException - if an I/O error occurs.protected void createRunMerger(String[][] files) throws Exception
BasicSinglePassIndexercreateRunMerger in class BasicSinglePassIndexerIOException - if an I/O error occurs.Exceptionprotected void createMemoryPostings()
BasicSinglePassIndexercreateMemoryPostings in class BasicSinglePassIndexerprotected void createDocumentPostings()
BasicIndexercreateDocumentPostings in class BasicIndexerpublic void performMultiWayMerge()
throws IOException
BasicSinglePassIndexerperformMultiWayMerge in class BasicSinglePassIndexerIOExceptionTerrier Information Retrieval Platform 5.2. Copyright © 2004-2019, University of Glasgow