public class Hadoop_BlockSinglePassIndexer extends Hadoop_BasicSinglePassIndexer
BlockIndexer
Modifier and Type | Class and Description |
---|---|
protected class |
Hadoop_BlockSinglePassIndexer.BasicTermProcessor
This class implements an end of a TermPipeline that adds the
term to the DocumentTree.
|
protected class |
Hadoop_BlockSinglePassIndexer.DelimFieldTermProcessor
This class behaves in a similar fashion to FieldTermProcessor except that
this one treats blocks bounded by delimiters instead of fixed-sized blocks.
|
protected class |
Hadoop_BlockSinglePassIndexer.DelimTermProcessor
This class behaves in a similar fashion to BasicTermProcessor except that
this one treats blocks bounded by delimiters instead of fixed-sized blocks.
|
protected class |
Hadoop_BlockSinglePassIndexer.FieldTermProcessor
This class implements an end of a TermPipeline that adds the
term to the DocumentTree.
|
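The delimiter-bounded behaviour described for DelimTermProcessor and DelimFieldTermProcessor can be illustrated with a minimal sketch. It is not Terrier's implementation: the class `DelimiterBlockSketch`, its `assign` method, and the choice to skip the delimiter terms themselves are assumptions made for illustration only.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;

/** Hypothetical helper (not part of Terrier): block ids bounded by delimiter terms. */
final class DelimiterBlockSketch {

    /**
     * Returns one block id per non-delimiter term, in input order.
     * Assumption: a delimiter starts a new block and is not itself recorded.
     */
    static List<Integer> assign(List<String> terms, Set<String> delimiters, int maxBlocks) {
        List<Integer> blockIds = new ArrayList<>();
        int blockId = 0;
        for (String term : terms) {
            if (delimiters.contains(term)) {
                // Start a new block, but never go past the configured maximum.
                if (blockId < maxBlocks) {
                    blockId++;
                }
                continue;
            }
            blockIds.add(blockId);
        }
        return blockIds;
    }
}
```

With the delimiter `<s>`, the term sequence `a b <s> c <s> d e` yields the block ids 0, 0, 1, 2, 2 in this sketch, so block lengths follow the text rather than a fixed term count.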
Modifier and Type | Field and Description |
---|---|
protected int |
BLOCK_SIZE
The maximum number of terms allowed in a block.
|
protected int |
blockId
The block number in the current document.
|
protected int |
MAX_BLOCKS
The maximum number of blocks allowed in a document.
|
protected int |
numOfTokensInBlock
The number of tokens in the current block of the current document.
|
currentReporter, flushList, flushNo, jc, lastReporter, lexstream, MapIndexPrefixes, mapTaskID, mutipleIndices, outputPostingListCollector, reduceId, reduceStarted, RunData, runIteratorF, splitnum, start
basicInvertedIndexPostingIteratorClass, currentFile, currentId, docsPerCheck, fieldInvertedIndexPostingIteratorClass, fileNames, invertedIndexClass, invertedIndexInputStreamClass, maxDocsPerFlush, maxMemory, memoryAfterFlush, memoryCheck, merger, mp, numberOfDocsSinceCheck, numberOfDocsSinceFlush, numberOfDocuments, numberOfPointers, numberOfTokens, numberOfUniqueTerms, runtime
compressionDirectConfig, compressionInvertedConfig, numOfTokensInDocument, termFields, termsInDocument
BUILDER_BOUNDARY_DOCUMENTS, currentIndex, directIndexBuilder, docIndexBuilder, emptyDocIndexEntry, fieldNames, fileNameNoExtension, IndexEmptyDocuments, invertedIndexBuilder, lexiconBuilder, logger, MAX_DOCS_PER_BUILDER, MAX_TOKENS_IN_DOCUMENT, metaBuilder, numFields, path, pipeline_first, prefix, useFieldInformation
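How the fields BLOCK_SIZE, MAX_BLOCKS, blockId and numOfTokensInBlock could interact for fixed-size blocks is sketched below. This is an illustration under stated assumptions, not the class's actual code: `FixedBlockSketch` and its methods are hypothetical, and the rule that the block id stops growing once it reaches the maximum is an assumption.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical helper (not part of Terrier): fixed-size block counters. */
final class FixedBlockSketch {
    private final int blockSize;        // plays the role of BLOCK_SIZE
    private final int maxBlocks;        // plays the role of MAX_BLOCKS
    private int blockId = 0;            // block number in the current document
    private int numOfTokensInBlock = 0; // tokens seen in the current block

    FixedBlockSketch(int blockSize, int maxBlocks) {
        this.blockSize = blockSize;
        this.maxBlocks = maxBlocks;
    }

    /** Returns the block id for the next term, then updates the counters. */
    int nextBlockId() {
        int assigned = blockId;
        numOfTokensInBlock++;
        // Assumption: open a new block once the current one is full,
        // unless the block id has already reached the maximum.
        if (numOfTokensInBlock >= blockSize && blockId < maxBlocks) {
            numOfTokensInBlock = 0;
            blockId++;
        }
        return assigned;
    }

    /** Convenience wrapper: block ids for a document of numTerms terms. */
    static List<Integer> assign(int numTerms, int blockSize, int maxBlocks) {
        FixedBlockSketch counter = new FixedBlockSketch(blockSize, maxBlocks);
        List<Integer> ids = new ArrayList<>();
        for (int i = 0; i < numTerms; i++) {
            ids.add(counter.nextBlockId());
        }
        return ids;
    }
}
```

For example, `FixedBlockSketch.assign(5, 2, 10)` gives the ids 0, 0, 1, 1, 2: every second term closes a block. Once the cap is reached, all remaining terms share the last block id.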
Constructor and Description |
---|
Hadoop_BlockSinglePassIndexer()
Constructs an instance of this class, where the created data structures
are stored in the given path.
|
Modifier and Type | Method and Description |
---|---|
protected void |
createDocumentPostings()
Hook method that creates the right type of DocumentTree class.
|
void |
createMemoryPostings()
Hook method that creates the right type of MemoryPostings class.
|
protected RunsMerger |
createtheRunMerger()
Creates the RunsMerger and the RunIteratorFactory.
|
protected TermPipeline |
getEndOfPipeline()
Returns the object that is to be the end of the TermPipeline.
|
protected void |
load_indexer_properties() |
close, closeMap, closeReduce, configure, configureMap, configureReduce, createMetaIndexBuilder, finish, forceFlush, indexEmpty, load_builder_boundary_documents, loadRunData, main, map, mergeDocumentIndex, reduce, startReduce
checkFlush, createDirectIndex, createFieldRunMerger, createInvertedIndex, createInvertedIndex, createRunMerger, finishMemoryPosting, getFileNames, indexDocument, performMultiWayMerge
finishedInvertedIndexBuild
finishedDirectIndexBuild, index, init, load_field_ids, load_pipeline, merge, merge, mergeTwoIndices, parseInts, useFieldInformation
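The hook methods in this summary (createDocumentPostings(), createMemoryPostings(), getEndOfPipeline()) follow the general pattern where a superclass drives the indexing flow and a subclass swaps in block-aware structures. The sketch below shows that pattern only. All names in it (`PostingsSketch`, `BasicIndexerSketch`, `BlockIndexerSketch`, `indexTerm`) are hypothetical and do not reflect Terrier's real types or signatures.

```java
/** Hypothetical stand-in for a per-document postings structure. */
interface PostingsSketch {
    String record(String term, int position);
}

/** Hypothetical base indexer: owns the flow, delegates creation to a hook. */
class BasicIndexerSketch {
    protected PostingsSketch postings;

    /** Hook: the base version records terms without block information. */
    protected void createDocumentPostings() {
        postings = (term, position) -> term;
    }

    /** Fixed flow shared by all subclasses; calls the hook first. */
    public final String indexTerm(String term, int position) {
        createDocumentPostings();
        return postings.record(term, position);
    }
}

/** Hypothetical block-aware subclass: overrides only the hook. */
class BlockIndexerSketch extends BasicIndexerSketch {
    private final int blockSize;

    BlockIndexerSketch(int blockSize) {
        this.blockSize = blockSize;
    }

    @Override
    protected void createDocumentPostings() {
        // Record the term together with a fixed-size block number.
        postings = (term, position) -> term + "@" + (position / blockSize);
    }
}
```

Here the caller uses `indexTerm` identically on both classes. Only the overridden hook decides whether block numbers are stored, which is the design benefit of keeping creation in a hook method.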
protected int numOfTokensInBlock
protected int blockId
protected int BLOCK_SIZE
protected int MAX_BLOCKS
public Hadoop_BlockSinglePassIndexer()
protected TermPipeline getEndOfPipeline()
Overrides: getEndOfPipeline in class BasicIndexer
public void createMemoryPostings()
Overrides: createMemoryPostings in class BasicSinglePassIndexer
protected void createDocumentPostings()
BasicIndexer
Overrides: createDocumentPostings in class BasicIndexer
protected RunsMerger createtheRunMerger()
Hadoop_BasicSinglePassIndexer
Overrides: createtheRunMerger in class Hadoop_BasicSinglePassIndexer
protected void load_indexer_properties()
Overrides: load_indexer_properties in class BasicSinglePassIndexer
Terrier Information Retrieval Platform 4.1. Copyright © 2004-2015, University of Glasgow