public class NoDuplicatesSinglePassIndexing extends BasicSinglePassIndexer
BasicIndexer.BasicTermProcessor, BasicIndexer.FieldTermProcessor
Modifier and Type | Field and Description |
---|---|
protected TreeSet<String> |
seenDocnos |
basicInvertedIndexPostingIteratorClass, currentFile, currentId, docsPerCheck, fieldInvertedIndexPostingIteratorClass, fileNames, invertedIndexClass, invertedIndexInputStreamClass, maxDocsPerFlush, maxMemory, memoryAfterFlush, memoryCheck, merger, mp, numberOfDocsSinceCheck, numberOfDocsSinceFlush, numberOfDocuments, numberOfPointers, numberOfTokens, numberOfUniqueTerms, runtime
compressionDirectConfig, compressionInvertedConfig, numOfTokensInDocument, termFields, termsInDocument
BUILDER_BOUNDARY_DOCUMENTS, currentIndex, directIndexBuilder, docIndexBuilder, emptyDocIndexEntry, fieldNames, fileNameNoExtension, IndexEmptyDocuments, invertedIndexBuilder, lexiconBuilder, logger, MAX_DOCS_PER_BUILDER, MAX_TOKENS_IN_DOCUMENT, metaBuilder, numFields, path, pipeline_first, prefix, useFieldInformation
Modifier | Constructor and Description |
---|---|
protected |
NoDuplicatesSinglePassIndexing(long a,
long b,
long c) |
|
NoDuplicatesSinglePassIndexing(String pathname,
String prefix) |
Modifier and Type | Method and Description |
---|---|
protected void |
indexDocument(Map<String,String> docProperties,
DocumentPostingList termsInDocument)
This adds a document to the direct and document indexes, as well
as it's terms to the lexicon.
|
protected void |
indexEmpty(Map<String,String> docProperties)
Adds an entry to document index for empty document @param docid, only if
IndexEmptyDocuments is set to true.
|
checkFlush, createDirectIndex, createFieldRunMerger, createInvertedIndex, createInvertedIndex, createMemoryPostings, createRunMerger, finishMemoryPosting, forceFlush, getFileNames, load_indexer_properties, performMultiWayMerge
createDocumentPostings, finishedInvertedIndexBuild, getEndOfPipeline
createMetaIndexBuilder, finishedDirectIndexBuild, index, init, load_builder_boundary_documents, load_field_ids, load_pipeline, main, merge, merge, mergeTwoIndices, parseInts, useFieldInformation
protected NoDuplicatesSinglePassIndexing(long a, long b, long c)
protected void indexDocument(Map<String,String> docProperties, DocumentPostingList termsInDocument) throws Exception
indexDocument
in class BasicSinglePassIndexer
docProperties
- MaptermsInDocument
- DocumentPostingList the terms in the document.Exception
protected void indexEmpty(Map<String,String> docProperties) throws IOException
indexEmpty
in class Indexer
IOException
Terrier Information Retrieval Platform4.1. Copyright © 2004-2015, University of Glasgow