public class NoDuplicatesSinglePassIndexing extends BasicSinglePassIndexer
BasicIndexer.BasicTermProcessor, BasicIndexer.FieldTermProcessor| Modifier and Type | Field and Description |
|---|---|
protected TreeSet<String> |
seenDocnos |
basicInvertedIndexPostingIteratorClass, currentFile, currentId, docsPerCheck, fieldInvertedIndexPostingIteratorClass, fileNames, invertedIndexClass, invertedIndexInputStreamClass, maxDocsPerFlush, maxMemory, memoryAfterFlush, memoryCheck, merger, mp, numberOfDocsSinceCheck, numberOfDocsSinceFlush, numberOfDocuments, numberOfPointers, numberOfTokens, numberOfUniqueTerms, runtimecompressionDirectConfig, compressionInvertedConfig, numOfTokensInDocument, termFields, termsInDocumentBUILDER_BOUNDARY_DOCUMENTS, currentIndex, directIndexBuilder, docIndexBuilder, emptyDocIndexEntry, fieldNames, fileNameNoExtension, IndexEmptyDocuments, invertedIndexBuilder, lexiconBuilder, logger, MAX_DOCS_PER_BUILDER, MAX_TOKENS_IN_DOCUMENT, metaBuilder, numFields, path, pipeline_first, prefix, useFieldInformation| Modifier | Constructor and Description |
|---|---|
protected |
NoDuplicatesSinglePassIndexing(long a,
long b,
long c) |
|
NoDuplicatesSinglePassIndexing(String pathname,
String prefix) |
| Modifier and Type | Method and Description |
|---|---|
protected void |
indexDocument(Map<String,String> docProperties,
DocumentPostingList termsInDocument)
This adds a document to the direct and document indexes, as well
as it's terms to the lexicon.
|
protected void |
indexEmpty(Map<String,String> docProperties)
Adds an entry to document index for empty document @param docid, only if
IndexEmptyDocuments is set to true.
|
checkFlush, createDirectIndex, createFieldRunMerger, createInvertedIndex, createInvertedIndex, createMemoryPostings, createRunMerger, finishMemoryPosting, forceFlush, getFileNames, load_indexer_properties, performMultiWayMergecreateDocumentPostings, finishedInvertedIndexBuild, getEndOfPipelinecreateMetaIndexBuilder, finishedDirectIndexBuild, index, init, load_builder_boundary_documents, load_field_ids, load_pipeline, main, merge, merge, mergeTwoIndices, parseInts, useFieldInformationprotected NoDuplicatesSinglePassIndexing(long a,
long b,
long c)
protected void indexDocument(Map<String,String> docProperties, DocumentPostingList termsInDocument) throws Exception
indexDocument in class BasicSinglePassIndexerdocProperties - MaptermsInDocument - DocumentPostingList the terms in the document.Exceptionprotected void indexEmpty(Map<String,String> docProperties) throws IOException
indexEmpty in class IndexerIOExceptionTerrier Information Retrieval Platform4.1. Copyright © 2004-2015, University of Glasgow