Package org.terrier.structures.merging
Class BlockStructureMerger
- java.lang.Object
-
- org.terrier.structures.merging.StructureMerger
-
- org.terrier.structures.merging.BlockStructureMerger
-
public class BlockStructureMerger extends StructureMerger
This class merges two sets of data structures (ie direct index, inverted index, document index, lexicon and statistics), created by Terrier with position information (blocks) and possibly field information, into one set of data structures.- Author:
- Vassilis Plachouras and Craig Macdonald
- See Also:
StructureMerger
-
-
Nested Class Summary
-
Nested classes/interfaces inherited from class org.terrier.structures.merging.StructureMerger
StructureMerger.Command, StructureMerger.NullDocumentIndex
-
-
Field Summary
-
Fields inherited from class org.terrier.structures.merging.StructureMerger
basicDirectIndexPostingIteratorClass, basicInvertedIndexPostingIteratorClass, compressionDirectConfig, compressionInvertedConfig, destIndex, directFileOutputStreamClass, fieldCount, fieldDirectFileOutputStreamClass, fieldDirectIndexPostingIteratorClass, fieldInvertedIndexPostingIteratorClass, keepTermCodeMap, logger, MetaReverse, numberOfDocuments, numberOfPointers, numberOfTerms, srcIndex1, srcIndex2, termcodeHashmap
-
-
Constructor Summary
Constructors Constructor Description BlockStructureMerger(IndexOnDisk _srcIndex1, IndexOnDisk _srcIndex2, IndexOnDisk _destIndex)
constructor
-
Method Summary
All Methods Static Methods Instance Methods Concrete Methods Modifier and Type Method Description static void
main(java.lang.String[] args)
usage: java org.terrier.structures.merging.BlockStructureMerger [binary bits] [inverted file 1] [inverted file 2] [output inverted file]protected void
matchBlockProperties(java.lang.String structureName)
protected void
mergeDirectFiles()
Merges the two direct files and the corresponding document id files.protected void
mergeInvertedFiles()
Merges the two lexicons into one.-
Methods inherited from class org.terrier.structures.merging.StructureMerger
createLexidFile, getInterfaces, mergeDocumentIndexFiles, mergeStructures, setOutputIndex, setReverseMeta
-
-
-
-
Constructor Detail
-
BlockStructureMerger
public BlockStructureMerger(IndexOnDisk _srcIndex1, IndexOnDisk _srcIndex2, IndexOnDisk _destIndex)
constructor- Parameters:
_srcIndex1
-_srcIndex2
-_destIndex
-
-
-
Method Detail
-
mergeInvertedFiles
protected void mergeInvertedFiles()
Description copied from class:StructureMerger
Merges the two lexicons into one. After this stage, the offsets in the lexicon are ot correct. They will be updated only after creating the inverted file.- Overrides:
mergeInvertedFiles
in classStructureMerger
-
mergeDirectFiles
protected void mergeDirectFiles()
Description copied from class:StructureMerger
Merges the two direct files and the corresponding document id files.- Overrides:
mergeDirectFiles
in classStructureMerger
-
matchBlockProperties
protected void matchBlockProperties(java.lang.String structureName)
-
main
public static void main(java.lang.String[] args)
usage: java org.terrier.structures.merging.BlockStructureMerger [binary bits] [inverted file 1] [inverted file 2] [output inverted file]
-
-