org.terrier.indexing.hadoop
Class Hadoop_BlockSinglePassIndexer.DelimTermProcessor

java.lang.Object
  extended by org.terrier.indexing.hadoop.Hadoop_BlockSinglePassIndexer.DelimTermProcessor
All Implemented Interfaces:
TermPipeline
Enclosing class:
Hadoop_BlockSinglePassIndexer

protected class Hadoop_BlockSinglePassIndexer.DelimTermProcessor
extends java.lang.Object
implements TermPipeline

Behaves like BasicTermProcessor, except that block boundaries are defined by delimiter terms rather than by a fixed block size.
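
The following is a minimal, self-contained sketch of delimiter-bounded block assignment, not Terrier's implementation. The class name DelimBlockSketch, the "term@block" posting format, and the accessor methods are hypothetical. The meaning given to the two flags is an assumption inferred from their names: indexDelimiters decides whether a delimiter term is itself recorded, and countDelimiters decides whether a recorded delimiter adds to the token count.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

/** Hypothetical sketch: assigns block ids that advance at delimiter terms. */
public class DelimBlockSketch {
    private final Set<String> delimiters;
    private final boolean indexDelimiters; // assumption: record delimiter terms too
    private final boolean countDelimiters; // assumption: recorded delimiters count as tokens
    private final List<String> postings = new ArrayList<>();
    private int blockId = 0;
    private int tokenCount = 0;

    public DelimBlockSketch(String[] delims, boolean indexDelimiters, boolean countDelimiters) {
        this.delimiters = new HashSet<>(Arrays.asList(delims));
        this.indexDelimiters = indexDelimiters;
        this.countDelimiters = countDelimiters;
    }

    public void processTerm(String t) {
        if (t == null) {
            return; // term was discarded earlier in the pipeline
        }
        if (delimiters.contains(t)) {
            if (indexDelimiters) {
                postings.add(t + "@" + blockId);
                if (countDelimiters) {
                    tokenCount++;
                }
            }
            blockId++; // a delimiter closes the current block
        } else {
            postings.add(t + "@" + blockId);
            tokenCount++;
        }
    }

    public List<String> getPostings() {
        return postings;
    }

    public int getTokenCount() {
        return tokenCount;
    }

    public static void main(String[] args) {
        DelimBlockSketch p = new DelimBlockSketch(new String[] {"<s>"}, false, false);
        for (String t : new String[] {"a", "b", "<s>", "c"}) {
            p.processTerm(t);
        }
        System.out.println(p.getPostings());   // prints [a@0, b@0, c@1]
        System.out.println(p.getTokenCount()); // prints 3
    }
}
```

With a fixed-size scheme the block id would instead advance after every N tokens; here it advances only when a delimiter term is seen, so blocks can have different lengths.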

Since:
2.2
Author:
Rodrygo Santos

Field Summary
protected  gnu.trove.THashSet<java.lang.String> blockDelimiterTerms
           
protected  boolean countDelimiters
           
protected  boolean indexDelimiters
           
 
Constructor Summary
Hadoop_BlockSinglePassIndexer.DelimTermProcessor(java.lang.String[] _delims, boolean _indexDelimiters, boolean _countDelimiters)
           
 
Method Summary
 void processTerm(java.lang.String t)
          Processes a term using the current term pipeline component and passes the output to the next pipeline component, if the term has not been discarded.
 boolean reset()
          This method implements the specific reset option needed to implement a query- or document-oriented policy.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

blockDelimiterTerms

protected gnu.trove.THashSet<java.lang.String> blockDelimiterTerms

indexDelimiters

protected final boolean indexDelimiters

countDelimiters

protected final boolean countDelimiters

Constructor Detail

Hadoop_BlockSinglePassIndexer.DelimTermProcessor

public Hadoop_BlockSinglePassIndexer.DelimTermProcessor(java.lang.String[] _delims,
                                                        boolean _indexDelimiters,
                                                        boolean _countDelimiters)

Method Detail

processTerm

public void processTerm(java.lang.String t)
Description copied from interface: TermPipeline
Processes a term using the current term pipeline component and passes the output to the next pipeline component, if the term has not been discarded.

Specified by:
processTerm in interface TermPipeline
Parameters:
t - String the term to process.
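
To illustrate the pipeline contract described above, here is a small sketch of chained stages in which a term is forwarded only if it has not been discarded. The Stage interface and both stage classes are hypothetical stand-ins written for this example; they are not Terrier's TermPipeline or any of its implementations.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

public class PipelineSketch {
    /** Hypothetical stand-in for one pipeline stage. */
    interface Stage {
        void processTerm(String t);
    }

    /** Discards stopwords; forwards every other term to the next stage. */
    static class StopwordStage implements Stage {
        private final Set<String> stopwords;
        private final Stage next;

        StopwordStage(Set<String> stopwords, Stage next) {
            this.stopwords = stopwords;
            this.next = next;
        }

        @Override
        public void processTerm(String t) {
            if (t == null || stopwords.contains(t)) {
                return; // discarded: the next stage never sees this term
            }
            next.processTerm(t);
        }
    }

    /** Terminal stage: records the terms that reached the end of the chain. */
    static class CollectingStage implements Stage {
        final List<String> terms = new ArrayList<>();

        @Override
        public void processTerm(String t) {
            terms.add(t);
        }
    }

    /** Runs the tokens through stopword filter -> collector and returns the survivors. */
    public static List<String> run(String[] tokens, Set<String> stopwords) {
        CollectingStage sink = new CollectingStage();
        Stage head = new StopwordStage(stopwords, sink);
        for (String t : tokens) {
            head.processTerm(t);
        }
        return sink.terms;
    }

    public static void main(String[] args) {
        Set<String> stop = new HashSet<>(Arrays.asList("the"));
        System.out.println(run(new String[] {"the", "quick", "fox"}, stop)); // prints [quick, fox]
    }
}
```

In this arrangement a term processor that writes to the index would sit in the position of CollectingStage, at the end of the chain.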

reset

public boolean reset()
Description copied from interface: TermPipeline
This method implements the specific reset option needed to implement a query- or document-oriented policy.

Specified by:
reset in interface TermPipeline
Returns:
the result of the reset operation


Terrier 3.5. Copyright © 2004-2011 University of Glasgow