Package | Description |
---|---|
org.terrier.realtime.memory |
Provides MemoryIndex structures.
|
org.terrier.realtime.memory.fields |
Provides MemoryIndex structures that support field search.
|
org.terrier.structures.indexing |
Provides the classes used for creating the data structures of
the Terrier platform.
|
org.terrier.structures.indexing.classical |
Provides functionality for creating on-disk indices via indexer classes.
|
org.terrier.structures.indexing.singlepass |
Provides implementations of the structures needed for performing
single-pass indexing.
|
org.terrier.structures.indexing.singlepass.hadoop |
Provides classes implementing Hadoop MapReduce indexing in Terrier.
|
org.terrier.terms |
Provides the interface and classes for the term pipeline,
a set of objects that process the terms during indexing and
processing of queries.
|
Modifier and Type | Field and Description |
---|---|
protected TermPipeline |
MemoryIndex.pipeline_first
FIXME
|
Modifier and Type | Method and Description |
---|---|
protected TermPipeline |
MemoryIndex.getEndOfPipeline()
FIXME
|
Modifier and Type | Field and Description |
---|---|
protected TermPipeline |
MemoryFields.pipeline |
Modifier and Type | Method and Description |
---|---|
protected TermPipeline |
MemoryFields.getEndOfPipeline() |
Modifier and Type | Field and Description |
---|---|
protected TermPipeline |
Indexer.pipeline_first
The first component of the term pipeline.
|
Modifier and Type | Method and Description |
---|---|
protected abstract TermPipeline |
Indexer.getEndOfPipeline()
An abstract method that returns the last component
of the term pipeline.
|
Modifier and Type | Class and Description |
---|---|
protected class |
BasicIndexer.BasicTermProcessor
This class implements an end of a TermPipeline that adds the
term to the DocumentTree.
|
protected class |
BasicIndexer.FieldTermProcessor
This class implements an end of a TermPipeline that adds the
term to the DocumentTree.
|
protected class |
BlockIndexer.BasicTermProcessor
This class implements an end of a TermPipeline that adds the
term to the DocumentTree.
|
protected class |
BlockIndexer.DelimFieldTermProcessor
This class behaves in a similar fashion to FieldTermProcessor except that
this one treats blocks bounded by delimiters instead of fixed-sized blocks.
|
protected class |
BlockIndexer.DelimTermProcessor
This class behaves in a similar fashion to BasicTermProcessor except that
this one treats blocks bounded by delimiters instead of fixed-sized blocks.
|
protected class |
BlockIndexer.FieldTermProcessor
This class implements an end of a TermPipeline that adds the
term to the DocumentTree.
|
Modifier and Type | Method and Description |
---|---|
protected TermPipeline |
BasicIndexer.getEndOfPipeline()
Returns the end of the term pipeline, which corresponds to
an instance of either BasicIndexer.BasicTermProcessor, or
BasicIndexer.FieldTermProcessor, depending on whether
field information is stored.
|
protected TermPipeline |
BlockIndexer.getEndOfPipeline()
Returns the object that is to be the end of the TermPipeline.
|
Modifier and Type | Class and Description |
---|---|
protected class |
BlockSinglePassIndexer.BasicTermProcessor
This class implements an end of a TermPipeline that adds the
term to the DocumentTree.
|
protected class |
BlockSinglePassIndexer.DelimFieldTermProcessor
This class behaves in a similar fashion to FieldTermProcessor except that
this one treats blocks bounded by delimiters instead of fixed-sized blocks.
|
protected class |
BlockSinglePassIndexer.DelimTermProcessor
This class behaves in a similar fashion to BasicTermProcessor except that
this one treats blocks bounded by delimiters instead of fixed-sized blocks.
|
protected class |
BlockSinglePassIndexer.FieldTermProcessor
This class implements an end of a TermPipeline that adds the
term to the DocumentTree.
|
Modifier and Type | Method and Description |
---|---|
protected abstract TermPipeline |
ExtensibleSinglePassIndexer.getEndOfPipeline()
Returns the end of the term pipeline, which corresponds to
an instance of either BasicIndexer.BasicTermProcessor, or
BasicIndexer.FieldTermProcessor, depending on whether
field information is stored.
|
protected TermPipeline |
BlockSinglePassIndexer.getEndOfPipeline()
Returns the object that is to be the end of the TermPipeline.
|
Modifier and Type | Class and Description |
---|---|
protected class |
Hadoop_BlockSinglePassIndexer.BasicTermProcessor
This class implements an end of a TermPipeline that adds the
term to the DocumentTree.
|
protected class |
Hadoop_BlockSinglePassIndexer.DelimFieldTermProcessor
This class behaves in a similar fashion to FieldTermProcessor except that
this one treats blocks bounded by delimiters instead of fixed-sized blocks.
|
protected class |
Hadoop_BlockSinglePassIndexer.DelimTermProcessor
This class behaves in a similar fashion to BasicTermProcessor except that
this one treats blocks bounded by delimiters instead of fixed-sized blocks.
|
protected class |
Hadoop_BlockSinglePassIndexer.FieldTermProcessor
This class implements an end of a TermPipeline that adds the
term to the DocumentTree.
|
Modifier and Type | Method and Description |
---|---|
protected TermPipeline |
Hadoop_BlockSinglePassIndexer.getEndOfPipeline()
Returns the object that is to be the end of the TermPipeline.
|
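
The `getEndOfPipeline()` methods listed above each return the terminal stage that receives terms after every other pipeline component has processed them. The following is a minimal sketch of that pattern only. It does not use Terrier's classes: the `TermPipeline` interface here is a simplified stand-in, and `CountingTermProcessor`, `SketchIndexer`, and `indexDocument` are hypothetical names introduced for illustration.

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class EndOfPipelineSketch {

    /** Simplified stand-in for Terrier's TermPipeline; NOT the real interface. */
    interface TermPipeline {
        void processTerm(String term);
    }

    /** Hypothetical terminal stage: counts each term it receives for one document. */
    static class CountingTermProcessor implements TermPipeline {
        final Map<String, Integer> termFrequencies = new LinkedHashMap<>();

        @Override
        public void processTerm(String term) {
            if (term == null) {
                return; // assumption: a null term is silently ignored
            }
            termFrequencies.merge(term, 1, Integer::sum);
        }
    }

    /** Hypothetical indexer base class: subclasses choose the terminal stage. */
    abstract static class SketchIndexer {
        protected abstract TermPipeline getEndOfPipeline();
    }

    /** Feeds tokens to the terminal stage and returns the recorded frequencies. */
    public static Map<String, Integer> indexDocument(String[] tokens) {
        CountingTermProcessor end = new CountingTermProcessor();
        SketchIndexer indexer = new SketchIndexer() {
            @Override
            protected TermPipeline getEndOfPipeline() {
                return end;
            }
        };
        TermPipeline pipeline = indexer.getEndOfPipeline();
        for (String token : tokens) {
            pipeline.processTerm(token);
        }
        return end.termFrequencies;
    }

    public static void main(String[] args) {
        // prints {terrier=2, index=1}
        System.out.println(indexDocument(new String[] {"terrier", "index", "terrier"}));
    }
}
```

In this sketch the indexer only decides which object ends the pipeline. What that object does with each term (here, counting) is left to the terminal class, which is why the listings above pair each indexer with its own term processor classes.
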
Modifier and Type | Class and Description |
---|---|
class |
BaseTermPipelineAccessor
A base implementation for TermPipelineAccessor
|
class |
CropTerm
Reduces the size of terms passing through the term pipeline to the maximum allowed
size for this indexing run.
|
class |
DanishSnowballStemmer
Danish stemmer implemented by Snowball.
|
class |
DumpTerm
A TermPipeline object, useful during development, that
prints every term that passes through it to System.err.
|
class |
DutchSnowballStemmer
Dutch stemmer implemented by Snowball.
|
class |
EnglishSnowballStemmer
English stemmer implemented by Snowball.
|
class |
FinnishSnowballStemmer
Finnish stemmer implemented by Snowball.
|
class |
FrenchSnowballStemmer
French stemmer implemented by Snowball.
|
class |
GermanSnowballStemmer
German stemmer implemented by Snowball.
|
class |
HungarianSnowballStemmer
Hungarian stemmer implemented by Snowball.
|
class |
ItalianSnowballStemmer
Italian stemmer implemented by Snowball.
|
class |
NoOp
A do-nothing term pipeline object.
|
class |
NorwegianSnowballStemmer
Norwegian stemmer implemented by Snowball.
|
class |
PorterStemmer
Stemmer, implementing the Porter Stemming Algorithm.
|
class |
PortugueseSnowballStemmer
Portuguese stemmer implemented by Snowball.
|
class |
RemoveDiacritics
Removes diacritics in letters.
|
class |
RomanianSnowballStemmer
Romanian stemmer implemented by Snowball.
|
class |
RussianSnowballStemmer
Russian stemmer implemented by Snowball.
|
class |
SkipTermPipeline
Identifies tokens that should not be passed down the entire term pipeline, and
instead passes them directly on to a specified stage.
|
class |
SnowballStemmer
Classic Snowball stemmer implemented by Snowball.
|
class |
SpanishSnowballStemmer
Spanish stemmer implemented by Snowball.
|
class |
StemmerTermPipeline
Abstract base class for Stemmers that are also TermPipeline instances
|
class |
Stopwords
Implements stopword removal, as a TermPipeline object.
|
class |
SwedishSnowballStemmer
Swedish stemmer implemented by Snowball.
|
class |
TRv2PorterStemmer
This is the Porter stemming algorithm, coded in Java by Gianni Amati.
|
class |
TRv2WeakPorterStemmer
An implementation of the Porter stemming algorithm that uses only the first
step of the algorithm.
|
class |
TurkishSnowballStemmer
Turkish stemmer implemented by Snowball.
|
class |
WeakPorterStemmer
Weak Porter Stemmer, using Porter's Java implementation as the base.
|
Modifier and Type | Field and Description |
---|---|
protected TermPipeline |
RemoveDiacritics.next |
protected TermPipeline |
Stopwords.next
The next component in the term pipeline.
|
protected TermPipeline |
StemmerTermPipeline.next |
protected TermPipeline |
CropTerm.next
The next object in the term pipeline
|
protected TermPipeline |
NoOp.next
The next component in the term pipeline.
|
Constructor and Description |
---|
CropTerm(TermPipeline _next)
Creates a new CropTerm pipeline object, which can be used in the
term pipeline
|
DanishSnowballStemmer(TermPipeline n)
constructor
|
DumpTerm(TermPipeline _next)
Constructs a new DumpTerm object.
|
DutchSnowballStemmer(TermPipeline n)
constructor
|
EnglishSnowballStemmer(TermPipeline n)
constructor
|
FinnishSnowballStemmer(TermPipeline n)
constructor
|
FrenchSnowballStemmer(TermPipeline n)
constructor
|
GermanSnowballStemmer(TermPipeline n)
constructor
|
HungarianSnowballStemmer(TermPipeline n)
constructor
|
ItalianSnowballStemmer(TermPipeline n)
constructor
|
NoOp(TermPipeline _next)
Constructs an instance of the class, given the next
component in the pipeline.
|
NorwegianSnowballStemmer(TermPipeline n)
constructor
|
PorterStemmer(TermPipeline next)
Constructs an instance of PorterStemmer.
|
PortugueseSnowballStemmer(TermPipeline n)
constructor
|
RemoveDiacritics(TermPipeline next) |
RomanianSnowballStemmer(TermPipeline n)
constructor
|
RussianSnowballStemmer(TermPipeline n)
constructor
|
SkipTermPipeline(TermPipeline _next,
TermPipeline _last)
Instantiate this object, using properties to define tokens.
|
SkipTermPipeline(TermPipeline _next,
TermPipeline _last,
String[] _skipTokens)
Instantiate this object.
|
SnowballStemmer(String StemLanguage,
TermPipeline next)
Creates a new stemmer object for the language StemLanguage.
|
SpanishSnowballStemmer(TermPipeline n)
constructor
|
Stopwords(TermPipeline _next)
Makes a new stopword term pipeline object.
|
Stopwords(TermPipeline _next,
String StopwordsFile)
Makes a new stopword term pipeline object.
|
Stopwords(TermPipeline _next,
String[] StopwordsFiles)
Makes a new stopword term pipeline object.
|
SwedishSnowballStemmer(TermPipeline n)
constructor
|
TRv2PorterStemmer(TermPipeline next)
Constructs an instance of the TRv2PorterStemmer.
|
TRv2WeakPorterStemmer(TermPipeline next)
Constructs an instance of the TRv2WeakPorterStemmer.
|
TurkishSnowballStemmer(TermPipeline n)
constructor
|
WeakPorterStemmer(TermPipeline next)
constructor
|
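
Most constructors listed above take the next `TermPipeline` component as an argument, so a pipeline is assembled by nesting constructor calls from the last stage back to the first. The following is a minimal sketch of that chaining idea only. It does not use Terrier's classes: the `TermPipeline` interface is a simplified stand-in, and `StopwordFilter`, `Crop`, `Collector`, the stopword list, and the length limit of 5 are all hypothetical choices made for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

public class PipelineChainSketch {

    /** Simplified stand-in for Terrier's TermPipeline; NOT the real interface. */
    interface TermPipeline {
        void processTerm(String term);
    }

    /** Hypothetical stage: drops stopwords, forwards everything else. */
    static class StopwordFilter implements TermPipeline {
        private final TermPipeline next;
        private final Set<String> stopwords;

        StopwordFilter(TermPipeline next, String[] stopwords) {
            this.next = next;
            this.stopwords = new HashSet<>(Arrays.asList(stopwords));
        }

        @Override
        public void processTerm(String term) {
            if (term == null || stopwords.contains(term)) {
                return; // the term never reaches later stages
            }
            next.processTerm(term);
        }
    }

    /** Hypothetical stage: truncates terms longer than maxLength characters. */
    static class Crop implements TermPipeline {
        private final TermPipeline next;
        private final int maxLength;

        Crop(TermPipeline next, int maxLength) {
            this.next = next;
            this.maxLength = maxLength;
        }

        @Override
        public void processTerm(String term) {
            if (term == null) {
                return;
            }
            next.processTerm(term.length() > maxLength ? term.substring(0, maxLength) : term);
        }
    }

    /** Hypothetical end of the pipeline: records the terms that survive. */
    static class Collector implements TermPipeline {
        final List<String> terms = new ArrayList<>();

        @Override
        public void processTerm(String term) {
            terms.add(term);
        }
    }

    /** Builds stopword filter -> crop -> collector and pushes the tokens through. */
    public static List<String> run(String[] tokens) {
        Collector end = new Collector();
        TermPipeline first = new StopwordFilter(new Crop(end, 5), new String[] {"the", "of"});
        for (String token : tokens) {
            first.processTerm(token);
        }
        return end.terms;
    }

    public static void main(String[] args) {
        // prints [retri, terms]
        System.out.println(run(new String[] {"the", "retrieval", "of", "terms"}));
    }
}
```

Because each stage holds only a reference to its successor, stages can be reordered, added, or removed without changing any of the other stages.
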
Terrier Information Retrieval Platform 4.1. Copyright © 2004-2015, University of Glasgow