Terrier IR Platform
2.2.1

Package uk.ac.gla.terrier.terms

Provides the interface and classes for the term pipeline, a set of objects that process the terms during indexing and processing of queries.

See:
          Description

Interface Summary
TermPipeline Models the concept of a component in a pipeline of term processors.
TermPipelineAccessor This interface allows code to access the TermPipeline without implementing the end of the term pipeline.
 

Class Summary
CropTerm Reduces the size of terms passing through the term pipeline to the maximum allowed size for this indexing run.
DanishSnowballStemmer Danish stemmer implmented by Snowball.
DumpTerm Useful development phase TermPipeline object that allows prints every term that passes through it to System.err
DutchSnowballStemmer Dutch stemmer implmented by Snowball.
EnglishSnowballStemmer English stemmer implmented by Snowball.
FinnishSnowballStemmer Finnish stemmer implmented by Snowball.
FrenchSnowballStemmer French stemmer implmented by Snowball.
GermanSnowballStemmer German stemmer implmented by Snowball.
HungarianSnowballStemmer Hungerian stemmer implmented by Snowball.
ItalianSnowballStemmer Italian stemmer implmented by Snowball.
NoOp A do-nothing term pipeline object.
NorwegianSnowballStemmer Norwegian stemmer implmented by Snowball.
PorterStemmer This is the Porter stemming algorithm, coded up in JAVA by Gianni Amati.
PortugueseSnowballStemmer Portuguese stemmer implmented by Snowball.
RomanianSnowballStemmer Romanian stemmer implmented by Snowball.
RussianSnowballStemmer Russian stemmer implmented by Snowball.
SkipTermPipeline Class that identified tokens which should not be passed down the entire term pipeline, and instead passed onto a specified stage instead.
SnowballStemmer  
SpanishSnowballStemmer Spanish stemmer implmented by Snowball.
Stopwords Implements stopword removal, as a TermPipeline object.
SwedishSnowballStemmer Swedish stemmer implmented by Snowball.
TurkishSnowballStemmer Turkish stemmer implmented by Snowball.
WeakPorterStemmer An implementation of the Porter stemming algorithm that uses only the first step of the algorithm.
 

Package uk.ac.gla.terrier.terms Description

Provides the interface and classes for the term pipeline, a set of objects that process the terms during indexing and processing of queries.

This package includes implementations of a stop-word remover, as well as a full and a weak version of Porter's stemming algorithm.


Terrier IR Platform
2.2.1

Terrier Information Retrieval Platform 2.2.1. Copyright 2004-2008 University of Glasgow