|
||||||||||
| PREV NEXT | FRAMES NO FRAMES | |||||||||
| Packages that use org.terrier.indexing | |
|---|---|
| org.terrier.indexing | Provides classes and interfaces related to the indexing of documents. |
| org.terrier.indexing.hadoop | Provides classes for Terrier's MapReduce indexer. |
| org.terrier.structures.indexing.singlepass.hadoop | Provides classes implemeting the Hadoop MapReduce indexing in Terrier. |
| Classes in org.terrier.indexing used by org.terrier.indexing | |
|---|---|
| BasicIndexer
BasicIndexer is the default indexer for Terrier. |
|
| BasicSinglePassIndexer
This class indexes a document collection (skipping the direct file construction). |
|
| Collection
This interface encapsulates the most fundamental concept to indexing with Terrier - a Collection. |
|
| Document
This interface encapsulates the concept of a document during indexing. |
|
| DocumentExtractor
Deprecated. |
|
| FileDocument
Models a document which corresponds to one file. |
|
| Indexer
Properties: termpipelines - the sequence of TermPipeline stages (e.g. |
|
| SimpleXMLCollection
Initial implementation of a class that generates a Collection with Documents from a series of XML files. |
|
| SinglePassIndexerFlushDelegate
Used by ExtensibleSinglePassIndexer for
delegating the flushing of memory. |
|
| TaggedDocument
Models a tagged document (e.g., an HTML or TREC document). |
|
| Tokenizer
The specification of the interface implemented by tokeniser classes. |
|
| TRECCollection
Models a TREC test collection by implementing the interfaces Collection and DocumentExtractor. |
|
| TRECFullTokenizer
This class is the tokenizer used for indexing TREC topic files. |
|
| Classes in org.terrier.indexing used by org.terrier.indexing.hadoop | |
|---|---|
| BasicIndexer
BasicIndexer is the default indexer for Terrier. |
|
| BasicSinglePassIndexer
This class indexes a document collection (skipping the direct file construction). |
|
| Document
This interface encapsulates the concept of a document during indexing. |
|
| Indexer
Properties: termpipelines - the sequence of TermPipeline stages (e.g. |
|
| Classes in org.terrier.indexing used by org.terrier.structures.indexing.singlepass.hadoop | |
|---|---|
| Collection
This interface encapsulates the most fundamental concept to indexing with Terrier - a Collection. |
|
| Document
This interface encapsulates the concept of a document during indexing. |
|
|
||||||||||
| PREV NEXT | FRAMES NO FRAMES | |||||||||