Terrier IR Platform
2.2.1

Uses of Interface
uk.ac.gla.terrier.indexing.Collection

Packages that use Collection
uk.ac.gla.terrier.indexing Provides classes and interfaces related to the indexing of documents. 
 

Uses of Collection in uk.ac.gla.terrier.indexing
 

Classes in uk.ac.gla.terrier.indexing that implement Collection
 class SimpleFileCollection
          Implements a collection that can read arbitrary files on disk.
 class SimpleMedlineXMLCollection
          Initial implementation of a class that generates a Collection with Documents from a series of XML files in the Medline format.
 class SimpleXMLCollection
          Initial implementation of a class that generates a Collection with Documents from a series of XML files.
 class TRECCollection
          Models a TREC test collection by implementing the interfaces Collection and DocumentExtractor.
 class TRECUTFCollection
          Extends TRECCollection to provide support for indexing TREC collection in non-ASCII character sets.
 

Methods in uk.ac.gla.terrier.indexing that return Collection
static Collection CollectionFactory.loadCollection(java.lang.String CollectionName)
          Load collection(s) of the specified name.
static Collection CollectionFactory.loadCollection(java.lang.String CollectionName, java.lang.Class[] contructorTypes, java.lang.Object[] constructorValues)
          Load collection(s) of the specified name.
static Collection CollectionFactory.loadCollections()
          Use the default property trec.collection.class, or it's default value TRECCollection
static Collection CollectionFactory.loadCollections(java.lang.String[] collNames)
          Load collection(s) of the specified name.
static Collection CollectionFactory.loadCollections(java.lang.String[] collNames, java.lang.Class[] contructorTypes, java.lang.Object[] constructorValues)
          Load collection(s) of the specified name.
 

Methods in uk.ac.gla.terrier.indexing with parameters of type Collection
 void BasicIndexer.createDirectIndex(Collection[] collections)
          Creates the direct index, the document index and the lexicon.
 void BasicSinglePassIndexer.createDirectIndex(Collection[] collections)
           
 void BlockIndexer.createDirectIndex(Collection[] collections)
          For the given collection, it iterates through the documents and creates the direct index, document index and lexicon, using information about blocks and possibly fields.
abstract  void Indexer.createDirectIndex(Collection[] collections)
          An abstract method for creating the direct index, the document index and the lexicon for the given collections.
 void BasicSinglePassIndexer.createInvertedIndex(Collection[] collections)
          Builds the inverted file and lexicon file for the given collections Loops through each document in each of the collections, extracting terms and pushing these through the Term Pipeline (eg stemming, stopping, lowercase).
 void Indexer.index(Collection[] collections)
          Creates the data structures for a set of collections.
 


Terrier IR Platform
2.2.1

Terrier Information Retrieval Platform 2.2.1. Copyright 2004-2008 University of Glasgow