Terrier IR Platform
1.1.1

uk.ac.gla.terrier.applications
Class TRECIndexing

java.lang.Object
  extended by uk.ac.gla.terrier.applications.TRECIndexing

public class TRECIndexing
extends java.lang.Object

This class creates the indices for a test collection.

Version:
$Revision: 1.42 $
Author:
Gianni Amati, Vassilis Plachouras, Ben He, Craig Macdonald

Constructor Summary
TRECIndexing()
          A default constructor that initialised the data structures to use for indexing.
 
Method Summary
 void createDirectFile()
          Builds the direct file and lexicon.
 void createInvertedFile()
          Building the inverted file.
 void index()
          Calls the method index(Collection[]) of the class Indexer in order to build the data structures for a set of collections.
static void main(java.lang.String[] args)
          Used for testing purposes.
 
Methods inherited from class java.lang.Object
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

TRECIndexing

public TRECIndexing()
A default constructor that initialised the data structures to use for indexing.

Method Detail

index

public void index()
Calls the method index(Collection[]) of the class Indexer in order to build the data structures for a set of collections. This particular method of the Indexer uses a set of builders for a subset of the collection and builds separate data structures, which are later merged.


createInvertedFile

public void createInvertedFile()
Building the inverted file.


createDirectFile

public void createDirectFile()
Builds the direct file and lexicon. This method goes through the input files specified in the collections.spec file and processes them in groups of n documents, where n is specified by the property bundle.size. Then, it merges the temporary lexicon files. If it necessary, it calls for the optimisation of the identifiers assigned to terms.


main

public static void main(java.lang.String[] args)
Used for testing purposes.

Parameters:
args - the command line arguments.

Terrier IR Platform
1.1.1

Terrier Information Retrieval Platform 1.1.1. Copyright 2004-2007 University of Glasgow