Terrier IR Platform
1.1.1

uk.ac.gla.terrier.indexing
Interface DocumentExtractor

All Known Implementing Classes:
TRECCollection, TRECUTFCollection

public interface DocumentExtractor

The interface for the collection objects that give access to the text (string) of the documents in the collection

Version:
$Revision: 1.7 $
Author:
Vassilis Plachouras

Method Summary
 java.lang.String getDocumentString(int docid)
          Returns the text of a document with the given identifier.
 

Method Detail

getDocumentString

java.lang.String getDocumentString(int docid)
Returns the text of a document with the given identifier.

Parameters:
docid - the internal identifier of a document.
Returns:
String the text of the document as a string.

Terrier IR Platform
1.1.1

Terrier Information Retrieval Platform 1.1.1. Copyright 2004-2007 University of Glasgow