Terrier IR Platform
2.2.1

uk.ac.gla.terrier.indexing
Interface DocumentExtractor

All Known Implementing Classes:
TRECCollection, TRECUTFCollection

public interface DocumentExtractor

The interface for the collection objects that give access to the text (string) of the documents in the collection

Version:
$Revision: 1.9 $
Author:
Vassilis Plachouras

Method Summary
 java.lang.String getDocumentString(int docid)
          Returns the text of a document with the given identifier.
 

Method Detail

getDocumentString

java.lang.String getDocumentString(int docid)
Returns the text of a document with the given identifier.

Parameters:
docid - the internal identifier of a document.
Returns:
String the text of the document as a string.

Terrier IR Platform
2.2.1

Terrier Information Retrieval Platform 2.2.1. Copyright 2004-2008 University of Glasgow