org.terrier.indexing
Interface DocumentExtractor

All Known Implementing Classes:
TRECCollection, TRECUTFCollection, TRECWebCollection

Deprecated.

@Deprecated
public interface DocumentExtractor

The interface for the collection objects that give access to the text (string) of the documents in the collection. This class will be removed in versions after 3.5.

Author:
Vassilis Plachouras

Method Summary
 java.lang.String getDocumentString(int docid)
          Deprecated. Returns the text of a document with the given identifier.
 

Method Detail

getDocumentString

java.lang.String getDocumentString(int docid)
Deprecated. 
Returns the text of a document with the given identifier.

Parameters:
docid - the internal identifier of a document.
Returns:
String the text of the document as a string.


Terrier 3.5. Copyright © 2004-2011 University of Glasgow