Interface DocumentExtractor


  • @Deprecated
    public interface DocumentExtractor
    Deprecated.
    The interface for the collection objects that give access to the text (string) of the documents in the collection. This class will be removed in versions after 3.5.
    Author:
    Vassilis Plachouras
    • Method Detail

      • getDocumentString

        java.lang.String getDocumentString​(int docid)
        Deprecated.
        Returns the text of a document with the given identifier.
        Parameters:
        docid - the internal identifier of a document.
        Returns:
        String the text of the document as a string.