Terrier IR Platform
1.1.1

Uses of Interface
uk.ac.gla.terrier.indexing.Document

Packages that use Document
uk.ac.gla.terrier.indexing Provides classes and interfaces related to the indexing of documents. 
 

Uses of Document in uk.ac.gla.terrier.indexing
 

Classes in uk.ac.gla.terrier.indexing that implement Document
 class FileDocument
          Models a document which corresponds to one file.
 class HTMLDocument
          Models an HTML document.
 class MSExcelDocument
          Implements a Document object for a Microsoft Excel spreadsheet.
 class MSPowerpointDocument
          Implements a Document object for reading Microsoft Powerpoint files.
 class MSWordDocument
          This class is used for indexing MS Word document files (ie files ending .doc).
 class PDFDocument
          Implements a Document object for reading PDF documents.
 class TRECDocument
          Models a document in a TREC collection.
 

Methods in uk.ac.gla.terrier.indexing that return Document
static Document TRECDocument.generateDocumentFromFile(java.lang.String filename)
          instantiates a TREC document from a file
 Document Collection.getDocument()
          Get the document object representing the current document.
 Document SimpleFileCollection.getDocument()
          Return the current document in the collection.
 Document SimpleXMLCollection.getDocument()
           
 Document TRECCollection.getDocument()
          Returns the current document to process.
 Document TRECUTFCollection.getDocument()
          Overrides the getDocument() method in TRECCollection, so a UTF compatible Document object is returned.
 Document TRECCollection.getDocument(TagSet _tags, TagSet _exact, TagSet _fields)
          A TREC-specific getDocument method, that allows the tags to be specified for each document.
 Document TRECUTFCollection.getDocument(TagSet _tags, TagSet _exact, TagSet _fields)
          A TREC-specific getDocument method, that allows the tags to be specified for each document.
 

Methods in uk.ac.gla.terrier.indexing with parameters of type Document
static void TRECDocument.dumpDocument(Document d)
          Dumps a document to stdout
 


Terrier IR Platform
1.1.1

Terrier Information Retrieval Platform 1.1.1. Copyright 2004-2007 University of Glasgow