Package org.terrier.indexing
Class MSWordDocument
- java.lang.Object
-
- org.terrier.indexing.FileDocument
-
- org.terrier.indexing.POIDocument
-
- org.terrier.indexing.MSWordDocument
-
- All Implemented Interfaces:
Document
@Deprecated public class MSWordDocument extends POIDocument
Deprecated.MSPowerPointDocument is a placeholder class - all functionality is implemented inPOIDocument
. This class will be removed in a future release of Terrier.
-
-
Nested Class Summary
-
Nested classes/interfaces inherited from class org.terrier.indexing.FileDocument
FileDocument.ReaderWrapper
-
-
Field Summary
-
Fields inherited from class org.terrier.indexing.FileDocument
abstractlength, abstractname, abstractwritten, br, EOD, filename, fileProperties, logger, tokenStream
-
-
Constructor Summary
Constructors Constructor Description MSWordDocument(java.io.InputStream docStream, java.util.Map<java.lang.String,java.lang.String> docProperties, Tokeniser tok)
Deprecated.MSWordDocument(java.lang.String filename, java.io.InputStream docStream, Tokeniser tokeniser)
Deprecated.
-
Method Summary
-
Methods inherited from class org.terrier.indexing.POIDocument
getExtractor, getReader
-
Methods inherited from class org.terrier.indexing.FileDocument
endOfDocument, getAllProperties, getFields, getNextTerm, getProperty, getReader, makeFilenameProperties, setProperty
-
-
-
-
Constructor Detail
-
MSWordDocument
public MSWordDocument(java.io.InputStream docStream, java.util.Map<java.lang.String,java.lang.String> docProperties, Tokeniser tok)
Deprecated.
-
MSWordDocument
public MSWordDocument(java.lang.String filename, java.io.InputStream docStream, Tokeniser tokeniser)
Deprecated.
-
-