org.terrier.indexing
Class HTMLDocument

java.lang.Object
  extended by org.terrier.indexing.TaggedDocument
      extended by org.terrier.indexing.HTMLDocument
All Implemented Interfaces:
Document

Deprecated.

@Deprecated
public class HTMLDocument
extends TaggedDocument

HTMLDocument is a placeholder class - all functionality is implemented in TaggedDocument. This class will be removed in a future release of Terrier.


Field Summary
 
Fields inherited from class org.terrier.indexing.TaggedDocument
_exact, _fields, _tags, abstractCount, abstractlengths, abstractnames, abstracts, abstracttags, abstractTagsCaseSensitive, br, counter, currentTokenStream, elseAbstractSpecialTag, EOD, error, htmlStk, inHtmlTagToProcess, inTagToProcess, inTagToSkip, lastChar, logger, lowercase, maxNumOfDigitsPerTerm, maxNumOfSameConseqLettersPerTerm, properties, stk, stringArray, sw, tagNameSB, tokeniser, tokenMaximumLength
 
Constructor Summary
HTMLDocument(java.io.Reader docReader, java.util.Map<java.lang.String,java.lang.String> docProperties, Tokeniser _tokeniser)
          Deprecated. create html document
 
Method Summary
 
Methods inherited from class org.terrier.indexing.TaggedDocument
check, dumpDocument, endOfDocument, generateDocumentFromFile, getAllProperties, getFields, getNextTerm, getProperty, getReader, main, processEndOfDocument, processEndOfTag, saveToAbstract, setProperty
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

HTMLDocument

public HTMLDocument(java.io.Reader docReader,
                    java.util.Map<java.lang.String,java.lang.String> docProperties,
                    Tokeniser _tokeniser)
Deprecated. 
create html document

Parameters:
docReader -
docProperties -
_tokeniser -


Terrier 3.5. Copyright © 2004-2011 University of Glasgow