|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Object org.terrier.indexing.TaggedDocument org.terrier.indexing.HTMLDocument
@Deprecated public class HTMLDocument
HTMLDocument is a placeholder class - all functionality is implemented
in TaggedDocument
. This class will be removed in a future
release of Terrier.
Field Summary |
---|
Fields inherited from class org.terrier.indexing.TaggedDocument |
---|
_exact, _fields, _tags, abstractCount, abstractlengths, abstractnames, abstracts, abstracttags, abstractTagsCaseSensitive, br, counter, currentTokenStream, elseAbstractSpecialTag, EOD, error, htmlStk, inHtmlTagToProcess, inTagToProcess, inTagToSkip, lastChar, logger, lowercase, maxNumOfDigitsPerTerm, maxNumOfSameConseqLettersPerTerm, properties, stk, stringArray, sw, tagNameSB, tokeniser, tokenMaximumLength |
Constructor Summary | |
---|---|
HTMLDocument(java.io.Reader docReader,
java.util.Map<java.lang.String,java.lang.String> docProperties,
Tokeniser _tokeniser)
Deprecated. create html document |
Method Summary |
---|
Methods inherited from class org.terrier.indexing.TaggedDocument |
---|
check, dumpDocument, endOfDocument, generateDocumentFromFile, getAllProperties, getFields, getNextTerm, getProperty, getReader, main, processEndOfDocument, processEndOfTag, saveToAbstract, setProperty |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public HTMLDocument(java.io.Reader docReader, java.util.Map<java.lang.String,java.lang.String> docProperties, Tokeniser _tokeniser)
docReader
- docProperties
- _tokeniser
-
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |