java.lang.Object
  org.terrier.indexing.TaggedDocument
    org.terrier.indexing.HTMLDocument
@Deprecated public class HTMLDocument extends TaggedDocument
HTMLDocument is a placeholder class: all functionality is implemented in TaggedDocument. This class will be removed in a future release of Terrier.
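The placeholder arrangement described above can be illustrated with a minimal, self-contained sketch. The types below (`PlaceholderSketch`, `SketchTaggedDocument`, `SketchHtmlDocument`, `termsOf`) are hypothetical stand-ins invented for this example and are not the Terrier classes. The tokeniser argument is omitted and the tag stripping is deliberately naive. The sketch only shows that a deprecated subclass which forwards its constructor arguments to the parent behaves identically to the parent.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Stand-in types for illustration only; these are NOT the Terrier classes.
class PlaceholderSketch {

    /** Hypothetical stand-in for the parent class, which holds all the logic. */
    static class SketchTaggedDocument {
        protected final Reader reader;
        protected final Map<String, String> properties;

        SketchTaggedDocument(Reader docReader, Map<String, String> docProperties) {
            this.reader = docReader;
            this.properties = docProperties;
        }

        /** Reads the whole input, skips anything inside <...>, returns lowercased terms. */
        List<String> terms() throws IOException {
            StringBuilder text = new StringBuilder();
            boolean inTag = false;
            int c;
            while ((c = reader.read()) != -1) {
                if (c == '<') {
                    inTag = true;
                    text.append(' ');   // a tag also acts as a term separator
                } else if (c == '>') {
                    inTag = false;
                    text.append(' ');
                } else if (!inTag) {
                    text.append((char) c);
                }
            }
            List<String> out = new ArrayList<>();
            for (String t : text.toString().trim().split("\\s+")) {
                if (!t.isEmpty()) {
                    out.add(t.toLowerCase(Locale.ROOT));
                }
            }
            return out;
        }
    }

    /** Hypothetical stand-in for the placeholder: adds nothing, only forwards to the parent. */
    @Deprecated
    static class SketchHtmlDocument extends SketchTaggedDocument {
        SketchHtmlDocument(Reader docReader, Map<String, String> docProperties) {
            super(docReader, docProperties);
        }
    }

    /** Builds either the placeholder or the parent and returns the terms it extracts. */
    static List<String> termsOf(String html, boolean usePlaceholder) {
        Map<String, String> props = new HashMap<>();
        Reader in = new StringReader(html);
        SketchTaggedDocument doc = usePlaceholder
                ? new SketchHtmlDocument(in, props)
                : new SketchTaggedDocument(in, props);
        try {
            return doc.terms();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        String html = "<p>Hello <b>World</b></p>";
        // Both calls run the same parent-class code, so the results are equal.
        System.out.println(termsOf(html, true).equals(termsOf(html, false)));
    }
}
```

Because the placeholder adds no behaviour of its own, code that constructs it can usually switch to constructing the parent class with the same arguments.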
| Field Summary |
|---|
| Fields inherited from class org.terrier.indexing.TaggedDocument |
|---|
| _exact, _fields, _tags, abstractCount, abstractlengths, abstractnames, abstracts, abstracttags, abstractTagsCaseSensitive, br, counter, currentTokenStream, elseAbstractSpecialTag, EOD, error, htmlStk, inHtmlTagToProcess, inTagToProcess, inTagToSkip, lastChar, logger, lowercase, maxNumOfDigitsPerTerm, maxNumOfSameConseqLettersPerTerm, properties, stk, stringArray, sw, tagNameSB, tokeniser, tokenMaximumLength |
| Constructor Summary |
|---|
| HTMLDocument(java.io.Reader docReader, java.util.Map<java.lang.String,java.lang.String> docProperties, Tokeniser _tokeniser) Deprecated. Creates an HTML document. |
| Method Summary |
|---|
| Methods inherited from class org.terrier.indexing.TaggedDocument |
|---|
| check, dumpDocument, endOfDocument, generateDocumentFromFile, getAllProperties, getFields, getNextTerm, getProperty, getReader, main, processEndOfDocument, processEndOfTag, saveToAbstract, setProperty |
| Methods inherited from class java.lang.Object |
|---|
| clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Constructor Detail |
|---|
public HTMLDocument(java.io.Reader docReader,
                    java.util.Map<java.lang.String,java.lang.String> docProperties,
                    Tokeniser _tokeniser)
Parameters:
docReader -
docProperties -
_tokeniser -