Terrier IR Platform 2.2.1
java.lang.Object
  uk.ac.gla.terrier.indexing.TRECCollection
    uk.ac.gla.terrier.indexing.TRECUTFCollection
public class TRECUTFCollection extends TRECCollection
Extends TRECCollection to support indexing TREC collections in non-ASCII character sets. To this end, the TRECDocument has been extended so that it accepts any character for which Character.isLetterOrDigit() returns true.
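The effect of testing characters with Character.isLetterOrDigit() can be seen with the JDK alone. The following is a minimal sketch, not Terrier's tokeniser: the class LetterOrDigitDemo and its tokens helper are hypothetical names introduced for illustration, and only Character.isLetterOrDigit(char) is taken from the description above.

```java
import java.util.ArrayList;
import java.util.List;

public class LetterOrDigitDemo {

    // Hypothetical helper, not part of Terrier: splits text into runs of
    // characters for which Character.isLetterOrDigit() returns true.
    static List<String> tokens(String text) {
        List<String> out = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (Character.isLetterOrDigit(c)) {
                current.append(c);          // accepted: extend the current token
            } else if (current.length() > 0) {
                out.add(current.toString()); // rejected: close the current token
                current.setLength(0);
            }
        }
        if (current.length() > 0) {
            out.add(current.toString());     // flush the trailing token
        }
        return out;
    }

    public static void main(String[] args) {
        // U+00E9 is an accented Latin letter; U+6771 and U+4EAC are CJK ideographs.
        // Unicode escapes keep the source file itself ASCII-only.
        System.out.println(tokens("caf\u00e9 \u6771\u4eac 2008!"));
    }
}
```

A check restricted to the ASCII ranges a-z, A-Z and 0-9 would cut the first token short at the accented letter and drop the ideographs entirely, which is the limitation this class is described as lifting.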
Properties: see TRECCollection.
Constructor Summary | |
---|---|
TRECUTFCollection() | Instantiate a new TRECUTFCollection. |
TRECUTFCollection(java.io.InputStream input) | Instantiate a new TRECUTFCollection. |
TRECUTFCollection(java.lang.String CollectionSpecFilename, java.lang.String TagSet, java.lang.String BlacklistSpecFilename, java.lang.String docPointersFilename) | Instantiate a new TRECUTFCollection. |
Method Summary | |
---|---|
Document | getDocument() Overrides the getDocument() method in TRECCollection so that a UTF-compatible Document object is returned. |
Document | getDocument(TagSet _tags, TagSet _exact, TagSet _fields) A TREC-specific getDocument method that allows the tags to be specified for each document. |
Methods inherited from class uk.ac.gla.terrier.indexing.TRECCollection |
---|
close, endOfCollection, getDocid, getDocumentString, nextDocument, reset |
Methods inherited from class java.lang.Object |
---|
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public TRECUTFCollection()
public TRECUTFCollection(java.io.InputStream input)
public TRECUTFCollection(java.lang.String CollectionSpecFilename, java.lang.String TagSet, java.lang.String BlacklistSpecFilename, java.lang.String docPointersFilename)
Method Detail |
---|
public Document getDocument()
Specified by: getDocument in interface Collection
Overrides: getDocument in class TRECCollection
public Document getDocument(TagSet _tags, TagSet _exact, TagSet _fields)
Overrides: getDocument in class TRECCollection
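A caller would presumably reach getDocument() through the iteration methods inherited from TRECCollection (nextDocument, close). The sketch below only illustrates that calling pattern and does not use Terrier itself. The nested Collection and Document interfaces are simplified stand-ins whose signatures are assumptions (in particular, that nextDocument() returns false once the collection is exhausted). ListCollection, text() and countDocuments are hypothetical names introduced so the example can run on its own.

```java
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;

public class CollectionLoopSketch {

    // Stand-in for a document; text() is a hypothetical accessor.
    interface Document {
        String text();
    }

    // Stand-in for the collection contract. Method names follow the page
    // above; the signatures are ASSUMED, not copied from Terrier.
    interface Collection {
        boolean nextDocument();  // assumed: advance; false when exhausted
        Document getDocument();  // the document advanced to by nextDocument()
        void close();
    }

    // In-memory stand-in so the loop can be exercised without any files.
    static class ListCollection implements Collection {
        private final Iterator<String> it;
        private String current;
        boolean closed = false;

        ListCollection(List<String> docs) {
            this.it = docs.iterator();
        }

        public boolean nextDocument() {
            if (!it.hasNext()) {
                return false;
            }
            current = it.next();
            return true;
        }

        public Document getDocument() {
            final String text = current;
            return new Document() {
                public String text() {
                    return text;
                }
            };
        }

        public void close() {
            closed = true;
        }
    }

    // Walks the collection once and always closes it afterwards.
    static int countDocuments(Collection collection) {
        int count = 0;
        try {
            while (collection.nextDocument()) {
                Document doc = collection.getDocument();
                if (doc != null) {
                    count++;
                }
            }
        } finally {
            collection.close();
        }
        return count;
    }

    public static void main(String[] args) {
        ListCollection c = new ListCollection(Arrays.asList("doc one", "doc two"));
        System.out.println(countDocuments(c) + " documents read");
    }
}
```

With the real classes, the stand-in would be replaced by a TRECUTFCollection built with one of the constructors listed above; the argument values and the exact return types should be checked against the Terrier sources.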