|
Terrier IR Platform 2.2.1 |
|||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||
java.lang.Objectuk.ac.gla.terrier.indexing.TRECCollection
uk.ac.gla.terrier.indexing.TRECUTFCollection
public class TRECUTFCollection
Extends TRECCollection to provide support for indexing TREC collection in non-ASCII character sets. To this end, the TRECDocument has been extended so that it accepts any characters said to be Character.isLetterOrDigit().
Properties
TRECCollection| Constructor Summary | |
|---|---|
TRECUTFCollection()
Instantiate a new TRECUTFCollection. |
|
TRECUTFCollection(java.io.InputStream input)
Instantiate a new TRECUTFCollection. |
|
TRECUTFCollection(java.lang.String CollectionSpecFilename,
java.lang.String TagSet,
java.lang.String BlacklistSpecFilename,
java.lang.String docPointersFilename)
Instantiate a new TRECUTFCollection. |
|
| Method Summary | |
|---|---|
Document |
getDocument()
Overrides the getDocument() method in TRECCollection, so a UTF compatible Document object is returned. |
Document |
getDocument(TagSet _tags,
TagSet _exact,
TagSet _fields)
A TREC-specific getDocument method, that allows the tags to be specified for each document. |
| Methods inherited from class uk.ac.gla.terrier.indexing.TRECCollection |
|---|
close, endOfCollection, getDocid, getDocumentString, nextDocument, reset |
| Methods inherited from class java.lang.Object |
|---|
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Constructor Detail |
|---|
public TRECUTFCollection()
public TRECUTFCollection(java.io.InputStream input)
public TRECUTFCollection(java.lang.String CollectionSpecFilename,
java.lang.String TagSet,
java.lang.String BlacklistSpecFilename,
java.lang.String docPointersFilename)
| Method Detail |
|---|
public Document getDocument()
getDocument in interface CollectiongetDocument in class TRECCollection
public Document getDocument(TagSet _tags,
TagSet _exact,
TagSet _fields)
getDocument in class TRECCollection
|
Terrier IR Platform 2.2.1 |
|||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||