Package | Description |
---|---|
org.terrier.indexing | Provides classes and interfaces related to the indexing of documents. |

Modifier and Type | Class and Description |
---|---|
class | TRECCollection: Models a TREC test collection by implementing the interfaces Collection and DocumentExtractor. |
class | TRECUTFCollection: Deprecated. |
class | TRECWebCollection: Version of TRECCollection which can parse standard form DOCHDR tags in TREC Web corpora. |
class | WARC018Collection: This object is used to parse WARC format web crawls, version 0.18. |
class | WARC09Collection: This object is used to parse WARC format web crawls, version 0.9. |
class | WARC10Collection: This object is used to parse WARC format web crawls, version 0.10. |

Terrier Information Retrieval Platform 4.1. Copyright © 2004-2015, University of Glasgow