Uses of Interface
org.terrier.indexing.Collection
-
Packages that use Collection Package Description org.terrier.applications Provides application-level code that use the Terrier platform to perform indexing and retrieval from either standard test collections, interactive querying of a indexed collection.org.terrier.indexing Provides classes and interfaces related to the indexing of documents.org.terrier.structures.indexing Provides the classes used for creating the data structures of the Terrier platform.org.terrier.structures.indexing.classical Provides functionality for creating on-disk indices via indexer classes.org.terrier.structures.indexing.singlepass Provides implementation of the structures needed for performing a single pass indexing -
-
Uses of Collection in org.terrier.applications
Fields in org.terrier.applications declared as Collection Modifier and Type Field Description protected Collection
TRECIndexing. collectionTREC
The collection to index.Methods in org.terrier.applications that return Collection Modifier and Type Method Description protected Collection
BatchIndexing. loadCollection(java.lang.String collectionSpec)
open a collection when given the collection.spec nameprotected Collection
BatchIndexing. loadCollection(java.util.List<java.lang.String> files)
open a collection when given a list of filesConstructors in org.terrier.applications with parameters of type Collection Constructor Description TRECIndexing(java.lang.String _path, java.lang.String _prefix, Collection c)
A constructor that initialised the data structures to use for indexing.TRECIndexingSinglePass(java.lang.String _path, java.lang.String _prefix, Collection c)
-
Uses of Collection in org.terrier.indexing
Classes in org.terrier.indexing that implement Collection Modifier and Type Class Description class
CollectionDocumentList
class
MultiDocumentFileCollection
class
SimpleFileCollection
Implements a collection that can read arbitrary files on disk.class
SimpleMedlineXMLCollection
Initial implementation of a class that generates a Collection with Documents from a series of XML files in the Medline format.class
SimpleXMLCollection
Initial implementation of a class that generates a Collection with Documents from a series of XML files.class
TRECCollection
Models a TREC test collection by implementing the interfaces Collection and DocumentExtractor.class
TRECUTFCollection
Deprecated.class
TRECWebCollection
Version of TRECCollection which can parse standard form DOCHDR tags in TREC Web corpoa.class
TwitterJSONCollection
This class represents a collection of tweets stored in JSON format.class
WARC018Collection
This object is used to parse WARC format web crawls, 0.18.class
WARC09Collection
This object is used to parse WARC format web crawls, version 0.9.class
WARC10Collection
This object is used to parse WARC format web crawls, version 0.10.Methods in org.terrier.indexing that return Collection Modifier and Type Method Description static Collection
CollectionFactory. loadCollection(java.lang.String CollectionName)
Load collection(s) of the specified name.static Collection
CollectionFactory. loadCollection(java.lang.String CollectionName, java.lang.Class<?>[] contructorTypes, java.lang.Object[] constructorValues)
Load collection(s) of the specified name.static Collection
CollectionFactory. loadCollections()
Use the default property trec.collection.class, or it's default value TRECCollectionstatic Collection
CollectionFactory. loadCollections(java.lang.String[] collNames)
Load collection(s) of the specified name.static Collection
CollectionFactory. loadCollections(java.lang.String[] collNames, java.lang.Class<?>[] contructorTypes, java.lang.Object[] constructorValues)
Load collection(s) of the specified name.static Collection
IndexTestUtils. makeCollection(java.lang.String[] docnos, java.lang.String[] documents)
-
Uses of Collection in org.terrier.structures.indexing
Methods in org.terrier.structures.indexing with parameters of type Collection Modifier and Type Method Description abstract void
Indexer. createDirectIndex(Collection[] collections)
An abstract method for creating the direct index, the document index and the lexicon for the given collections.void
Indexer. index(Collection[] collections)
Creates the data structures for a set of collections. -
Uses of Collection in org.terrier.structures.indexing.classical
Methods in org.terrier.structures.indexing.classical with parameters of type Collection Modifier and Type Method Description void
BasicIndexer. createDirectIndex(Collection[] collections)
Creates the direct index, the document index and the lexicon.void
BlockIndexer. createDirectIndex(Collection[] collections)
For the given collection, it iterates through the documents and creates the direct index, document index and lexicon, using information about blocks and possibly fields. -
Uses of Collection in org.terrier.structures.indexing.singlepass
Methods in org.terrier.structures.indexing.singlepass with parameters of type Collection Modifier and Type Method Description void
BasicSinglePassIndexer. createDirectIndex(Collection[] collections)
void
BasicSinglePassIndexer. createInvertedIndex(Collection[] collections)
Builds the inverted file and lexicon file for the given collections Loops through each document in each of the collections, extracting terms and pushing these through the Term Pipeline (eg stemming, stopping, lowercase).void
ExtensibleSinglePassIndexer. createInvertedIndex(Collection[] collections)
Builds the inverted file and lexicon file for the given collections Loops through each document in each of the collections, extracting terms and pushing these through the Term Pipeline (e.g.
-