Uses of Interface
org.terrier.indexing.Collection
Packages that use Collection

    org.terrier.applications
        Provides application-level code that uses the Terrier platform to perform indexing and retrieval, either from standard test collections or through interactive querying of an indexed collection.
    org.terrier.indexing
        Provides classes and interfaces related to the indexing of documents.
    org.terrier.structures.indexing
        Provides the classes used for creating the data structures of the Terrier platform.
    org.terrier.structures.indexing.classical
        Provides functionality for creating on-disk indices via indexer classes.
    org.terrier.structures.indexing.singlepass
        Provides implementations of the structures needed for performing single-pass indexing.
Uses of Collection in org.terrier.applications
Fields in org.terrier.applications declared as Collection

    protected Collection TRECIndexing.collectionTREC
        The collection to index.

Methods in org.terrier.applications that return Collection

    protected Collection BatchIndexing.loadCollection(java.lang.String collectionSpec)
        Opens a collection when given the collection.spec name.
    protected Collection BatchIndexing.loadCollection(java.util.List<java.lang.String> files)
        Opens a collection when given a list of files.

Constructors in org.terrier.applications with parameters of type Collection

    TRECIndexing(java.lang.String _path, java.lang.String _prefix, Collection c)
        A constructor that initialises the data structures to use for indexing.
    TRECIndexingSinglePass(java.lang.String _path, java.lang.String _prefix, Collection c)
Uses of Collection in org.terrier.indexing
Classes in org.terrier.indexing that implement Collection

    class CollectionDocumentList
    class MultiDocumentFileCollection
    class SimpleFileCollection
        Implements a collection that can read arbitrary files on disk.
    class SimpleMedlineXMLCollection
        Initial implementation of a class that generates a Collection with Documents from a series of XML files in the Medline format.
    class SimpleXMLCollection
        Initial implementation of a class that generates a Collection with Documents from a series of XML files.
    class TRECCollection
        Models a TREC test collection by implementing the interfaces Collection and DocumentExtractor.
    class TRECUTFCollection
        Deprecated.
    class TRECWebCollection
        Version of TRECCollection which can parse standard form DOCHDR tags in TREC Web corpora.
    class TwitterJSONCollection
        This class represents a collection of tweets stored in JSON format.
    class WARC018Collection
        This object is used to parse WARC format web crawls, version 0.18.
    class WARC09Collection
        This object is used to parse WARC format web crawls, version 0.9.
    class WARC10Collection
        This object is used to parse WARC format web crawls, version 0.10.

Methods in org.terrier.indexing that return Collection

    static Collection CollectionFactory.loadCollection(java.lang.String CollectionName)
        Load collection(s) of the specified name.
    static Collection CollectionFactory.loadCollection(java.lang.String CollectionName, java.lang.Class<?>[] contructorTypes, java.lang.Object[] constructorValues)
        Load collection(s) of the specified name.
    static Collection CollectionFactory.loadCollections()
        Use the default property trec.collection.class, or its default value TRECCollection.
    static Collection CollectionFactory.loadCollections(java.lang.String[] collNames)
        Load collection(s) of the specified name.
    static Collection CollectionFactory.loadCollections(java.lang.String[] collNames, java.lang.Class<?>[] contructorTypes, java.lang.Object[] constructorValues)
        Load collection(s) of the specified name.
    static Collection IndexTestUtils.makeCollection(java.lang.String[] docnos, java.lang.String[] documents)
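The loadCollection overload that takes constructor types and constructor values suggests that a class is instantiated by name through reflection. The following is a minimal sketch of that general technique using only the standard java.lang.reflect API. It is not Terrier's implementation: DocSource, ArrayDocSource and ReflectiveLoader are hypothetical stand-ins introduced for illustration, so that the example runs without Terrier on the classpath.

```java
import java.lang.reflect.Constructor;
import java.util.Arrays;
import java.util.List;

// Hypothetical stand-in for a collection type (NOT Terrier's Collection interface).
interface DocSource {
    List<String> docnos();
}

// Hypothetical implementation exposing a public (String[]) constructor.
class ArrayDocSource implements DocSource {
    private final String[] docnos;

    public ArrayDocSource(String[] docnos) {
        this.docnos = docnos;
    }

    public List<String> docnos() {
        return Arrays.asList(docnos);
    }
}

class ReflectiveLoader {
    // Instantiates className through the public constructor whose parameter
    // types match constructorTypes, passing constructorValues as arguments.
    static DocSource load(String className, Class<?>[] constructorTypes, Object[] constructorValues)
            throws ReflectiveOperationException {
        Class<? extends DocSource> cls = Class.forName(className).asSubclass(DocSource.class);
        Constructor<? extends DocSource> ctor = cls.getConstructor(constructorTypes);
        return ctor.newInstance(constructorValues);
    }

    public static void main(String[] args) throws Exception {
        DocSource src = load(
                ArrayDocSource.class.getName(),
                new Class<?>[] { String[].class },
                new Object[] { new String[] { "doc1", "doc2" } });
        System.out.println(src.docnos()); // prints [doc1, doc2]
    }
}
```

Passing the parameter types explicitly, rather than deriving them from the runtime classes of the values, lets the caller select a constructor declared against an interface or superclass type.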
Uses of Collection in org.terrier.structures.indexing
Methods in org.terrier.structures.indexing with parameters of type Collection

    abstract void Indexer.createDirectIndex(Collection[] collections)
        An abstract method for creating the direct index, the document index and the lexicon for the given collections.
    void Indexer.index(Collection[] collections)
        Creates the data structures for a set of collections.
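To make the three structures named above concrete, here is a small in-memory sketch of what a direct index, a document index and a lexicon could hold. It is an illustration under simplifying assumptions, not Terrier's on-disk format or API: each collection is modelled as a plain list of document texts, tokens are split on whitespace, and the class DirectIndexSketch and its field names are hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical sketch: each "collection" is simply a list of document texts.
class DirectIndexSketch {
    // Direct index: for each document (in docid order), term -> frequency.
    final List<Map<String, Integer>> directIndex = new ArrayList<>();
    // Document index: for each document, its length in tokens.
    final List<Integer> documentLengths = new ArrayList<>();
    // Lexicon: term -> number of documents containing the term.
    final Map<String, Integer> lexicon = new TreeMap<>();

    // Walks every document of every collection and fills the three structures.
    void createDirectIndex(List<List<String>> collections) {
        for (List<String> collection : collections) {
            for (String text : collection) {
                Map<String, Integer> termFrequencies = new TreeMap<>();
                int length = 0;
                for (String token : text.toLowerCase(Locale.ROOT).split("\\s+")) {
                    if (token.isEmpty()) {
                        continue; // skip the empty token produced by leading whitespace
                    }
                    termFrequencies.merge(token, 1, Integer::sum);
                    length++;
                }
                // Each distinct term in this document adds one to its document frequency.
                for (String term : termFrequencies.keySet()) {
                    lexicon.merge(term, 1, Integer::sum);
                }
                directIndex.add(termFrequencies);
                documentLengths.add(length);
            }
        }
    }

    public static void main(String[] args) {
        DirectIndexSketch sketch = new DirectIndexSketch();
        sketch.createDirectIndex(Arrays.asList(
                Arrays.asList("a b a", "b c"),
                Arrays.asList("c c d")));
        System.out.println(sketch.directIndex);     // [{a=2, b=1}, {b=1, c=1}, {c=2, d=1}]
        System.out.println(sketch.documentLengths); // [3, 2, 3]
        System.out.println(sketch.lexicon);         // {a=1, b=2, c=2, d=1}
    }
}
```

Document identifiers are assigned implicitly by position, so documents from the second collection continue the numbering of the first.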
Uses of Collection in org.terrier.structures.indexing.classical
Methods in org.terrier.structures.indexing.classical with parameters of type Collection

    void BasicIndexer.createDirectIndex(Collection[] collections)
        Creates the direct index, the document index and the lexicon.
    void BlockIndexer.createDirectIndex(Collection[] collections)
        For the given collection, it iterates through the documents and creates the direct index, document index and lexicon, using information about blocks and possibly fields.
Uses of Collection in org.terrier.structures.indexing.singlepass
Methods in org.terrier.structures.indexing.singlepass with parameters of type Collection

    void BasicSinglePassIndexer.createDirectIndex(Collection[] collections)
    void BasicSinglePassIndexer.createInvertedIndex(Collection[] collections)
        Builds the inverted file and lexicon file for the given collections. Loops through each document in each of the collections, extracting terms and pushing these through the Term Pipeline (e.g. stemming, stopping, lowercase).
    void ExtensibleSinglePassIndexer.createInvertedIndex(Collection[] collections)
        Builds the inverted file and lexicon file for the given collections. Loops through each document in each of the collections, extracting terms and pushing these through the Term Pipeline (e.g.
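The createInvertedIndex descriptions outline a loop over every document of every collection, with each extracted term passed through a term pipeline before it is indexed. A rough sketch of that idea follows. It is not Terrier's code: InvertedIndexSketch, the stopword list and the toy suffix-stripping "stemmer" are all assumptions made for illustration, collections are modelled as lists of document texts, and the index is kept in memory rather than written to disk.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;
import java.util.function.UnaryOperator;

class InvertedIndexSketch {
    // Hypothetical stopword list, for illustration only.
    static final Set<String> STOPWORDS = new HashSet<>(Arrays.asList("the", "a", "of"));

    // Pipeline stages applied in order; a stage returns null to drop the term.
    static final List<UnaryOperator<String>> PIPELINE = Arrays.asList(
            t -> t.toLowerCase(Locale.ROOT),                     // lowercase
            t -> STOPWORDS.contains(t) ? null : t,               // stopping
            t -> t.length() > 3 && t.endsWith("s")               // toy stemmer: strips a
                    ? t.substring(0, t.length() - 1) : t);       // trailing "s" (not Porter)

    // Runs one raw term through every stage; returns null if any stage drops it.
    static String process(String term) {
        for (UnaryOperator<String> stage : PIPELINE) {
            term = stage.apply(term);
            if (term == null) {
                return null;
            }
        }
        return term;
    }

    // Builds term -> ascending list of docids, numbering documents across collections.
    static Map<String, List<Integer>> createInvertedIndex(List<List<String>> collections) {
        Map<String, List<Integer>> postings = new TreeMap<>();
        int docid = 0;
        for (List<String> collection : collections) {
            for (String text : collection) {
                for (String raw : text.split("\\W+")) {
                    if (raw.isEmpty()) {
                        continue;
                    }
                    String term = process(raw);
                    if (term == null) {
                        continue;
                    }
                    List<Integer> list = postings.computeIfAbsent(term, k -> new ArrayList<>());
                    // Record each document at most once per term.
                    if (list.isEmpty() || list.get(list.size() - 1) != docid) {
                        list.add(docid);
                    }
                }
                docid++;
            }
        }
        return postings;
    }

    public static void main(String[] args) {
        System.out.println(createInvertedIndex(Arrays.asList(
                Arrays.asList("The cats of Rome", "a cat sleeps"),
                Arrays.asList("Rome"))));
        // prints {cat=[0, 1], rome=[0, 2], sleep=[1]}
    }
}
```

Modelling each stage as a function that may return null keeps the stages independent, so they can be reordered or replaced without touching the indexing loop.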