org.terrier.indexing
Interfaces
Collection
Document
DocumentExtractor
Tokenizer
Classes
CollectionFactory
FileDocument
MSExcelDocument
MSPowerPointDocument
MSWordDocument
MultiDocumentFileCollection
PDFDocument
POIDocument
SimpleFileCollection
SimpleMedlineXMLCollection
SimpleXMLCollection
TaggedDocument
TRECCollection
TRECFullTokenizer
TRECUTFCollection
TRECWebCollection
TwitterJSONCollection
TwitterJSONDocument
WARC018Collection
WARC09Collection
WARC10Collection