Third-Party Libraries used by Terrier

 

The Terrier distribution comes with 7 additional jar files in the lib folder. If you have downloaded the source-only distribution, then you'll need to obtain all but the last one. The licences for the included software can be found in the licences/ directory.

antlr.jarhttp://www.antlr.org/Antlr is a compiler-compiler, used for generating the querying language parser.
gnu-trove-1.0.2.jarhttp://trove4j.sourceforge.net/GNU Trove is a LGPLd list and set classes implementations that are faster than the standard Java classes.
log4j-1.2.9.jarhttp://logging.apache.org/log4j/docs/log4j is a Logging package from the Apache software consortium. Terrier does NOT use log4j currently, but it is required by PDFBox.
PDFBox-0.6.7a.jarhttp://www.pdfbox.org/pdfbox is a PDF parsing library written wholly in Java.
poi-2.5.1-final-20040804.jarhttp://jakarta.apache.org/poipoi is a Java package for reading and writing Microsoft OLE compound documents, eg Microsoft Word, Excel and Powerpoint.
tm-extractors-0.4.jarhttp://www.textmining.org/tm-extractors wraps the Word extraction from POI in a nice API.
winfileopen.jar  -  winfileopen.jar contains some Terrier code for opening files in Windows from Desktop.