Third-Party Libraries used by Terrier


The Terrier distribution comes with 7 additional jar files in the lib folder. If you have downloaded the source-only distribution, then you'll need to obtain all but the last one. The licences for the included software can be found in the licences/ directory.

antlr.jar is a compiler-compiler, used for generating the query language parser.
gnu-trove-1.0.2.jar is Trove, an LGPL-licensed collection of list and set class implementations that are faster than the standard Java classes.
log4j-1.2.9.jar is a logging package from the Apache Software Foundation. Terrier does NOT currently use log4j, but it is required by PDFBox.
PDFBox-0.6.7a.jar is a PDF parsing library written wholly in Java.
poi-2.5.1-final-20040804.jar is a Java package for reading and writing Microsoft OLE compound documents, e.g. Microsoft Word, Excel and PowerPoint.
tm-extractors-0.4.jar wraps the Word extraction from POI in a nice API.
winfileopen.jar contains some Terrier code for opening files in Windows from Desktop.
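
If you are working from the source-only distribution, a quick check of the lib folder can show which of the jars listed above still need to be obtained. The following is only a minimal sketch, not part of Terrier: the class name LibCheck and the method missingJars are hypothetical, the jar file names are copied from the list above, and winfileopen.jar is left out on the assumption that it does not need to be obtained separately.

```java
import java.io.File;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Terrier: reports which third-party
// jars from the list above are absent from a given lib folder.
public class LibCheck {

    // File names copied from the list above. winfileopen.jar is omitted
    // (assumption: it does not have to be obtained separately).
    static final String[] REQUIRED = {
        "antlr.jar",
        "gnu-trove-1.0.2.jar",
        "log4j-1.2.9.jar",
        "PDFBox-0.6.7a.jar",
        "poi-2.5.1-final-20040804.jar",
        "tm-extractors-0.4.jar"
    };

    // Returns the names of the required jars that are not regular files in libDir.
    static List<String> missingJars(File libDir) {
        List<String> missing = new ArrayList<>();
        for (String name : REQUIRED) {
            if (!new File(libDir, name).isFile()) {
                missing.add(name);
            }
        }
        return missing;
    }

    public static void main(String[] args) {
        // Defaults to "lib" relative to the current directory.
        File libDir = new File(args.length > 0 ? args[0] : "lib");
        List<String> missing = missingJars(libDir);
        if (missing.isEmpty()) {
            System.out.println("All third-party jars found in " + libDir);
        } else {
            for (String name : missing) {
                System.out.println("Missing: " + name);
            }
        }
    }
}
```

Run it from the top of the Terrier directory, or pass the path to the lib folder as the first argument.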