[TR-28] Index WARC collections Created: 01/May/09 Updated: 08/Mar/10 Resolved: 08/Mar/10
|Reporter:||Iadh Ounis||Assignee:||Craig Macdonald|
The documents in the new TREC ClueWeb09 collection are formatted in WARC.
It will be good if Terrier provides support for this format.
|Comment by Carlos Lorenzetti [ 15/Oct/09 ]|
Hi, I'm trying to index the UK2007 Spam collection that is in WARC format.
|Comment by Craig Macdonald [ 08/Mar/10 ]|