[TR-28] Index WARC collections Created: 01/May/09 Updated: 08/Mar/10 Resolved: 08/Mar/10 |
Status: | Resolved |
Project: | Terrier Core |
Component/s: | None |
Affects Version/s: | 2.2.1 |
Fix Version/s: | 3.0 |
Type: | New Feature | Priority: | Minor |
Reporter: | Iadh Ounis | Assignee: | Craig Macdonald |
Resolution: | Duplicate |
Labels: | None |
Attachments: |
Description |
The documents in the new TREC ClueWeb09 collection are stored in the WARC format. It would be good if Terrier provided support for indexing this format. |
Comments |
Comment by Carlos Lorenzetti [ 15/Oct/09 ] |
Hi, I'm trying to index the UK2007 Spam collection that is in WARC format. Thank you. |
Comment by Craig Macdonald [ 08/Mar/10 ] |
Duplicate of |