Details
Description
TRECCollection should parse DOCHDR tags, including URLs, should they exist.
TREC Web test collections (WT2G etc.) have DOCHDR tags, which include the URL of each document. We should parse these out. However, TRECCollection should not fail if the DOCHDR tag does not exist.
Issue Links
- is related to TR-140 Indexing support for query-biased summarisation (Resolved)
I decided this was easier to do with a subclass called TRECWebCollection, which knows how to parse the DOCHDR tags of various types of test collection.
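
For illustration only, here is a minimal sketch of the kind of DOCHDR handling described above. It is not the actual TRECWebCollection code. The class name DocHdrSketch and the method extractUrl are hypothetical, and the sketch assumes the WT2G-style layout in which the URL is the first whitespace-delimited token inside the DOCHDR block. Documents without a DOCHDR tag simply yield an empty result rather than an error.

```java
import java.util.Optional;

// Hypothetical helper, not part of Terrier: pulls the URL out of a DOCHDR block.
final class DocHdrSketch {
    private static final String OPEN = "<DOCHDR>";
    private static final String CLOSE = "</DOCHDR>";

    private DocHdrSketch() {}

    /**
     * Returns the URL found in the DOCHDR block of a raw document,
     * or Optional.empty() if the block is missing, unclosed, or blank.
     * Assumption: the URL is the first whitespace-delimited token in the block.
     */
    static Optional<String> extractUrl(String rawDocument) {
        int start = rawDocument.indexOf(OPEN);
        if (start < 0) {
            return Optional.empty(); // no DOCHDR tag: do not fail
        }
        int end = rawDocument.indexOf(CLOSE, start + OPEN.length());
        if (end < 0) {
            return Optional.empty(); // unclosed tag: treat as absent
        }
        String header = rawDocument.substring(start + OPEN.length(), end).trim();
        if (header.isEmpty()) {
            return Optional.empty(); // empty block carries no URL
        }
        // After trim() the first token is non-empty; keep only that token.
        return Optional.of(header.split("\\s+", 2)[0]);
    }
}
```

A caller could record the URL as document metadata when extractUrl returns a value and otherwise carry on indexing the document as before.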