Terrier Core

DOCNOs must be in lexicographical order when indexing TREC collections

Details

  • Type: Bug Bug
  • Status: Resolved Resolved
  • Priority: Major Major
  • Resolution: Duplicate
  • Affects Version/s: 2.1
  • Fix Version/s: None
  • Component/s: None
  • Description:
    Hide
    This might be a bug in case one wants to merge distributed indexes from the same or from different collections. Maybe we need at least a warning when the inconsistency of DOCNO is detected. Supposing that each sub-collection has the lexicographical order for the DOCNO, we might only check the exact order of merging 2 or more files. This also prevent a bug when using the Merge method class only.

    We suggest to create a file with <docno, docid> information.
    Show
    This might be a bug in case one wants to merge distributed indexes from the same or from different collections. Maybe we need at least a warning when the inconsistency of DOCNO is detected. Supposing that each sub-collection has the lexicographical order for the DOCNO, we might only check the exact order of merging 2 or more files. This also prevent a bug when using the Merge method class only. We suggest to create a file with <docno, docid> information.

Issue Links

Activity

There are no comments yet on this issue.

People

Dates

  • Created:
    17/Feb/09 1:11 PM
    Updated:
    13/May/09 10:57 PM
    Resolved:
    29/Apr/09 3:39 PM