Uploaded image for project: 'Terrier Core'
  1. Terrier Core
  2. TR-17

DOCNOs must be in lexicographical order when indexing TREC collections

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Duplicate
    • Affects Version/s: 2.1
    • Fix Version/s: None
    • Component/s: None
    • Labels:
      None

      Description

      This might be a bug in case one wants to merge distributed indexes from the same or from different collections. Maybe we need at least a warning when the inconsistency of DOCNO is detected. Supposing that each sub-collection has the lexicographical order for the DOCNO, we might only check the exact order of merging 2 or more files. This also prevent a bug when using the Merge method class only.

      We suggest to create a file with <docno, docid> information.

        Attachments

          Issue Links

            Activity

            gianni_amati Gianni Amati created issue -
            craigm Craig Macdonald made changes -
            Field Original Value New Value
            Link This issue is blocked by TR-14 [ TR-14 ]
            craigm Craig Macdonald made changes -
            Workflow jira [ 10025 ] Terrier Open Source [ 10039 ]
            craigm Craig Macdonald made changes -
            Link This issue is duplicated by TR-27 [ TR-27 ]
            craigm Craig Macdonald made changes -
            Status Open [ 1 ] Resolved [ 5 ]
            Resolution Duplicate [ 3 ]

              People

              • Assignee:
                craigm Craig Macdonald
                Reporter:
                gianni_amati Gianni Amati
              • Watchers:
                1 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: