Uploaded image for project: 'Terrier Core'
  1. Terrier Core
  2. TR-17

DOCNOs must be in lexicographical order when indexing TREC collections


    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Duplicate
    • Affects Version/s: 2.1
    • Fix Version/s: None
    • Component/s: None
    • Labels:


      This might be a bug in case one wants to merge distributed indexes from the same or from different collections. Maybe we need at least a warning when the inconsistency of DOCNO is detected. Supposing that each sub-collection has the lexicographical order for the DOCNO, we might only check the exact order of merging 2 or more files. This also prevent a bug when using the Merge method class only.

      We suggest to create a file with <docno, docid> information.


          Issue Links


            gianni_amati Gianni Amati created issue -
            craigm Craig Macdonald made changes -
            Field Original Value New Value
            Link This issue is blocked by TR-14 [ TR-14 ]
            craigm Craig Macdonald made changes -
            Workflow jira [ 10025 ] Terrier Open Source [ 10039 ]
            craigm Craig Macdonald made changes -
            Link This issue is duplicated by TR-27 [ TR-27 ]
            craigm Craig Macdonald made changes -
            Status Open [ 1 ] Resolved [ 5 ]
            Resolution Duplicate [ 3 ]


              • Assignee:
                craigm Craig Macdonald
                gianni_amati Gianni Amati
              • Watchers:
                1 Start watching this issue


                • Created: