Details
-
Type:
Bug
-
Status: Resolved
-
Priority:
Critical
-
Resolution: Fixed
-
Affects Version/s: 3.6
-
Fix Version/s: 4.0
-
Component/s: None
-
Labels:None
Description
Recent indices created by MapReduce have docid alignment problems:
Posting id 80368334 is too big for term 0 term0 Nt=34566754 TF=587985939 @{0 0 0} TFf=5215159,4448387,578322393
Posting id 79259555 is too big for term d term21430718 Nt=22088033 TF=65422749 @{3 0 0} TFf=679062,312433,64431254
This is /thought/ to be unrelated to compression changes.
Posting id 80368334 is too big for term 0 term0 Nt=34566754 TF=587985939 @{0 0 0} TFf=5215159,4448387,578322393
Posting id 79259555 is too big for term d term21430718 Nt=22088033 TF=65422749 @{3 0 0} TFf=679062,312433,64431254
This is /thought/ to be unrelated to compression changes.
Attachments
Activity
Field | Original Value | New Value |
---|---|---|
Attachment | PrintIndexTerm.java [ 10415 ] |
Attachment | TREC-388.v3.6.patch [ 10418 ] |
Summary | Docid alignment is broken for MapReduce indexing | Docid alignment is broken for MapReduce indexing when map tasks are repeated |
Assignee | Richard McCreadie [ richardm ] | Craig Macdonald [ craigm ] |
Attachment | TREC-388.v4.patch [ 10419 ] |
Status | Open [ 1 ] | Resolved [ 5 ] |
Resolution | Fixed [ 1 ] |
This program can be used to test an index, used as follows: