[TR-78] BitInputFormat: some minor changes Created: 23/Nov/09  Updated: 05/Mar/10  Resolved: 23/Nov/09

Status: Resolved
Project: Terrier Core
Component/s: .structures
Affects Version/s: 3.0
Fix Version/s: 3.0

Type: Bug Priority: Trivial
Reporter: Craig Macdonald Assignee: Craig Macdonald
Resolution: Fixed  
Labels: None

Two corner cases were fixed:
1. End splits of less than one byte were not processed.
2. Empty entries were not processed correctly.

This code is used by Inv2DirectMultiReduce. However, (1) is an unlikely case: our inverted files are so large that the chance of the final split being less than one byte is very small. (2) cannot happen for an inverted file, since terms always have entries.
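
For illustration only, the two corner cases can be reproduced with a minimal sketch. This is not the BitInputFormat code: the class BitSplitSketch, its Split type, and the methods makeSplits and countEntries are hypothetical names, and the layout (entries addressed by bit offsets, a fixed number of entries per split) is an assumption. It shows how a final split can span zero whole bytes while still holding entries, and how an empty entry still counts as an entry.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical sketch, not Terrier code: splitting bit-addressed entries. */
public class BitSplitSketch {

    /** Consecutive entries [firstEntry, lastEntry) and the byte range they span. */
    static final class Split {
        final int firstEntry;
        final int lastEntry;
        final long startByte;
        final long endByte;

        Split(int firstEntry, int lastEntry, long startByte, long endByte) {
            this.firstEntry = firstEntry;
            this.lastEntry = lastEntry;
            this.startByte = startByte;
            this.endByte = endByte;
        }

        int entryCount() { return lastEntry - firstEntry; }

        long byteLength() { return endByte - startByte; }
    }

    /**
     * Assumed layout: entry i occupies bits [bitOffsets[i], bitOffsets[i + 1]),
     * so bitOffsets has one more element than there are entries. An empty
     * entry has equal start and end offsets.
     */
    static List<Split> makeSplits(long[] bitOffsets, int entriesPerSplit) {
        List<Split> splits = new ArrayList<>();
        int entryCount = bitOffsets.length - 1;
        for (int first = 0; first < entryCount; first += entriesPerSplit) {
            int last = Math.min(first + entriesPerSplit, entryCount);
            long startByte = bitOffsets[first] / 8;
            long endByte = bitOffsets[last] / 8;
            // Keep the split whenever it holds entries. Filtering on
            // byteLength() > 0 would drop a final split whose entries all
            // start and end within the same byte (corner case 1).
            splits.add(new Split(first, last, startByte, endByte));
        }
        return splits;
    }

    /** Total entries covered by the splits; empty entries are counted too (corner case 2). */
    static int countEntries(List<Split> splits) {
        int total = 0;
        for (Split s : splits) {
            total += s.entryCount();
        }
        return total;
    }

    public static void main(String[] args) {
        // Five entries; entry 1 is empty (bits 12..12), and the last two
        // entries (bits 33..38) lie entirely inside byte 4.
        long[] bitOffsets = {0, 12, 12, 33, 35, 38};
        List<Split> splits = makeSplits(bitOffsets, 3);
        for (Split s : splits) {
            System.out.println("entries " + s.firstEntry + ".." + s.lastEntry
                    + " bytes " + s.startByte + ".." + s.endByte
                    + " byteLength=" + s.byteLength());
        }
        System.out.println("total entries = " + countEntries(splits));
    }
}
```

In this sketch the second split has a byte length of zero but still contains two entries, so a reader that skips zero-length splits would lose them.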

Comment by Craig Macdonald [ 23/Nov/09 ]

Formatting change

Comment by Craig Macdonald [ 23/Nov/09 ]

Committed to trunk

Comment by Craig Macdonald [ 23/Nov/09 ]

A test case was committed to HadoopShakespeareEndToEndTest to test splitting of a direct index.

Generated at Wed Apr 14 17:41:01 BST 2021 using JIRA 7.1.1#71004-sha1:d6b2c0d9b7051e9fb5e4eb8ce177ca56d91d7bd8.