  1. Terrier Core
  2. TR-78

BitInputFormat: some minor changes

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Trivial
    • Resolution: Fixed
    • Affects Version/s: 3.0
    • Fix Version/s: 3.0
    • Component/s: .structures
    • Labels:
      None

      Description

      Two corner cases:
      1. End splits of less than one byte were not processed.
      2. Empty entries were not processed correctly.

      This code is used by Inv2DirectMultiReduce. However, (1) is unlikely in practice: our inverted files are so large that the chance of the final split being less than 1 byte is very small. (2) cannot happen for an inverted file, because terms always have entries.
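
      To make the two corner cases concrete, here is a minimal, hypothetical sketch in Java. It is not the BitInputFormat code: the class BitSplitSketch, the method computeSplits and the Split type are names invented for illustration, and it assumes the file is described only by per-entry lengths in bits. It groups entries into splits of roughly targetSplitBits each, and always emits the trailing split, even when it is shorter than 8 bits or contains zero-length entries.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical illustration only; not part of Terrier. */
final class BitSplitSketch {

    /** Covers entries [firstEntry, firstEntry + entryCount) and bits [startBit, endBit). */
    static final class Split {
        final int firstEntry;
        final int entryCount;
        final long startBit;
        final long endBit;

        Split(int firstEntry, int entryCount, long startBit, long endBit) {
            this.firstEntry = firstEntry;
            this.entryCount = entryCount;
            this.startBit = startBit;
            this.endBit = endBit;
        }
    }

    /**
     * Groups consecutive entries into splits. A split is closed as soon as it
     * holds at least targetSplitBits bits. Whatever entries remain at the end
     * form a final split, regardless of how few bits they occupy.
     */
    static List<Split> computeSplits(long[] entryLengthsInBits, long targetSplitBits) {
        List<Split> splits = new ArrayList<>();
        int firstEntry = 0;
        long startBit = 0;
        long currentBit = 0;
        for (int i = 0; i < entryLengthsInBits.length; i++) {
            // A zero-length (empty) entry advances the entry count but not the bit offset.
            currentBit += entryLengthsInBits[i];
            if (currentBit - startBit >= targetSplitBits) {
                splits.add(new Split(firstEntry, i - firstEntry + 1, startBit, currentBit));
                firstEntry = i + 1;
                startBit = currentBit;
            }
        }
        // Corner case (1): decide by remaining entries, not by remaining whole bytes,
        // so a tail of fewer than 8 bits is still emitted.
        if (firstEntry < entryLengthsInBits.length) {
            splits.add(new Split(firstEntry, entryLengthsInBits.length - firstEntry,
                                 startBit, currentBit));
        }
        return splits;
    }
}
```

      For example, with entry lengths {20, 0, 12, 5} and a target of 32 bits, the sketch yields one split holding the first three entries (including the empty one) and a final split holding a single 5-bit entry. A check based on the remaining length in whole bytes (5 / 8 == 0) would have dropped that last split.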

        Attachments

          Activity

          craigm Craig Macdonald added a comment -

          Formatting change

          craigm Craig Macdonald added a comment -

          Committed to trunk

          craigm Craig Macdonald added a comment -

          Test case was committed to HadoopShakespeareEndToEndTest for testing splitting of a direct index.


            People

            • Assignee:
              craigm Craig Macdonald
              Reporter:
              craigm Craig Macdonald
            • Watchers:
              0

              Dates

              • Created:
                Updated:
                Resolved: