Uploaded image for project: 'Terrier Core'
  1. Terrier Core
  2. TR-144

CollectionRecordReader.next should not be recursive

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 3.0
    • Fix Version/s: 3.5
    • Component/s: .structures
    • Labels:
      None

      Description

      org.terrier.structures.indexing.singlepass.hadoop.CollectionRecordReader.next recursively locates the next Document to be processed from the Collection object. However, for cases where some documents in the sequence are missing (e.g., we might want to index only a few selected documents), this results in too many recursive calls, which raise a stack overflow exception.

      CollectionRecordReader.next should be made iterative instead of recursive.

        Attachments

          Activity

            People

            • Assignee:
              rodrygo Rodrygo L. T. Santos
              Reporter:
              rodrygo Rodrygo L. T. Santos
            • Watchers:
              1 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: