Uploaded image for project: 'Terrier Core'
  1. Terrier Core
  2. TR-37

Full support for direct file generation in Hadoop mode indexing

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 3.0
    • Fix Version/s: 3.0
    • Component/s: .structures
    • Labels:
      None

      Attachments

        Activity

        Hide
        richardm Richard McCreadie added a comment -

        MapReduce Index reading of Terrier Inverted Indicies is now ready for testing. Output Takes the form of 'n' lists of Term-PostingList pairs. Where 'n' is set by the user.

        Classes:
        InvertedIndexSplit : Defines an ordered subset of documents from an Inverted Index and provides opperations on it.
        InvertedIndexInputFormat : Splits the Inverted Index into 'n' InvertedIndexSplits by term.
        InvertedIndexRecordReader : Defines Iteration through an InvertedIndexSplit

        Show
        richardm Richard McCreadie added a comment - MapReduce Index reading of Terrier Inverted Indicies is now ready for testing. Output Takes the form of 'n' lists of Term-PostingList pairs. Where 'n' is set by the user. Classes: InvertedIndexSplit : Defines an ordered subset of documents from an Inverted Index and provides opperations on it. InvertedIndexInputFormat : Splits the Inverted Index into 'n' InvertedIndexSplits by term. InvertedIndexRecordReader : Defines Iteration through an InvertedIndexSplit
        Hide
        craigm Craig Macdonald added a comment -

        Committed my own version to trunk.

        Show
        craigm Craig Macdonald added a comment - Committed my own version to trunk.

          People

          • Assignee:
            craigm Craig Macdonald
            Reporter:
            ben Ben He
          • Watchers:
            1 Start watching this issue

            Dates

            • Created:
              Updated:
              Resolved: