Uploaded image for project: 'Terrier Core'
  1. Terrier Core
  2. TR-553

Transition to a more generic JSON document reader for tweets

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 5.0
    • Fix Version/s: 5.1
    • Component/s: .indexing
    • Labels:
      None

      Description

      Twitter Indexing currently uses an old class called TwitterJSONDocument to convert JSON tweets into Terrier documents. This makes many assumptions about the format of tweets and is in general slower than desired.

      Internally we replaced this class with a faster version based on Jackson (AIR sub-project), we should transition to this version instead.

        Attachments

          Activity

          Hide
          richardm Richard McCreadie added a comment -

          Fix committed in d2d4e6fc. Passes unit tests

          Show
          richardm Richard McCreadie added a comment - Fix committed in d2d4e6fc. Passes unit tests
          Hide
          richardm Richard McCreadie added a comment -

          Moved to FlatJSONDocument. Pointed TwitterJSONCollection at this alternative document class. TwitterJSONDocument is left in the code base

          Show
          richardm Richard McCreadie added a comment - Moved to FlatJSONDocument. Pointed TwitterJSONCollection at this alternative document class. TwitterJSONDocument is left in the code base

            People

            • Assignee:
              richardm Richard McCreadie
              Reporter:
              richardm Richard McCreadie
            • Watchers:
              1 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: