Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Blocker
    • Resolution: Fixed
    • Affects Version/s: 3.0
    • Fix Version/s: 3.0
    • Component/s: None
    • Labels:
      None

      Description

      Much has changed for Terrier 3.0. We need to spend much time improving the documentation before the 3.0 release.

      E.g.:
       * http_terrier.sh etc
       * MetaIndex
       * Changes to inverted index parsing
       * Changes to namespaces
       * TSMs are deprecated
       * Improved fields
       * TrecTerrier can take command line -D options

        Attachments

          Activity

          Hide
          richardm Richard McCreadie added a comment - - edited

          List of documents needing changed:
          Overview:

          • add clueweb09/blogs08
          • hiemstra and croft
          • 50 million documents
          • 6 years of work
          • add web interface
          • frequency occurrences in fields
          • field weighting models?

          QuickStart:

          • change version numbers
          • one command for tar
          • add web interface

          Components:

          • remove term score modifiers
          • meta index structure
          • update applications

          Configure Indexing:

          • remove docno/string.byte.length
          • add meta index

          Configure Retrieval

          • add/remove weighting models
          • TREC output format

          Desktop:

          • take out file list

          Examples:

          • remove croft
          • check all numbers after changing the stemmer
          • check retrieval performance

          Hadoop Indexing:

          • talk about document/term partitioning
          • inverted to direct in MapReduce

          Properties:

          • needs a pass

          Extend Terrier:

          • change version

          Extend Retrieval:

          • remove LM matching
          • examples changed for posting lists

          Non-English:

          • use UTF

          Future Features

          • needs a pass

          Whats New:

          • move TREC resolved issues to TR and update
          • detail what has changed

          NewPages

          • Web Interface
          Show
          richardm Richard McCreadie added a comment - - edited List of documents needing changed: Overview: add clueweb09/blogs08 hiemstra and croft 50 million documents 6 years of work add web interface frequency occurrences in fields field weighting models? QuickStart: change version numbers one command for tar add web interface Components: remove term score modifiers meta index structure update applications Configure Indexing: remove docno/string.byte.length add meta index Configure Retrieval add/remove weighting models TREC output format Desktop: take out file list Examples: remove croft check all numbers after changing the stemmer check retrieval performance Hadoop Indexing: talk about document/term partitioning inverted to direct in MapReduce Properties: needs a pass Extend Terrier: change version Extend Retrieval: remove LM matching examples changed for posting lists Non-English: use UTF Future Features needs a pass Whats New: move TREC resolved issues to TR and update detail what has changed NewPages Web Interface
          Hide
          craigm Craig Macdonald added a comment -

          I have made my pass at the documentation. Only "Whats New" remains to be done.

          Please can others read each page, and make alterations. Note here what alterations you complete.

          Show
          craigm Craig Macdonald added a comment - I have made my pass at the documentation. Only "Whats New" remains to be done. Please can others read each page, and make alterations. Note here what alterations you complete.
          Hide
          craigm Craig Macdonald added a comment -

          Many additional updates have been made, covering choosing an appropriate Collection class, RF, Proximity, field-models, links to wiki, and more.

          Show
          craigm Craig Macdonald added a comment - Many additional updates have been made, covering choosing an appropriate Collection class, RF, Proximity, field-models, links to wiki, and more.

            People

            • Assignee:
              craigm Craig Macdonald
              Reporter:
              craigm Craig Macdonald
            • Watchers:
              0 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: