Terrier Users :  Terrier Forum terrier.org
General discussion about using/developing applications using Terrier 
what is the codec of lexicon?
Posted by: deeper2 ()
Date: December 30, 2017 02:57AM

Dear sir,

I have found plugging compression of posting list on terrier, but not found the codec of lexicon.
So can you tell me what is the lexicon structure of terrier? If it is stored in raw form of LexiconEntry? how does Terrier cope with various-sized term string, fixed length or ?

Options: ReplyQuote
Re: what is the codec of lexicon?
Posted by: craigm ()
Date: January 03, 2018 12:41PM

Hi,

The lexicon is not compressed. We used Writable implementation of the Text and the LexiconEntry. These are guaranteed to have a fixed size. See FixedSizeWritableFactory etc. We then to a binary search upon the written lexicon.

Craig

Options: ReplyQuote


Sorry, only registered users may post in this forum.
This forum powered by Phorum.