Terrier Users :  Terrier Forum terrier.org
General discussion about using/developing applications using Terrier 
Indexing MEDLINE (>25M docs) with Terrier
Posted by: Cordobal ()
Date: May 10, 2017 01:24PM

Dear Terrier community,

I would like to index MEDLINE with Terrier. MEDLINE is a large collection (>25M) of bibliographic records. Records are not very long (a title and a abstract).

The primary goal of this is to perform searches as quick as possible.

I already did this by splitting the collection into subcollections, and then running several searches in parallel. But it is not satisfactory, because document frequencies are different for each subcollection sad smiley

Do you think it's possible to define my own DF in Terrier ?

Do you have a better idea ?

Thanks for your help,

Julien

Options: ReplyQuote


Sorry, only registered users may post in this forum.
This forum powered by Phorum.