Dear Terrier community,
I would like to index MEDLINE with Terrier. MEDLINE is a large collection (>25M) of bibliographic records. The records are short: each consists of a title and an abstract.
My primary goal is to run searches as quickly as possible.
I have already tried splitting the collection into subcollections and running the searches on them in parallel. However, this is not satisfactory, because each subcollection has its own document frequencies, so the same term is weighted differently depending on where a document happens to be stored.
Do you think it is possible to supply my own document frequency (DF) values in Terrier?
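
To make the question concrete, here is a minimal sketch in plain Python of what I mean. It does not use Terrier's API; the function names, the toy documents, and the smoothed IDF formula are all made up for illustration. It shows how the same term gets a different IDF in each subcollection, and how summing the per-subcollection DFs would give one collection-wide value that every subcollection could share.

```python
import math
from collections import Counter


def document_frequencies(docs):
    """Count, for each term, how many documents contain it."""
    df = Counter()
    for doc in docs:
        # set() so that a term is counted once per document
        df.update(set(doc.lower().split()))
    return df


def idf(term, df, n_docs):
    """Smoothed inverse document frequency (illustrative formula only)."""
    return math.log((n_docs + 1) / (df.get(term, 0) + 1))


# Two toy subcollections (hypothetical data, not real MEDLINE records)
shard_a = ["aspirin reduces fever", "aspirin and heart disease", "aspirin dosage"]
shard_b = ["gene expression in yeast", "aspirin in yeast models", "protein folding"]

df_a = document_frequencies(shard_a)
df_b = document_frequencies(shard_b)

# Collection-wide statistics: sum the per-subcollection DFs and sizes
global_df = df_a + df_b
global_n = len(shard_a) + len(shard_b)

local_idf_a = idf("aspirin", df_a, len(shard_a))
local_idf_b = idf("aspirin", df_b, len(shard_b))
global_idf = idf("aspirin", global_df, global_n)

print(local_idf_a, local_idf_b, global_idf)
```

Ideally, each subcollection would score its documents with the value in `global_idf` rather than its own local one.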
Or do you have a better idea?
Thanks for your help,