I would like to process the AQUAINT dataset for Robust05. Could I directly use the TRECCollection class, or should I implement a separate parser for the document?
In addition, given the availability of the AQUAINT in our group, I am actually employing the GIGAWORD5 corpus, which is supposed to be a superset of the AQUAINT. I would like to ask should I make any special attentions for this corpus when indexing it with Terrier?
Thanks in advances.
Edited 1 time(s). Last edit at 06/12/2017 01:43PM by khui.