[TR-41] Hadoop Indexing loads CompressedMetaIndex into memory during reduce phase Created: 08/May/09 Updated: 05/Mar/10 Resolved: 18/Jun/09
|Reporter:||Craig Macdonald||Assignee:||Craig Macdonald|
The use of the MetaIndex during the reduce phase to merge the meta indexes. If there are lots of map tasks, this can result in too many meta indices being loaded into memory at once.
The solution is to access the MetaIndex as a stream.
|Comment by Craig Macdonald [ 18/Jun/09 ]|
Committed to SVN. Also uses MapReduce job to do metaindex inversion, i.e. meta->docid