- All Implemented Interfaces:
- org.apache.hadoop.conf.Configurable
public class SolrCloudPartitioner
extends org.apache.hadoop.mapreduce.Partitioner<org.apache.hadoop.io.Text,SolrInputDocumentWritable>
implements org.apache.hadoop.conf.Configurable
MapReduce partitioner that partitions the Mapper output such that each
SolrInputDocument gets sent to the SolrCloud shard that it would have been
sent to if the document were ingested via the standard SolrCloud Near Real
Time (NRT) API.
In other words, this class implements the same partitioning semantics as the
standard SolrCloud NRT API. This enables to mix batch updates from MapReduce
ingestion with updates from standard NRT ingestion on the same SolrCloud
cluster, using identical unique document keys.