org.apache.nutch.indexer.field
Class BasicFields.Merger

java.lang.Object
  extended by org.apache.hadoop.conf.Configured
      extended by org.apache.nutch.indexer.field.BasicFields.Merger
All Implemented Interfaces:
Closeable, Configurable, JobConfigurable, Reducer<Text,FieldsWritable,Text,FieldsWritable>
Enclosing class:
BasicFields

public static class BasicFields.Merger
extends Configured
implements Reducer<Text,FieldsWritable,Text,FieldsWritable>

Merges the field output of all segments, keeping only the most recent set of fields for any given URL.


Constructor Summary
BasicFields.Merger()
           
 
Method Summary
 void close()
           
 void configure(JobConf conf)
          Configures the job.
 void reduce(Text key, Iterator<FieldsWritable> values, OutputCollector<Text,FieldsWritable> output, Reporter reporter)
          Collects the most recent set of fields for the given URL.
 
Methods inherited from class org.apache.hadoop.conf.Configured
getConf, setConf
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

BasicFields.Merger

public BasicFields.Merger()
Method Detail

configure

public void configure(JobConf conf)
Configures the job.

Specified by:
configure in interface JobConfigurable

close

public void close()
Specified by:
close in interface Closeable

reduce

public void reduce(Text key,
                   Iterator<FieldsWritable> values,
                   OutputCollector<Text,FieldsWritable> output,
                   Reporter reporter)
            throws IOException
Collects the most recent set of fields for the given URL (the key) and emits only that set.

Specified by:
reduce in interface Reducer<Text,FieldsWritable,Text,FieldsWritable>
Throws:
IOException
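
Example:
The "most recent wins" selection described for reduce can be illustrated with a minimal standalone sketch. It does not use the Hadoop or Nutch classes. The names MostRecentFieldsSketch, FieldSet, timestamp and mostRecent are hypothetical, and the assumption that recency is decided by a single numeric timestamp per set of fields is not confirmed by this page.

```java
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;

public class MostRecentFieldsSketch {

    /** Hypothetical stand-in for FieldsWritable: a timestamp plus field values. */
    static final class FieldSet {
        final long timestamp;       // assumed recency marker, not taken from the Nutch API
        final List<String> fields;

        FieldSet(long timestamp, List<String> fields) {
            this.timestamp = timestamp;
            this.fields = fields;
        }
    }

    /**
     * Scans all values grouped under one URL and returns the entry with the
     * greatest timestamp, or null when there are no values.
     */
    static FieldSet mostRecent(Iterator<FieldSet> values) {
        FieldSet newest = null;
        while (values.hasNext()) {
            FieldSet candidate = values.next();
            if (newest == null || candidate.timestamp > newest.timestamp) {
                newest = candidate;
            }
        }
        return newest;
    }

    public static void main(String[] args) {
        // Three field sets for the same URL, as if produced by three segments.
        List<FieldSet> perSegment = Arrays.asList(
            new FieldSet(100L, Arrays.asList("title:old")),
            new FieldSet(300L, Arrays.asList("title:new")),
            new FieldSet(200L, Arrays.asList("title:mid")));

        FieldSet winner = mostRecent(perSegment.iterator());
        System.out.println(winner.timestamp + " " + winner.fields);
    }
}
```

In a real Reducer written against the org.apache.hadoop.mapred API, the values iterator may reuse the same object instance between calls to next(), so a retained candidate would typically need to be copied before the iterator advances. The sketch sidesteps this by using immutable plain objects.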


Copyright © 2006 The Apache Software Foundation