org.apache.nutch.crawl
Class CrawlDbReducer

java.lang.Object
  extended by org.apache.nutch.crawl.CrawlDbReducer
All Implemented Interfaces:
Closeable, JobConfigurable, Reducer<Text,CrawlDatum,Text,CrawlDatum>

public class CrawlDbReducer
extends Object
implements Reducer<Text,CrawlDatum,Text,CrawlDatum>

Merge new page entries with existing entries.


Field Summary
static org.apache.commons.logging.Log LOG
           
 
Constructor Summary
CrawlDbReducer()
           
 
Method Summary
 void close()
           
 void configure(JobConf job)
           
 void reduce(Text key, Iterator<CrawlDatum> values, OutputCollector<Text,CrawlDatum> output, Reporter reporter)
           
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

LOG

public static final org.apache.commons.logging.Log LOG
Constructor Detail

CrawlDbReducer

public CrawlDbReducer()
Method Detail

configure

public void configure(JobConf job)
Specified by:
configure in interface JobConfigurable

close

public void close()
Specified by:
close in interface Closeable

reduce

public void reduce(Text key,
                   Iterator<CrawlDatum> values,
                   OutputCollector<Text,CrawlDatum> output,
                   Reporter reporter)
            throws IOException
Specified by:
reduce in interface Reducer<Text,CrawlDatum,Text,CrawlDatum>
Throws:
IOException


Copyright © 2006 The Apache Software Foundation