org.apache.nutch.segment
Class SegmentMerger.ObjectInputFormat

java.lang.Object
  extended by org.apache.hadoop.mapred.FileInputFormat<K,V>
      extended by org.apache.hadoop.mapred.SequenceFileInputFormat<Text,MetaWrapper>
          extended by org.apache.nutch.segment.SegmentMerger.ObjectInputFormat
All Implemented Interfaces:
InputFormat<Text,MetaWrapper>
Enclosing class:
SegmentMerger

public static class SegmentMerger.ObjectInputFormat
extends SequenceFileInputFormat<Text,MetaWrapper>

Wraps inputs in an MetaWrapper, to permit merging different types in reduce and use additional metadata.


Field Summary
 
Fields inherited from class org.apache.hadoop.mapred.FileInputFormat
LOG
 
Constructor Summary
SegmentMerger.ObjectInputFormat()
           
 
Method Summary
 RecordReader<Text,MetaWrapper> getRecordReader(InputSplit split, JobConf job, Reporter reporter)
           
 
Methods inherited from class org.apache.hadoop.mapred.SequenceFileInputFormat
listStatus
 
Methods inherited from class org.apache.hadoop.mapred.FileInputFormat
addInputPath, addInputPaths, computeSplitSize, getBlockIndex, getInputPathFilter, getInputPaths, getSplitHosts, getSplits, isSplitable, setInputPathFilter, setInputPaths, setInputPaths, setMinSplitSize
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

SegmentMerger.ObjectInputFormat

public SegmentMerger.ObjectInputFormat()
Method Detail

getRecordReader

public RecordReader<Text,MetaWrapper> getRecordReader(InputSplit split,
                                                      JobConf job,
                                                      Reporter reporter)
                                               throws IOException
Specified by:
getRecordReader in interface InputFormat<Text,MetaWrapper>
Overrides:
getRecordReader in class SequenceFileInputFormat<Text,MetaWrapper>
Throws:
IOException


Copyright © 2011 The Apache Software Foundation