org.apache.nutch.segment
Class SegmentMerger.ObjectInputFormat
java.lang.Object
org.apache.hadoop.mapred.FileInputFormat<K,V>
org.apache.hadoop.mapred.SequenceFileInputFormat<Text,MetaWrapper>
org.apache.nutch.segment.SegmentMerger.ObjectInputFormat
- All Implemented Interfaces:
- InputFormat<Text,MetaWrapper>
- Enclosing class:
- SegmentMerger
public static class SegmentMerger.ObjectInputFormat
- extends SequenceFileInputFormat<Text,MetaWrapper>
Wraps inputs in an MetaWrapper
, to permit merging different
types in reduce and use additional metadata.
Methods inherited from class org.apache.hadoop.mapred.FileInputFormat |
addInputPath, addInputPaths, computeSplitSize, getBlockIndex, getInputPathFilter, getInputPaths, getSplitHosts, getSplits, isSplitable, setInputPathFilter, setInputPaths, setInputPaths, setMinSplitSize |
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
SegmentMerger.ObjectInputFormat
public SegmentMerger.ObjectInputFormat()
getRecordReader
public RecordReader<Text,MetaWrapper> getRecordReader(InputSplit split,
JobConf job,
Reporter reporter)
throws IOException
- Specified by:
getRecordReader
in interface InputFormat<Text,MetaWrapper>
- Overrides:
getRecordReader
in class SequenceFileInputFormat<Text,MetaWrapper>
- Throws:
IOException
Copyright © 2011 The Apache Software Foundation