Package org.apache.nutch.segment

Interface Summary
SegmentMergeFilter Interface used to filter segments during segment merge.

Class Summary
ContentAsTextInputFormat An input format that takes Nutch Content objects and converts them to text, replacing newlines with spaces.
SegmentMergeFilters This class wraps all SegmentMergeFilter extensions in a single object so it is easier to operate on them.
SegmentMerger This tool takes several segments and merges their data together.
SegmentMerger.ObjectInputFormat Wraps inputs in a MetaWrapper, to permit merging different types in reduce and to use additional metadata.
SegmentMerger.SegmentOutputFormat  
SegmentPart Utility class for handling information about segment parts.
SegmentReader Dumps the content of a segment.
SegmentReader.InputCompatMapper  
SegmentReader.SegmentReaderStats  
SegmentReader.TextOutputFormat Implements a text output format.
 
Copyright © 2012 The Apache Software Foundation