public class SymlinkTextInputFormat extends SymbolicInputFormat implements org.apache.hadoop.mapred.InputFormat<org.apache.hadoop.io.LongWritable,org.apache.hadoop.io.Text>, org.apache.hadoop.mapred.JobConfigurable, ContentSummaryInputFormat, ReworkMapredInputFormat
Modifier and Type | Class and Description |
---|---|
static class | SymlinkTextInputFormat.SymlinkTextInputSplit: This input split wraps the FileSplit generated from TextInputFormat.getSplits(), while setting the original link file path as the job input path. |
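The wrapping idea described for SymlinkTextInputSplit can be illustrated without Hadoop on the classpath. The following is only a minimal sketch in plain Java: `TargetSplitSketch` and `SymlinkSplitSketch` are hypothetical names invented for this illustration, and they stand in for the FileSplit and the wrapper respectively. The actual Hive class may differ in its fields and behavior.

```java
/** Hypothetical stand-in for a file split: a byte range of one target file. */
final class TargetSplitSketch {
    final String targetPath;
    final long start;
    final long length;

    TargetSplitSketch(String targetPath, long start, long length) {
        this.targetPath = targetPath;
        this.start = start;
        this.length = length;
    }
}

/** Hypothetical wrapper: reports the symlink file as its path, keeps the real split inside. */
final class SymlinkSplitSketch {
    private final String symlinkPath;        // the original link file, seen as the job input path
    private final TargetSplitSketch target;  // the split over the actual data file

    SymlinkSplitSketch(String symlinkPath, TargetSplitSketch target) {
        this.symlinkPath = symlinkPath;
        this.target = target;
    }

    /** Path exposed to the job: the link file, not the data file. */
    String getPath() {
        return symlinkPath;
    }

    /** The wrapped split that a record reader would actually open. */
    TargetSplitSketch getTargetSplit() {
        return target;
    }

    public static void main(String[] args) {
        TargetSplitSketch data = new TargetSplitSketch("/warehouse/data/part-00000", 0L, 1024L);
        SymlinkSplitSketch split = new SymlinkSplitSketch("/warehouse/links/symlink.txt", data);
        System.out.println("job input path: " + split.getPath());
        System.out.println("data read from: " + split.getTargetSplit().targetPath);
    }
}
```

The point of the design is that code keyed on the input path keeps seeing the link file, while the reader is handed the wrapped split over the target data.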
Constructor and Description |
---|
SymlinkTextInputFormat() |
Modifier and Type | Method and Description |
---|---|
void | configure(org.apache.hadoop.mapred.JobConf job) |
org.apache.hadoop.fs.ContentSummary | getContentSummary(org.apache.hadoop.fs.Path p, org.apache.hadoop.mapred.JobConf job) |
org.apache.hadoop.mapred.RecordReader<org.apache.hadoop.io.LongWritable,org.apache.hadoop.io.Text> | getRecordReader(org.apache.hadoop.mapred.InputSplit split, org.apache.hadoop.mapred.JobConf job, org.apache.hadoop.mapred.Reporter reporter) |
org.apache.hadoop.mapred.InputSplit[] | getSplits(org.apache.hadoop.mapred.JobConf job, int numSplits): Parses all target paths from the job input directory, which contains symlink files, and splits the target data using TextInputFormat. |
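The first half of what getSplits is described as doing, collecting target paths from the symlink files in the input directory, can be sketched with the JDK alone. This is not the Hive implementation: `SymlinkTargetSketch` and `readTargetPaths` are hypothetical names, the local filesystem stands in for the Hadoop FileSystem API, and the sketch assumes that each symlink file lists one target path per line and that files whose names start with `_` or `.` are skipped as hidden.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

/** Hypothetical helper, not part of Hive: resolves symlink files to the paths they list. */
final class SymlinkTargetSketch {

    /** Reads every visible regular file in symlinkDir and returns the target paths listed in them. */
    static List<String> readTargetPaths(Path symlinkDir) throws IOException {
        List<Path> symlinkFiles = new ArrayList<>();
        try (DirectoryStream<Path> stream = Files.newDirectoryStream(symlinkDir)) {
            for (Path p : stream) {
                String name = p.getFileName().toString();
                // Assumption: names starting with '_' or '.' are hidden and ignored.
                if (Files.isRegularFile(p) && !name.startsWith("_") && !name.startsWith(".")) {
                    symlinkFiles.add(p);
                }
            }
        }
        Collections.sort(symlinkFiles); // deterministic order for the sketch

        List<String> targets = new ArrayList<>();
        for (Path symlinkFile : symlinkFiles) {
            // Assumption: one target path per line; blank lines are skipped.
            for (String line : Files.readAllLines(symlinkFile, StandardCharsets.UTF_8)) {
                String trimmed = line.trim();
                if (!trimmed.isEmpty()) {
                    targets.add(trimmed);
                }
            }
        }
        return targets;
    }

    public static void main(String[] args) throws IOException {
        Path dir = Files.createTempDirectory("symlink-sketch");
        Files.write(dir.resolve("links.txt"),
                Arrays.asList("/warehouse/data/part-00000", "/warehouse/data/part-00001"));
        System.out.println(readTargetPaths(dir));
    }
}
```

In the real class, the collected target paths would then be handed to TextInputFormat for splitting, as the method description states; that second step is not reproduced here.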
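Methods inherited from class SymbolicInputFormat: rework
Methods inherited from class java.lang.Object: clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
Methods inherited from interface ReworkMapredInputFormat: rework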
public org.apache.hadoop.mapred.RecordReader<org.apache.hadoop.io.LongWritable,org.apache.hadoop.io.Text> getRecordReader(org.apache.hadoop.mapred.InputSplit split, org.apache.hadoop.mapred.JobConf job, org.apache.hadoop.mapred.Reporter reporter) throws IOException
Specified by: getRecordReader in interface org.apache.hadoop.mapred.InputFormat<org.apache.hadoop.io.LongWritable,org.apache.hadoop.io.Text>
Throws: IOException
public org.apache.hadoop.mapred.InputSplit[] getSplits(org.apache.hadoop.mapred.JobConf job, int numSplits) throws IOException
Specified by: getSplits in interface org.apache.hadoop.mapred.InputFormat<org.apache.hadoop.io.LongWritable,org.apache.hadoop.io.Text>
Throws: IOException
public void configure(org.apache.hadoop.mapred.JobConf job)
Specified by: configure in interface org.apache.hadoop.mapred.JobConfigurable
public org.apache.hadoop.fs.ContentSummary getContentSummary(org.apache.hadoop.fs.Path p, org.apache.hadoop.mapred.JobConf job) throws IOException
Specified by: getContentSummary in interface ContentSummaryInputFormat
Throws: IOException
Copyright © 2017 The Apache Software Foundation. All rights reserved.