public class XmlCollectionWithTagInputFormat
extends org.apache.hadoop.mapreduce.lib.input.TextInputFormat
Modifier and Type | Class | Description |
---|---|---|
static class |
XmlCollectionWithTagInputFormat.XmlRecordReader |
XMLRecordReader class to read through a given xml document to output xml blocks as records as specified
by the end tag
|
Modifier and Type | Field | Description |
---|---|---|
static String |
ENDING_TAG |
|
static String |
STARTING_TAG |
Constructor | Description |
---|---|
XmlCollectionWithTagInputFormat() |
Modifier and Type | Method | Description |
---|---|---|
org.apache.hadoop.mapreduce.RecordReader<org.apache.hadoop.io.LongWritable,org.apache.hadoop.io.Text> |
createRecordReader(org.apache.hadoop.mapreduce.InputSplit split,
org.apache.hadoop.mapreduce.TaskAttemptContext context) |
addInputPath, addInputPathRecursively, addInputPaths, computeSplitSize, getBlockIndex, getFormatMinSplitSize, getInputDirRecursive, getInputPathFilter, getInputPaths, getMaxSplitSize, getMinSplitSize, getSplits, listStatus, makeSplit, makeSplit, setInputDirRecursive, setInputPathFilter, setInputPaths, setInputPaths, setMaxInputSplitSize, setMinInputSplitSize
public static String STARTING_TAG
public static String ENDING_TAG
public org.apache.hadoop.mapreduce.RecordReader<org.apache.hadoop.io.LongWritable,org.apache.hadoop.io.Text> createRecordReader(org.apache.hadoop.mapreduce.InputSplit split, org.apache.hadoop.mapreduce.TaskAttemptContext context)
createRecordReader
in class org.apache.hadoop.mapreduce.lib.input.TextInputFormat
Copyright © 2019 Apache Software Foundation. All rights reserved.