public class XmlInputFormat
extends org.apache.hadoop.mapreduce.lib.input.TextInputFormat
Modifier and Type | Class and Description |
---|---|
static class | XmlInputFormat.XmlRecordReader: record reader that reads through a given XML document and outputs XML blocks as records, as specified by the start tag and end tag. |
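The record splitting described for XmlRecordReader can be illustrated with a minimal sketch in plain Java. This is not the class's actual implementation: `TagBlockScanner` and `extractBlocks` are hypothetical names, the sketch works on an in-memory string rather than an input split, and it assumes each record runs from a start tag to the next end tag with no nesting.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of XmlInputFormat: it only illustrates
// the idea of cutting a document into records at a start tag and an end tag.
final class TagBlockScanner {

    private TagBlockScanner() {}

    /**
     * Returns every substring of doc that begins with startTag and ends with
     * the next endTag found after it, in document order. Text outside the
     * tag pairs is skipped. Assumes blocks are not nested.
     */
    static List<String> extractBlocks(String doc, String startTag, String endTag) {
        List<String> blocks = new ArrayList<>();
        int pos = 0;
        while (true) {
            int start = doc.indexOf(startTag, pos);
            if (start < 0) {
                break; // no further start tag
            }
            int end = doc.indexOf(endTag, start + startTag.length());
            if (end < 0) {
                break; // unterminated block: drop it
            }
            int blockEnd = end + endTag.length();
            blocks.add(doc.substring(start, blockEnd));
            pos = blockEnd; // continue scanning after this record
        }
        return blocks;
    }

    public static void main(String[] args) {
        String doc = "<wiki><page>a</page>junk<page>b</page></wiki>";
        // Prints "<page>a</page>" and then "<page>b</page>", one per line.
        for (String block : extractBlocks(doc, "<page>", "</page>")) {
            System.out.println(block);
        }
    }
}
```

In the sketch the two tags are passed as arguments. The START_TAG_KEY and END_TAG_KEY fields suggest that the real reader takes them from the job configuration instead, although this page does not show the key values.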
Modifier and Type | Field and Description |
---|---|
static String | END_TAG_KEY |
static String | START_TAG_KEY |
Constructor and Description |
---|
XmlInputFormat() |
Modifier and Type | Method and Description |
---|---|
org.apache.hadoop.mapreduce.RecordReader<org.apache.hadoop.io.LongWritable,org.apache.hadoop.io.Text> | createRecordReader(org.apache.hadoop.mapreduce.InputSplit split, org.apache.hadoop.mapreduce.TaskAttemptContext context) |
Methods inherited from class org.apache.hadoop.mapreduce.lib.input.FileInputFormat: addInputPath, addInputPathRecursively, addInputPaths, computeSplitSize, getBlockIndex, getFormatMinSplitSize, getInputDirRecursive, getInputPathFilter, getInputPaths, getMaxSplitSize, getMinSplitSize, getSplits, listStatus, makeSplit, setInputDirRecursive, setInputPathFilter, setInputPaths, setInputPaths, setMaxInputSplitSize, setMinInputSplitSize
public static final String START_TAG_KEY
public static final String END_TAG_KEY
public org.apache.hadoop.mapreduce.RecordReader<org.apache.hadoop.io.LongWritable,org.apache.hadoop.io.Text> createRecordReader(org.apache.hadoop.mapreduce.InputSplit split, org.apache.hadoop.mapreduce.TaskAttemptContext context)
Overrides: createRecordReader in class org.apache.hadoop.mapreduce.lib.input.TextInputFormat
Copyright © 2008–2015 The Apache Software Foundation. All rights reserved.