Class FilterOutputFormat<K,V>
java.lang.Object
org.apache.hadoop.mapreduce.OutputFormat<K,V>
org.apache.hadoop.mapreduce.lib.output.FilterOutputFormat<K,V>
- Direct Known Subclasses:
LazyOutputFormat
FilterOutputFormat is a convenience class that wraps OutputFormat.
-
Nested Class Summary
Nested ClassesModifier and TypeClassDescriptionstatic classorg.apache.hadoop.mapreduce.lib.output.FilterOutputFormat.FilterRecordWriter<K,V> FilterRecordWriteris a convenience wrapper class that extends theRecordWriter. -
Field Summary
Fields -
Constructor Summary
ConstructorsConstructorDescriptionFilterOutputFormat(OutputFormat<K, V> baseOut) Create a FilterOutputFormat based on the underlying output format. -
Method Summary
Modifier and TypeMethodDescriptionvoidcheckOutputSpecs(JobContext context) Check for validity of the output-specification for the job.getOutputCommitter(TaskAttemptContext context) Get the output committer for this output format.getRecordWriter(TaskAttemptContext context) Get theRecordWriterfor the given task.
-
Field Details
-
baseOut
-
-
Constructor Details
-
FilterOutputFormat
public FilterOutputFormat() -
FilterOutputFormat
Create a FilterOutputFormat based on the underlying output format.- Parameters:
baseOut- the underlying OutputFormat
-
-
Method Details
-
getRecordWriter
public RecordWriter<K,V> getRecordWriter(TaskAttemptContext context) throws IOException, InterruptedException Description copied from class:OutputFormatGet theRecordWriterfor the given task.- Specified by:
getRecordWriterin classOutputFormat<K,V> - Parameters:
context- the information about the current task.- Returns:
- a
RecordWriterto write the output for the job. - Throws:
IOExceptionInterruptedException
-
checkOutputSpecs
Description copied from class:OutputFormatCheck for validity of the output-specification for the job.This is to validate the output specification for the job when it is a job is submitted. Typically checks that it does not already exist, throwing an exception when it already exists, so that output is not overwritten.
Implementations which write to filesystems which support delegation tokens usually collect the tokens for the destination path(s) and attach them to the job context's JobConf.- Specified by:
checkOutputSpecsin classOutputFormat<K,V> - Parameters:
context- information about the job- Throws:
IOException- when output should not be attemptedInterruptedException
-
getOutputCommitter
public OutputCommitter getOutputCommitter(TaskAttemptContext context) throws IOException, InterruptedException Description copied from class:OutputFormatGet the output committer for this output format. This is responsible for ensuring the output is committed correctly.- Specified by:
getOutputCommitterin classOutputFormat<K,V> - Parameters:
context- the task context- Returns:
- an output committer
- Throws:
IOExceptionInterruptedException
-