Class DecoderRecode

  • All Implemented Interfaces:
    Externalizable, Serializable

    public class DecoderRecode
    extends Decoder
    Simple atomic decoder for recoded columns. Internally, this decoder builds inverted recode maps (ID to token) from the given frame metadata.
    See Also:
    Serialized Form
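To illustrate what an inverted recode map is, here is a minimal sketch in plain Java. It is not the SystemDS implementation: the class name InvertedRecodeMapSketch, the invert helper, and the use of java.util.HashMap are assumptions made for illustration only. A recode map assigns each token a numeric ID; inverting it yields the ID-to-token lookup that decoding needs.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch; names and data structures are assumptions, not SystemDS code.
class InvertedRecodeMapSketch {
    // Inverts a token -> ID recode map into an ID -> token map.
    static Map<Long, String> invert(Map<String, Long> recodeMap) {
        Map<Long, String> inverted = new HashMap<>();
        for (Map.Entry<String, Long> e : recodeMap.entrySet())
            inverted.put(e.getValue(), e.getKey());
        return inverted;
    }

    public static void main(String[] args) {
        Map<String, Long> recodeMap = new HashMap<>();
        recodeMap.put("red", 1L);
        recodeMap.put("green", 2L);
        Map<Long, String> inverted = invert(recodeMap);
        System.out.println(inverted.get(2L)); // prints "green"
    }
}
```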
    • Constructor Detail

      • DecoderRecode

        public DecoderRecode()
    • Method Detail

      • getRcMapValue

        public Object getRcMapValue​(int i,
                                    long key)
      • decode

        public FrameBlock decode​(MatrixBlock in,
                                 FrameBlock out)
        Description copied from class: Decoder
        Block decode API converting a matrix block into a frame block.
        Specified by:
        decode in class Decoder
        Parameters:
        in - Input matrix block
        out - Output frame block
        Returns:
        the given output frame block, returned for convenience
      • decode

        public void decode​(MatrixBlock in,
                           FrameBlock out,
                           int rl,
                           int ru)
        Description copied from class: Decoder
        Block decode API converting the rows rl (inclusive) to ru (exclusive) of a matrix block into a frame block.
        Specified by:
        decode in class Decoder
        Parameters:
        in - Input matrix block
        out - Output frame block
        rl - index of the first row to decode (inclusive)
        ru - index of the row at which decoding stops (exclusive)
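The half-open row range can be sketched as follows for a single recoded column. This is a simplified illustration under stated assumptions: plain arrays stand in for MatrixBlock and FrameBlock, and the class RowRangeDecodeSketch and its decode helper are hypothetical, not part of the documented API.

```java
import java.util.Arrays;
import java.util.Map;

// Hypothetical sketch; arrays stand in for MatrixBlock and FrameBlock.
class RowRangeDecodeSketch {
    // Decodes rows rl (inclusive) to ru (exclusive) of one recoded column.
    static void decode(double[] codes, String[] out, Map<Long, String> inverted, int rl, int ru) {
        for (int i = rl; i < ru; i++)
            out[i] = inverted.get((long) codes[i]);
    }

    public static void main(String[] args) {
        double[] codes = {1, 2, 2, 1};
        String[] out = new String[codes.length];
        decode(codes, out, Map.of(1L, "red", 2L, "green"), 1, 3);
        // Rows 0 and 3 are outside the range and stay untouched.
        System.out.println(Arrays.toString(out)); // prints "[null, green, green, null]"
    }
}
```

Because the range is half-open, adjacent row blocks such as [0, 2) and [2, 4) can be decoded independently without overlap.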
      • subRangeDecoder

        public Decoder subRangeDecoder​(int colStart,
                                       int colEnd,
                                       int dummycodedOffset)
        Description copied from class: Decoder
        Returns a new Decoder that only handles a sub range of columns. The sub-range refers to the columns after decoding.
        Overrides:
        subRangeDecoder in class Decoder
        Parameters:
        colStart - the start index of the sub-range (1-based, inclusive)
        colEnd - the end index of the sub-range (1-based, exclusive)
        dummycodedOffset - the offset of dummycoded segments before colStart
        Returns:
        a decoder of the same type, just for the sub-range
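One way to read the index convention is sketched below: columns are kept if they fall in the 1-based range [colStart, colEnd). The sketch additionally assumes that kept column IDs are re-based so the sub-range starts at 1, and it ignores dummycodedOffset entirely; both are simplifications for illustration, and the class SubRangeSketch is hypothetical rather than the actual implementation.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch; the re-basing of column IDs is an assumption.
class SubRangeSketch {
    // Keeps the 1-based column IDs in [colStart, colEnd) and shifts them
    // so that colStart maps to column 1 of the sub-range.
    static List<Integer> subRangeColumns(int[] colList, int colStart, int colEnd) {
        List<Integer> cols = new ArrayList<>();
        for (int col : colList)
            if (col >= colStart && col < colEnd)
                cols.add(col - (colStart - 1));
        return cols;
    }

    public static void main(String[] args) {
        // Columns 3 and 4 fall into [3, 7); column 7 is excluded.
        System.out.println(subRangeColumns(new int[] {1, 3, 4, 7}, 3, 7)); // prints "[1, 2]"
    }
}
```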
      • parseRecodeMapEntry

        public static void parseRecodeMapEntry​(String entry,
                                               Pair<String,​String> pair)
        Parses a line of <token, ID, count> into <token, ID> pairs, where quoted tokens (potentially including separators) are supported.
        Parameters:
        entry - entry line (token, ID, count)
        pair - token-ID pair
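The parsing idea can be sketched as follows. The sketch assumes a comma as the separator and an entry layout of token,ID,count; neither is confirmed by this page. It scans from the right so that separators inside a quoted token are left alone. The class RecodeEntryParseSketch is hypothetical and returns a java.util.Map.Entry instead of filling a Pair.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.Map;

// Hypothetical sketch; separator and entry layout are assumptions.
class RecodeEntryParseSketch {
    // Parses "token,ID,count" into a (token, ID) entry. The last two commas
    // delimit ID and count, so commas inside the token are preserved.
    static Map.Entry<String, String> parse(String entry) {
        int countSep = entry.lastIndexOf(',');
        int idSep = entry.lastIndexOf(',', countSep - 1);
        String token = entry.substring(0, idSep);
        String id = entry.substring(idSep + 1, countSep);
        // Strip one pair of surrounding double quotes, if present.
        if (token.length() >= 2 && token.startsWith("\"") && token.endsWith("\""))
            token = token.substring(1, token.length() - 1);
        return new SimpleEntry<>(token, id);
    }

    public static void main(String[] args) {
        Map.Entry<String, String> e = parse("\"a,b\",3,10");
        System.out.println(e.getKey());   // prints "a,b"
        System.out.println(e.getValue()); // prints "3"
    }
}
```

The sketch does not validate its input; an entry with fewer than two separators would fail with an exception.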
      • writeExternal

        public void writeExternal​(ObjectOutput out)
                           throws IOException
        Description copied from class: Decoder
        Redirects the default Java serialization via Externalizable to our default Hadoop Writable serialization for efficient broadcast/RDD serialization.
        Specified by:
        writeExternal in interface Externalizable
        Overrides:
        writeExternal in class Decoder
        Parameters:
        out - object output
        Throws:
        IOException - if an I/O error occurs
      • readExternal

        public void readExternal​(ObjectInput in)
                          throws IOException
        Description copied from class: Decoder
        Redirects the default Java serialization via Externalizable to our default Hadoop Writable serialization for efficient broadcast/RDD deserialization.
        Specified by:
        readExternal in interface Externalizable
        Overrides:
        readExternal in class Decoder
        Parameters:
        in - object input
        Throws:
        IOException - if an I/O error occurs
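The redirection pattern described for writeExternal and readExternal can be sketched with the standard library alone. In this illustration, the write and readFields methods mimic the shape of a Hadoop Writable without depending on Hadoop; the class ExternalizableSketch, its token array, and the roundTrip helper are all hypothetical and do not reflect the fields DecoderRecode actually serializes.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInput;
import java.io.DataOutput;
import java.io.Externalizable;
import java.io.IOException;
import java.io.ObjectInput;
import java.io.ObjectInputStream;
import java.io.ObjectOutput;
import java.io.ObjectOutputStream;
import java.util.Arrays;

// Hypothetical sketch; the serialized state is an assumption for illustration.
class ExternalizableSketch implements Externalizable {
    private String[] tokens = new String[0];

    public ExternalizableSketch() {} // Externalizable requires a public no-arg constructor

    ExternalizableSketch(String[] tokens) { this.tokens = tokens; }

    String[] tokens() { return tokens; }

    // Compact custom format, shaped like a Writable's write method.
    void write(DataOutput out) throws IOException {
        out.writeInt(tokens.length);
        for (String t : tokens)
            out.writeUTF(t);
    }

    // Counterpart of write, shaped like a Writable's readFields method.
    void readFields(DataInput in) throws IOException {
        tokens = new String[in.readInt()];
        for (int i = 0; i < tokens.length; i++)
            tokens[i] = in.readUTF();
    }

    // ObjectOutput extends DataOutput, so the stream can be passed through directly.
    @Override
    public void writeExternal(ObjectOutput out) throws IOException { write(out); }

    @Override
    public void readExternal(ObjectInput in) throws IOException { readFields(in); }

    // Serializes and deserializes an instance through Java object streams.
    static ExternalizableSketch roundTrip(ExternalizableSketch d) {
        try {
            ByteArrayOutputStream bytes = new ByteArrayOutputStream();
            try (ObjectOutputStream oos = new ObjectOutputStream(bytes)) {
                oos.writeObject(d);
            }
            try (ObjectInputStream ois = new ObjectInputStream(new ByteArrayInputStream(bytes.toByteArray()))) {
                return (ExternalizableSketch) ois.readObject();
            }
        } catch (IOException | ClassNotFoundException e) {
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        ExternalizableSketch copy = roundTrip(new ExternalizableSketch(new String[] {"red", "green"}));
        System.out.println(Arrays.toString(copy.tokens())); // prints "[red, green]"
    }
}
```

Delegating to a single write/readFields pair keeps one serialization format for both code paths, which is the design choice the method descriptions above point to.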