Class TokenizerBuilderWhitespaceSplit
- java.lang.Object
-
- org.apache.sysds.runtime.transform.tokenize.builder.TokenizerBuilder
-
- org.apache.sysds.runtime.transform.tokenize.builder.TokenizerBuilderWhitespaceSplit
-
- All Implemented Interfaces:
Serializable
- Direct Known Subclasses:
TokenizerBuilderNgram
public class TokenizerBuilderWhitespaceSplit extends TokenizerBuilder
- See Also:
- Serialized Form
-
-
Constructor Summary
Constructors Constructor Description TokenizerBuilderWhitespaceSplit(int[] idCols, int tokenizeCol, org.apache.wink.json4j.JSONObject params)
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description void
createInternalRepresentation(FrameBlock in, DocumentRepresentation[] internalRepresentation, int rowStart, int blk)
List<Token>
splitToTokens(String text)
-
Methods inherited from class org.apache.sysds.runtime.transform.tokenize.builder.TokenizerBuilder
createInternalRepresentation, getTasks
-
-
-
-
Field Detail
-
regex
public String regex
-
-
Method Detail
-
createInternalRepresentation
public void createInternalRepresentation(FrameBlock in, DocumentRepresentation[] internalRepresentation, int rowStart, int blk)
- Specified by:
createInternalRepresentation
in classTokenizerBuilder
-
-