Class IdentityDictionarySlice
- java.lang.Object
-
- org.apache.sysds.runtime.compress.colgroup.dictionary.ADictionary
-
- org.apache.sysds.runtime.compress.colgroup.dictionary.IdentityDictionary
-
- org.apache.sysds.runtime.compress.colgroup.dictionary.IdentityDictionarySlice
-
- All Implemented Interfaces:
Serializable
,IDictionary
public class IdentityDictionarySlice extends IdentityDictionary
- See Also:
- Serialized Form
-
-
Nested Class Summary
-
Nested classes/interfaces inherited from interface org.apache.sysds.runtime.compress.colgroup.dictionary.IDictionary
IDictionary.DictType
-
-
Field Summary
-
Fields inherited from interface org.apache.sysds.runtime.compress.colgroup.dictionary.IDictionary
LOG
-
-
Constructor Summary
Constructors Constructor Description IdentityDictionarySlice(int nRowCol, boolean withEmpty, int l, int u)
Create a Identity matrix dictionary slice.
-
Method Summary
All Methods Static Methods Instance Methods Concrete Methods Modifier and Type Method Description void
addToEntry(double[] v, int fr, int to, int nCol, int rep)
Adds the dictionary entry from this dictionary to the v dictionary rep times.void
addToEntryVectorized(double[] v, int f1, int f2, int f3, int f4, int f5, int f6, int f7, int f8, int t1, int t2, int t3, int t4, int t5, int t6, int t7, int t8, int nCol)
Vectorized add to entry, this call helps with a bit of locality for the cache.void
aggregateCols(double[] c, Builtin fn, IColIndex colIndexes)
Aggregates the columns into the target double array provided.double[]
aggregateRows(Builtin fn, int nCol)
Aggregate all entries in the rows.IDictionary
clone()
Returns a deep clone of the dictionary.void
colProduct(double[] res, int[] counts, IColIndex colIndexes)
Calculate the column product of the dictionary weighted by counts.void
colSum(double[] c, int[] counts, IColIndex colIndexes)
Get the column sum of the values contained in the dictionaryvoid
colSumSq(double[] c, int[] counts, IColIndex colIndexes)
Get the column sum of the values contained in the dictionaryboolean
containsValue(double pattern)
Detect if the dictionary contains a specific value.boolean
equals(IDictionary o)
Indicate if the other dictionary is equal to this.IDictionary.DictType
getDictType()
Get the dictionary type this dictionary is.long
getExactSizeOnDisk()
Calculate the space consumption if the dictionary is stored on disk.long
getInMemorySize()
Returns the memory usage of the dictionary.static long
getInMemorySize(int numberColumns)
MatrixBlockDictionary
getMBDict(int nCol)
Get this dictionary as a MatrixBlock dictionary.long
getNumberNonZeros(int[] counts, int nCol)
Calculate the number of non zeros in the dictionary.double
getSparsity()
Get the sparsity of the dictionary.String
getString(int colIndexes)
Get a string representation of the dictionary, that considers the layout of the data.double
getValue(int i)
Get Specific value contained in the dictionary at index.double
getValue(int r, int c, int nCol)
Get Specific value contain in dictionary at index.double[]
getValues()
Get all the values contained in the dictionary as a linearized double array.IDictionary
preaggValuesFromDense(int numVals, IColIndex colIndexes, IColIndex aggregateColumns, double[] b, int cut)
Pre Aggregate values for Right Matrix Multiplication.double[]
productAllRowsToDouble(int nCol)
Method to product all rows to a column vector.double[]
productAllRowsToDoubleWithDefault(double[] defaultTuple)
Method to product all rows to a column vector with a default value added in the end.static IdentityDictionary
read(DataInput in)
IDictionary
replace(double pattern, double replace, int nCol)
Make a copy of the values, and replace all values that match pattern with replacement value.IDictionary
replaceWithReference(double pattern, double replace, double[] reference)
Make a copy of the values, and replace all values that match pattern with replacement value.IDictionary
scaleTuples(int[] scaling, int nCol)
Scale all tuples contained in the dictionary by the scaling factor given in the int list.IDictionary
sliceOutColumnRange(int idxStart, int idxEnd, int previousNumberOfColumns)
Modify the dictionary by removing columns not within the index range.double
sum(int[] counts, int ncol)
Get the sum of the values contained in the dictionarydouble[]
sumAllRowsToDouble(int nrColumns)
Method used as a pre-aggregate of each tuple in the dictionary, to single double values.double[]
sumAllRowsToDoubleSq(int nrColumns)
Method used as a pre-aggregate of each tuple in the dictionary, to single double values.double[]
sumAllRowsToDoubleWithDefault(double[] defaultTuple)
Do exactly the same as the sumAllRowsToDouble but also sum the array given to a extra index in the end of the array.double[]
sumAllRowsToDoubleWithReference(double[] reference)
Method used as a pre-aggregate of each tuple in the dictionary, to single double values with a reference.double
sumSq(int[] counts, int ncol)
Get the square sum of the values contained in the dictionaryString
toString()
void
write(DataOutput out)
Write the dictionary to a DataOutput.-
Methods inherited from class org.apache.sysds.runtime.compress.colgroup.dictionary.IdentityDictionary
addToEntry, aggregate, aggregateColsWithReference, aggregateRowsWithDefault, aggregateRowsWithReference, aggregateWithReference, applyScalarOp, applyScalarOpAndAppend, applyScalarOpWithReference, applyUnaryOp, applyUnaryOpAndAppend, applyUnaryOpWithReference, binOpLeft, binOpLeftAndAppend, binOpLeftWithReference, binOpRight, binOpRight, binOpRightAndAppend, binOpRightWithReference, cbind, centralMoment, centralMomentWithDefault, centralMomentWithReference, colProductWithReference, colSumSqWithReference, containsValueWithReference, getMBDict, getNumberNonZerosWithReference, getNumberOfValues, MMDict, MMDictDense, MMDictScaling, MMDictScalingDense, MMDictScalingSparse, MMDictSparse, multiplyScalar, product, productAllRowsToDoubleWithReference, productWithDefault, productWithReference, reorder, rexpandCols, rexpandColsWithReference, subtractTuple, sumAllRowsToDoubleSqWithDefault, sumAllRowsToDoubleSqWithReference, sumSqWithReference, TSMMToUpperTriangle, TSMMToUpperTriangleDense, TSMMToUpperTriangleDenseScaling, TSMMToUpperTriangleScaling, TSMMToUpperTriangleSparse, TSMMToUpperTriangleSparseScaling, TSMMWithScaling
-
Methods inherited from class org.apache.sysds.runtime.compress.colgroup.dictionary.ADictionary
centralMoment, centralMomentWithDefault, centralMomentWithReference, correctNan, doubleToString, equals, equals
-
-
-
-
Constructor Detail
-
IdentityDictionarySlice
public IdentityDictionarySlice(int nRowCol, boolean withEmpty, int l, int u)
Create a Identity matrix dictionary slice. It behaves as if allocated a Sparse Matrix block but exploits that the structure is known to have certain properties.- Parameters:
nRowCol
- the number of rows and columns in this identity matrix.withEmpty
- If the matrix should contain an empty row in the end.l
- the index lower to start atu
- the index upper to end at (not inclusive)
-
-
Method Detail
-
getValues
public double[] getValues()
Description copied from interface:IDictionary
Get all the values contained in the dictionary as a linearized double array.- Specified by:
getValues
in interfaceIDictionary
- Overrides:
getValues
in classIdentityDictionary
- Returns:
- linearized double array
-
getValue
public double getValue(int i)
Description copied from interface:IDictionary
Get Specific value contained in the dictionary at index.- Specified by:
getValue
in interfaceIDictionary
- Overrides:
getValue
in classIdentityDictionary
- Parameters:
i
- The index to extract the value from- Returns:
- The value contained at the index
-
getValue
public final double getValue(int r, int c, int nCol)
Description copied from interface:IDictionary
Get Specific value contain in dictionary at index.- Specified by:
getValue
in interfaceIDictionary
- Overrides:
getValue
in classIdentityDictionary
- Parameters:
r
- Row targetc
- Col targetnCol
- nCol in dictionary- Returns:
- value
-
getInMemorySize
public long getInMemorySize()
Description copied from interface:IDictionary
Returns the memory usage of the dictionary.- Specified by:
getInMemorySize
in interfaceIDictionary
- Overrides:
getInMemorySize
in classIdentityDictionary
- Returns:
- a long value in number of bytes for the dictionary.
-
getInMemorySize
public static long getInMemorySize(int numberColumns)
-
aggregateRows
public double[] aggregateRows(Builtin fn, int nCol)
Description copied from interface:IDictionary
Aggregate all entries in the rows.- Specified by:
aggregateRows
in interfaceIDictionary
- Overrides:
aggregateRows
in classIdentityDictionary
- Parameters:
fn
- The aggregate functionnCol
- The number of columns contained in the dictionary.- Returns:
- Aggregates for this dictionary tuples.
-
aggregateCols
public void aggregateCols(double[] c, Builtin fn, IColIndex colIndexes)
Description copied from interface:IDictionary
Aggregates the columns into the target double array provided.- Specified by:
aggregateCols
in interfaceIDictionary
- Overrides:
aggregateCols
in classIdentityDictionary
- Parameters:
c
- The target double array, this contains the full number of columns, therefore the colIndexes for this specific dictionary is needed.fn
- The function to apply to individual columnscolIndexes
- The mapping to the target columns from the individual columns
-
clone
public IDictionary clone()
Description copied from interface:IDictionary
Returns a deep clone of the dictionary.- Specified by:
clone
in interfaceIDictionary
- Overrides:
clone
in classIdentityDictionary
- Returns:
- A deep clone
-
getDictType
public IDictionary.DictType getDictType()
Description copied from interface:IDictionary
Get the dictionary type this dictionary is.- Specified by:
getDictType
in interfaceIDictionary
- Overrides:
getDictType
in classIdentityDictionary
- Returns:
- The Dictionary type this is.
-
sumAllRowsToDouble
public double[] sumAllRowsToDouble(int nrColumns)
Description copied from interface:IDictionary
Method used as a pre-aggregate of each tuple in the dictionary, to single double values. Note if the number of columns is one the actual dictionaries values are simply returned.- Specified by:
sumAllRowsToDouble
in interfaceIDictionary
- Overrides:
sumAllRowsToDouble
in classIdentityDictionary
- Parameters:
nrColumns
- The number of columns in the ColGroup to know how to get the values from the dictionary.- Returns:
- a double array containing the row sums from this dictionary.
-
sumAllRowsToDoubleWithDefault
public double[] sumAllRowsToDoubleWithDefault(double[] defaultTuple)
Description copied from interface:IDictionary
Do exactly the same as the sumAllRowsToDouble but also sum the array given to a extra index in the end of the array.- Specified by:
sumAllRowsToDoubleWithDefault
in interfaceIDictionary
- Overrides:
sumAllRowsToDoubleWithDefault
in classIdentityDictionary
- Parameters:
defaultTuple
- The default row to sum in the end index returned.- Returns:
- a double array containing the row sums from this dictionary.
-
sumAllRowsToDoubleWithReference
public double[] sumAllRowsToDoubleWithReference(double[] reference)
Description copied from interface:IDictionary
Method used as a pre-aggregate of each tuple in the dictionary, to single double values with a reference.- Specified by:
sumAllRowsToDoubleWithReference
in interfaceIDictionary
- Overrides:
sumAllRowsToDoubleWithReference
in classIdentityDictionary
- Parameters:
reference
- The reference values to add to each cell.- Returns:
- a double array containing the row sums from this dictionary.
-
sumAllRowsToDoubleSq
public double[] sumAllRowsToDoubleSq(int nrColumns)
Description copied from interface:IDictionary
Method used as a pre-aggregate of each tuple in the dictionary, to single double values. Note if the number of columns is one the actual dictionaries values are simply returned.- Specified by:
sumAllRowsToDoubleSq
in interfaceIDictionary
- Overrides:
sumAllRowsToDoubleSq
in classIdentityDictionary
- Parameters:
nrColumns
- The number of columns in the ColGroup to know how to get the values from the dictionary.- Returns:
- a double array containing the row sums from this dictionary.
-
productAllRowsToDouble
public double[] productAllRowsToDouble(int nCol)
Description copied from interface:IDictionary
Method to product all rows to a column vector.- Specified by:
productAllRowsToDouble
in interfaceIDictionary
- Overrides:
productAllRowsToDouble
in classIdentityDictionary
- Parameters:
nCol
- The number of columns in the ColGroup to know how to get the values from the dictionary.- Returns:
- A row product
-
productAllRowsToDoubleWithDefault
public double[] productAllRowsToDoubleWithDefault(double[] defaultTuple)
Description copied from interface:IDictionary
Method to product all rows to a column vector with a default value added in the end.- Specified by:
productAllRowsToDoubleWithDefault
in interfaceIDictionary
- Overrides:
productAllRowsToDoubleWithDefault
in classIdentityDictionary
- Parameters:
defaultTuple
- The default row that aggregate to last cell- Returns:
- A row product
-
colSum
public void colSum(double[] c, int[] counts, IColIndex colIndexes)
Description copied from interface:IDictionary
Get the column sum of the values contained in the dictionary- Specified by:
colSum
in interfaceIDictionary
- Overrides:
colSum
in classIdentityDictionary
- Parameters:
c
- The output array allocated to contain all column groups output.counts
- The counts of the individual tuples.colIndexes
- The columns indexes of the parent column group, this indicate where to put the column sum into the c output.
-
colSumSq
public void colSumSq(double[] c, int[] counts, IColIndex colIndexes)
Description copied from interface:IDictionary
Get the column sum of the values contained in the dictionary- Specified by:
colSumSq
in interfaceIDictionary
- Overrides:
colSumSq
in classIdentityDictionary
- Parameters:
c
- The output array allocated to contain all column groups output.counts
- The counts of the individual tuples.colIndexes
- The columns indexes of the parent column group, this indicate where to put the column sum into the c output.
-
colProduct
public void colProduct(double[] res, int[] counts, IColIndex colIndexes)
Description copied from interface:IDictionary
Calculate the column product of the dictionary weighted by counts.- Specified by:
colProduct
in interfaceIDictionary
- Overrides:
colProduct
in classIdentityDictionary
- Parameters:
res
- The result vector to put the result intocounts
- The weighted count of individual tuplescolIndexes
- The column indexes.
-
sum
public double sum(int[] counts, int ncol)
Description copied from interface:IDictionary
Get the sum of the values contained in the dictionary- Specified by:
sum
in interfaceIDictionary
- Overrides:
sum
in classIdentityDictionary
- Parameters:
counts
- The counts of the individual tuplesncol
- The number of columns contained- Returns:
- The sum scaled by the counts provided.
-
sumSq
public double sumSq(int[] counts, int ncol)
Description copied from interface:IDictionary
Get the square sum of the values contained in the dictionary- Specified by:
sumSq
in interfaceIDictionary
- Overrides:
sumSq
in classIdentityDictionary
- Parameters:
counts
- The counts of the individual tuplesncol
- The number of columns contained- Returns:
- The square sum scaled by the counts provided.
-
sliceOutColumnRange
public IDictionary sliceOutColumnRange(int idxStart, int idxEnd, int previousNumberOfColumns)
Description copied from interface:IDictionary
Modify the dictionary by removing columns not within the index range.- Specified by:
sliceOutColumnRange
in interfaceIDictionary
- Overrides:
sliceOutColumnRange
in classIdentityDictionary
- Parameters:
idxStart
- The column index to start at.idxEnd
- The column index to end at (not inclusive)previousNumberOfColumns
- The number of columns contained in the dictionary.- Returns:
- A dictionary containing the sliced out columns values only.
-
containsValue
public boolean containsValue(double pattern)
Description copied from interface:IDictionary
Detect if the dictionary contains a specific value.- Specified by:
containsValue
in interfaceIDictionary
- Overrides:
containsValue
in classIdentityDictionary
- Parameters:
pattern
- The value to search for- Returns:
- true if the value is contained else false.
-
getNumberNonZeros
public long getNumberNonZeros(int[] counts, int nCol)
Description copied from interface:IDictionary
Calculate the number of non zeros in the dictionary. The number of non zeros should be scaled with the counts given. This gives the exact number of non zero values in the parent column group.- Specified by:
getNumberNonZeros
in interfaceIDictionary
- Overrides:
getNumberNonZeros
in classIdentityDictionary
- Parameters:
counts
- The counts of each dictionary entrynCol
- The number of columns in this dictionary- Returns:
- The nonZero count
-
getMBDict
public MatrixBlockDictionary getMBDict(int nCol)
Description copied from interface:IDictionary
Get this dictionary as a MatrixBlock dictionary. This allows us to use optimized kernels coded elsewhere in the system, such as matrix multiplication. Return null if the matrix is empty.- Specified by:
getMBDict
in interfaceIDictionary
- Overrides:
getMBDict
in classIdentityDictionary
- Parameters:
nCol
- The number of columns contained in this column group.- Returns:
- A Dictionary containing a MatrixBlock.
-
getString
public String getString(int colIndexes)
Description copied from interface:IDictionary
Get a string representation of the dictionary, that considers the layout of the data.- Specified by:
getString
in interfaceIDictionary
- Overrides:
getString
in classIdentityDictionary
- Parameters:
colIndexes
- The number of columns in the dictionary.- Returns:
- A string that is nicer to print.
-
toString
public String toString()
- Overrides:
toString
in classIdentityDictionary
-
scaleTuples
public IDictionary scaleTuples(int[] scaling, int nCol)
Description copied from interface:IDictionary
Scale all tuples contained in the dictionary by the scaling factor given in the int list.- Specified by:
scaleTuples
in interfaceIDictionary
- Overrides:
scaleTuples
in classIdentityDictionary
- Parameters:
scaling
- The amount to multiply the given tuples withnCol
- The number of columns contained in this column group.- Returns:
- A New dictionary (since we don't want to modify the underlying dictionary)
-
write
public void write(DataOutput out) throws IOException
Description copied from interface:IDictionary
Write the dictionary to a DataOutput.- Specified by:
write
in interfaceIDictionary
- Overrides:
write
in classIdentityDictionary
- Parameters:
out
- the output sink to write the dictionary to.- Throws:
IOException
- if the sink fails.
-
read
public static IdentityDictionary read(DataInput in) throws IOException
- Throws:
IOException
-
getExactSizeOnDisk
public long getExactSizeOnDisk()
Description copied from interface:IDictionary
Calculate the space consumption if the dictionary is stored on disk.- Specified by:
getExactSizeOnDisk
in interfaceIDictionary
- Overrides:
getExactSizeOnDisk
in classIdentityDictionary
- Returns:
- the long count of bytes to store the dictionary.
-
replace
public IDictionary replace(double pattern, double replace, int nCol)
Description copied from interface:IDictionary
Make a copy of the values, and replace all values that match pattern with replacement value. If needed add a new column index.- Specified by:
replace
in interfaceIDictionary
- Overrides:
replace
in classIdentityDictionary
- Parameters:
pattern
- The value to look forreplace
- The value to replace the other value withnCol
- The number of columns contained in the dictionary.- Returns:
- A new Column Group, reusing the index structure but with new values.
-
replaceWithReference
public IDictionary replaceWithReference(double pattern, double replace, double[] reference)
Description copied from interface:IDictionary
Make a copy of the values, and replace all values that match pattern with replacement value. If needed add a new column index. With reference such that each value in the dict is considered offset by the values contained in the reference.- Specified by:
replaceWithReference
in interfaceIDictionary
- Overrides:
replaceWithReference
in classIdentityDictionary
- Parameters:
pattern
- The value to look forreplace
- The value to replace the other value withreference
- The reference tuple to add to all entries when replacing- Returns:
- A new Column Group, reusing the index structure but with new values.
-
getSparsity
public double getSparsity()
Description copied from interface:IDictionary
Get the sparsity of the dictionary.- Specified by:
getSparsity
in interfaceIDictionary
- Overrides:
getSparsity
in classIdentityDictionary
- Returns:
- a sparsity between 0 and 1
-
preaggValuesFromDense
public IDictionary preaggValuesFromDense(int numVals, IColIndex colIndexes, IColIndex aggregateColumns, double[] b, int cut)
Description copied from interface:IDictionary
Pre Aggregate values for Right Matrix Multiplication.- Specified by:
preaggValuesFromDense
in interfaceIDictionary
- Overrides:
preaggValuesFromDense
in classIdentityDictionary
- Parameters:
numVals
- The number of values contained in this dictionarycolIndexes
- The column indexes that is associated with the parent column groupaggregateColumns
- The column to aggregate, this is preprocessed, to find remove consideration for empty columnsb
- The values in the right hand side matrixcut
- The number of columns in b.- Returns:
- A new dictionary with the pre aggregated values.
-
addToEntryVectorized
public void addToEntryVectorized(double[] v, int f1, int f2, int f3, int f4, int f5, int f6, int f7, int f8, int t1, int t2, int t3, int t4, int t5, int t6, int t7, int t8, int nCol)
Description copied from interface:IDictionary
Vectorized add to entry, this call helps with a bit of locality for the cache.- Specified by:
addToEntryVectorized
in interfaceIDictionary
- Overrides:
addToEntryVectorized
in classIdentityDictionary
- Parameters:
v
- The target dictionary (dense double array)f1
- From index 1f2
- From index 2f3
- From index 3f4
- From index 4f5
- From index 5f6
- From index 6f7
- From index 7f8
- From index 8t1
- To index 1t2
- To index 2t3
- To index 3t4
- To index 4t5
- To index 5t6
- To index 6t7
- To index 7t8
- To index 8nCol
- Number of columns in the dictionary
-
addToEntry
public void addToEntry(double[] v, int fr, int to, int nCol, int rep)
Description copied from interface:IDictionary
Adds the dictionary entry from this dictionary to the v dictionary rep times.- Specified by:
addToEntry
in interfaceIDictionary
- Overrides:
addToEntry
in classIdentityDictionary
- Parameters:
v
- The target dictionary (dense double array)fr
- The from index is the tuple index to copy from.to
- The to index is the row index to copy into.nCol
- The number of columns in both casesrep
- The number of repetitions to apply (simply multiply do not loop)
-
equals
public boolean equals(IDictionary o)
Description copied from interface:IDictionary
Indicate if the other dictionary is equal to this.- Specified by:
equals
in interfaceIDictionary
- Overrides:
equals
in classIdentityDictionary
- Parameters:
o
- The other object- Returns:
- If it is equal
-
-