Quantize (JavaCPP Presets for TensorFlow 1.15.5-1.5.8 API)

java.lang.Object
- org.tensorflow.op.PrimitiveOp
- - org.tensorflow.op.quantization.Quantize<T>

Type Parameters:

T - data type of the tensor returned by output()

All Implemented Interfaces:

Op
```
@Operator(group="quantization")
public final class Quantize<T>
extends PrimitiveOp
```
Quantize the 'input' tensor of type float to 'output' tensor of type 'T'.
[min_range, max_range] are scalar floats that specify the range for the 'input' data. The 'mode' attribute controls exactly which calculations are used to convert the float values to their quantized equivalents. The 'round_mode' attribute controls which rounding tie-breaking algorithm is used when rounding float values to their quantized equivalents.
In 'MIN_COMBINED' mode, each value of the tensor will undergo the following:
```
 out[i] = (in[i] - min_range) * range(T) / (max_range - min_range)
 if T == qint8: out[i] -= (range(T) + 1) / 2.0
 
```
Here, `range(T) = numeric_limits<T>::max() - numeric_limits<T>::min()`.
MIN_COMBINED Mode Example
Assume the input is type float and has a possible range of [0.0, 6.0] and the output type is quint8 ([0, 255]). The min_range and max_range values should be specified as 0.0 and 6.0. Quantizing from float to quint8 will multiply each value of the input by 255/6 and cast to quint8.
If the output type is qint8 ([-128, 127]), the operation additionally subtracts 128 from each value prior to casting, so that the range of values aligns with the range of qint8.
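The MIN_COMBINED arithmetic above can be sketched in plain Java. This is only an illustration of the formula for 8-bit output types, not the operator's implementation: the class and method names are hypothetical, and the sketch returns the value before the final cast to `T`, since the text does not say how that cast rounds.

```java
// Hypothetical helper illustrating the MIN_COMBINED formula for 8-bit types.
final class MinCombinedSketch {
    // range(T) = numeric_limits<T>::max() - numeric_limits<T>::min(),
    // which is 255 for both quint8 and qint8.
    static final double RANGE_8BIT = 255.0;

    // Returns the value before the final cast to T.
    static double minCombined(double in, double minRange, double maxRange, boolean isQint8) {
        double out = (in - minRange) * RANGE_8BIT / (maxRange - minRange);
        if (isQint8) {
            // Shift by (range(T) + 1) / 2 = 128 so the result aligns with [-128, 127].
            out -= (RANGE_8BIT + 1) / 2.0;
        }
        return out;
    }

    public static void main(String[] args) {
        // Input range [0.0, 6.0], as in the example above.
        System.out.println(minCombined(0.0, 0.0, 6.0, false)); // 0.0
        System.out.println(minCombined(6.0, 0.0, 6.0, false)); // 255.0
        System.out.println(minCombined(0.0, 0.0, 6.0, true));  // -128.0
        System.out.println(minCombined(6.0, 0.0, 6.0, true));  // 127.0
    }
}
```

The endpoints of the input range land exactly on the endpoints of the quantized type, which is the property the example above describes.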
If the mode is 'MIN_FIRST', then this approach is used:
```
 num_discrete_values = 1 << (# of bits in T)
 range_adjust = num_discrete_values / (num_discrete_values - 1)
 range = (max_range - min_range) * range_adjust
 range_scale = num_discrete_values / range
 quantized = round(input * range_scale) - round(min_range * range_scale) +
   numeric_limits<T>::min()
 quantized = max(quantized, numeric_limits<T>::min())
 quantized = min(quantized, numeric_limits<T>::max())
 
```
The biggest difference between this and MIN_COMBINED is that the minimum range is rounded first, before it is subtracted from the rounded value. MIN_COMBINED introduces a small bias, so repeated iterations of quantizing and dequantizing accumulate a larger and larger error.
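The MIN_FIRST steps can be followed line by line in a short Java sketch. The class name, method name, and parameters are hypothetical. The sketch assumes the divisions are floating-point and uses `Math.round`, which rounds ties toward positive infinity; the operator's tie-breaking is governed by 'round_mode' and may differ.

```java
// Hypothetical helper illustrating the MIN_FIRST formula.
final class MinFirstSketch {
    static long minFirst(double input, double minRange, double maxRange,
                         int numBits, long typeMin, long typeMax) {
        double numDiscreteValues = (double) (1L << numBits);
        // Floating-point division is assumed here; integer division would give 1.
        double rangeAdjust = numDiscreteValues / (numDiscreteValues - 1.0);
        double range = (maxRange - minRange) * rangeAdjust;
        double rangeScale = numDiscreteValues / range;
        // The input and the minimum range are rounded separately, then subtracted.
        long quantized = Math.round(input * rangeScale)
                - Math.round(minRange * rangeScale) + typeMin;
        // Clamp to the representable range of the output type.
        quantized = Math.max(quantized, typeMin);
        quantized = Math.min(quantized, typeMax);
        return quantized;
    }

    public static void main(String[] args) {
        // quint8 output ([0, 255]) with an input range of [0.0, 6.0].
        System.out.println(minFirst(0.0, 0.0, 6.0, 8, 0, 255));  // 0
        System.out.println(minFirst(6.0, 0.0, 6.0, 8, 0, 255));  // 255
        System.out.println(minFirst(7.0, 0.0, 6.0, 8, 0, 255));  // 255 (clamped)
        System.out.println(minFirst(-1.0, 0.0, 6.0, 8, 0, 255)); // 0 (clamped)
    }
}
```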
SCALED Mode Example
`SCALED` mode matches the quantization approach used in `QuantizeAndDequantize{V2|V3}`.
If the mode is `SCALED`, we do not use the full range of the output type. Instead, we drop the lowest possible value for symmetry (e.g., the output range is -127 to 127, not -128 to 127, for signed 8-bit quantization), so that 0.0 maps to 0.
We first find the range of values in our tensor. The range we use is always centered on 0, so we find m such that
```
   m = max(abs(input_min), abs(input_max))
 
```
Our input tensor range is then `[-m, m]`.
Next, we choose our fixed-point quantization buckets, `[min_fixed, max_fixed]`. If T is signed, this is
```
   num_bits = sizeof(T) * 8
   [min_fixed, max_fixed] =
       [-((1 << (num_bits - 1)) - 1), (1 << (num_bits - 1)) - 1]
 
```
Otherwise, if T is unsigned, the fixed-point range is
```
   [min_fixed, max_fixed] = [0, (1 << num_bits) - 1]
 
```
From this we compute our scaling factor, s:
```
   s = (max_fixed - min_fixed) / (2 * m)
 
```
Now we can quantize the elements of our tensor:
```
 result = round(input * s)
 
```
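Putting the SCALED steps together for a signed type gives the following Java sketch. The class and method names are hypothetical, the input is assumed to lie within `[-m, m]`, and `Math.round` (ties toward positive infinity) stands in for whatever tie-breaking 'round_mode' selects.

```java
// Hypothetical helper illustrating SCALED mode for a signed output type.
final class ScaledSketch {
    static long scaledSigned(double input, double inputMin, double inputMax, int numBits) {
        // The range is centered on 0: [-m, m].
        double m = Math.max(Math.abs(inputMin), Math.abs(inputMax));
        // Symmetric fixed-point range, e.g. [-127, 127] for 8 bits.
        long maxFixed = (1L << (numBits - 1)) - 1;
        long minFixed = -maxFixed;
        // Scaling factor s = (max_fixed - min_fixed) / (2 * m).
        double s = (maxFixed - minFixed) / (2.0 * m);
        return Math.round(input * s);
    }

    public static void main(String[] args) {
        // Input range [-1.0, 0.5] gives m = 1.0 and s = 127.0 for 8 bits.
        System.out.println(scaledSigned(1.0, -1.0, 0.5, 8));  // 127
        System.out.println(scaledSigned(-1.0, -1.0, 0.5, 8)); // -127
        System.out.println(scaledSigned(0.0, -1.0, 0.5, 8));  // 0
    }
}
```

Because the fixed-point range is symmetric, 0.0 always maps to 0 regardless of `m`.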
Note that the operator may adjust the requested minimum and maximum values slightly during quantization, so always use the output ports (`outputMin()` and `outputMax()`) as the range for further calculations. For example, if the requested minimum and maximum values are close to equal, they are separated by a small epsilon value to prevent ill-formed quantized buffers from being created. Otherwise, you can end up with buffers where all the quantized values map to the same float value, which causes problems for operations that have to perform further calculations on them.

Nested Class Summary

Nested Classes
Modifier and Type	Class and Description
`static class`	`Quantize.Options` Optional attributes for `Quantize`

Field Summary
- Fields inherited from class org.tensorflow.op.PrimitiveOp
  operation

Method Summary

Modifier and Type	Method and Description
`static <T> Quantize<T>`	`create(Scope scope, Operand<Float> input, Operand<Float> minRange, Operand<Float> maxRange, Class<T> T, Quantize.Options... options)` Factory method to create a class wrapping a new Quantize operation.
`static Quantize.Options`	`mode(String mode)`
`Output<T>`	`output()` The quantized data produced from the float input.
`Output<Float>`	`outputMax()` The actual maximum scalar value used for the output.
`Output<Float>`	`outputMin()` The actual minimum scalar value used for the output.
`static Quantize.Options`	`roundMode(String roundMode)`

Methods inherited from class org.tensorflow.op.PrimitiveOp
equals, hashCode, op, toString

Methods inherited from class java.lang.Object
clone, finalize, getClass, notify, notifyAll, wait, wait, wait

Method Detail

create

public static <T> Quantize<T> create(Scope scope,
                                     Operand<Float> input,
                                     Operand<Float> minRange,
                                     Operand<Float> maxRange,
                                     Class<T> T,
                                     Quantize.Options... options)

Factory method to create a class wrapping a new Quantize operation.

Parameters:
scope - current scope
input -
minRange - The minimum scalar value possibly produced for the input.
maxRange - The maximum scalar value possibly produced for the input.
T -
options - carries optional attribute values

Returns:
a new instance of Quantize

mode

public static Quantize.Options mode(String mode)

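Parameters:
mode - the quantization mode described in the class documentation: 'MIN_COMBINED', 'MIN_FIRST', or 'SCALED'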

roundMode

public static Quantize.Options roundMode(String roundMode)

Parameters:
roundMode - the rounding tie-breaking algorithm used when rounding float values to their quantized equivalents

output
```
public Output<T> output()
```
The quantized data produced from the float input.

outputMin
```
public Output<Float> outputMin()
```
The actual minimum scalar value used for the output.

outputMax
```
public Output<Float> outputMax()
```
The actual maximum scalar value used for the output.
