@Operator(group="train") public final class ResourceSparseApplyProximalAdagrad extends PrimitiveOp
That is, for rows we have grad for, we update var and accum as follows:

    accum += grad * grad
    prox_v = var
    prox_v -= lr * grad * (1 / sqrt(accum))
    var = sign(prox_v) / (1 + lr * l2) * max{|prox_v| - lr * l1, 0}
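The update rule can be sketched in plain Java on ordinary arrays, which makes the role of indices easier to see. This is only an illustration, not the TensorFlow kernel: the class `ProximalAdagradSketch`, the method `sparseApply`, and the use of `double[][]` rows in place of tensors are all assumptions made for the example. Each entry of `grad` is taken to be the gradient for the row of var and accum named by the matching entry of `indices`.

```java
import java.util.Arrays;

// Hypothetical helper class for illustration; not part of the TensorFlow API.
class ProximalAdagradSketch {

    // Updates only the rows of var and accum selected by indices.
    // grad[i] holds the gradient for row indices[i].
    // Assumes accum is positive after the accumulation step; a zero gradient
    // on a zero accumulator would divide 0 by 0.
    static void sparseApply(double[][] var, double[][] accum,
                            double lr, double l1, double l2,
                            double[][] grad, int[] indices) {
        for (int i = 0; i < indices.length; i++) {
            int row = indices[i];
            for (int j = 0; j < var[row].length; j++) {
                double g = grad[i][j];
                // accum += grad * grad
                accum[row][j] += g * g;
                // prox_v = var - lr * grad * (1 / sqrt(accum))
                double proxV = var[row][j] - lr * g / Math.sqrt(accum[row][j]);
                // max{|prox_v| - lr * l1, 0}
                double shrunk = Math.max(Math.abs(proxV) - lr * l1, 0.0);
                // var = sign(prox_v) / (1 + lr * l2) * shrunk
                var[row][j] = Math.signum(proxV) / (1.0 + lr * l2) * shrunk;
            }
        }
    }

    public static void main(String[] args) {
        double[][] var = {{1.0, 1.0}, {1.0, 1.0}, {1.0, 1.0}};
        double[][] accum = {{0.0, 0.0}, {0.0, 0.0}, {0.0, 0.0}};
        double[][] grad = {{2.0, -2.0}};   // one gradient row
        int[] indices = {1};               // applied to row 1 only
        sparseApply(var, accum, 0.5, 0.1, 0.0, grad, indices);
        // Rows 0 and 2 are left untouched; row 1 is updated in place.
        System.out.println(Arrays.deepToString(var));
    }
}
```

With these inputs, row 1 moves to roughly 0.45 and 1.45 while the other rows keep their initial value. A large enough l1 drives an updated entry to exactly zero, which is the sparsity-inducing effect of the max term.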
Modifier and Type | Class and Description |
---|---|
static class | ResourceSparseApplyProximalAdagrad.Options Optional attributes for ResourceSparseApplyProximalAdagrad operation |

Modifier and Type | Method and Description |
---|---|
static <T,U extends Number> ResourceSparseApplyProximalAdagrad | create(Scope scope, Operand<?> var, Operand<?> accum, Operand<T> lr, Operand<T> l1, Operand<T> l2, Operand<T> grad, Operand<U> indices, ResourceSparseApplyProximalAdagrad.Options... options) Factory method to create a class wrapping a new ResourceSparseApplyProximalAdagrad operation. |
static ResourceSparseApplyProximalAdagrad.Options | useLocking(Boolean useLocking) |

Methods inherited from class PrimitiveOp: equals, hashCode, op, toString

public static <T,U extends Number> ResourceSparseApplyProximalAdagrad create(Scope scope, Operand<?> var, Operand<?> accum, Operand<T> lr, Operand<T> l1, Operand<T> l2, Operand<T> grad, Operand<U> indices, ResourceSparseApplyProximalAdagrad.Options... options)
scope - current scope
var - Should be from a Variable().
accum - Should be from a Variable().
lr - Learning rate. Must be a scalar.
l1 - L1 regularization. Must be a scalar.
l2 - L2 regularization. Must be a scalar.
grad - The gradient.
indices - A vector of indices into the first dimension of var and accum.
options - carries optional attribute values

public static ResourceSparseApplyProximalAdagrad.Options useLocking(Boolean useLocking)
useLocking - If True, updating of the var and accum tensors will be protected by a lock; otherwise the behavior is undefined, but may exhibit less contention.

Copyright © 2022. All rights reserved.