@Namespace(value="torch::nn") @NoOffset @Properties(inherit=torch.class) public class TransformerImpl extends TransformerImplCloneable
A transformer model. See the documentation for the torch::nn::Transformer class to learn what constructor arguments are supported for this transformer model.
Example:
Transformer trans(TransformerOptions(512, 8));
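The C++ example above translates directly to the Java bindings. A minimal sketch, assuming the org.bytedeco.pytorch presets and native libraries are on the classpath, and that the option order (d_model, nhead) mirrors the C++ constructor:

```java
import org.bytedeco.pytorch.TransformerImpl;
import org.bytedeco.pytorch.TransformerOptions;

public class TransformerConstructExample {
    public static void main(String[] args) {
        // d_model = 512, nhead = 8, mirroring the C++ example above
        TransformerOptions options = new TransformerOptions(512, 8);
        TransformerImpl trans = new TransformerImpl(options);
        // options() returns the TransformerOptions this module was constructed with
        System.out.println(trans.options().d_model().get());
    }
}
```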
Nested classes/interfaces inherited from class org.bytedeco.javacpp.Pointer:
Pointer.CustomDeallocator, Pointer.Deallocator, Pointer.NativeDeallocator, Pointer.ReferenceCounter
Constructor and Description |
---|
TransformerImpl(Module pointer) Downcast constructor. |
TransformerImpl(Pointer p) Pointer cast constructor. |
TransformerImpl(TransformerOptions options_) |
Modifier and Type | Method and Description |
---|---|
AnyModule | decoder() The decoder module. |
TransformerImpl | decoder(AnyModule setter) |
AnyModule | encoder() The encoder module. |
TransformerImpl | encoder(AnyModule setter) |
Tensor | forward(Tensor src, Tensor tgt) |
Tensor | forward(Tensor src, Tensor tgt, Tensor src_mask, Tensor tgt_mask, Tensor memory_mask, Tensor src_key_padding_mask, Tensor tgt_key_padding_mask, Tensor memory_key_padding_mask) Forward function for the Transformer module. Args: src: the sequence to the encoder (required). |
static Tensor | generate_square_subsequent_mask(long sz) Generate a square mask for the sequence. |
TransformerOptions | options() The options with which this Transformer was constructed. |
TransformerImpl | options(TransformerOptions setter) |
void | reset_parameters() |
void | reset() reset() must perform initialization of all members with reference semantics, most importantly parameters, buffers and submodules. |
Methods inherited from class org.bytedeco.pytorch.TransformerImplCloneable:
asModule, asModule, clone, clone

Methods inherited from class org.bytedeco.pytorch.Module:
apply, apply, apply, apply, apply, apply, apply, apply, buffers, buffers, children, eval, is_serializable, is_training, load, modules, modules, name, named_buffers, named_buffers, named_children, named_modules, named_modules, named_modules, named_parameters, named_parameters, parameters, parameters, pretty_print, register_buffer, register_buffer, register_module, register_module, register_parameter, register_parameter, register_parameter, register_parameter, save, shiftLeft, to, to, to, train, unregister_module, unregister_module, zero_grad

Methods inherited from class org.bytedeco.javacpp.Pointer:
address, asBuffer, asByteBuffer, availablePhysicalBytes, calloc, capacity, capacity, close, deallocate, deallocate, deallocateReferences, deallocator, deallocator, equals, fill, formatBytes, free, getDirectBufferAddress, getPointer, getPointer, getPointer, getPointer, hashCode, interruptDeallocatorThread, isNull, isNull, limit, limit, malloc, maxBytes, maxPhysicalBytes, memchr, memcmp, memcpy, memmove, memset, offsetAddress, offsetof, offsetof, parseBytes, physicalBytes, physicalBytesInaccurate, position, position, put, realloc, referenceCount, releaseReference, retainReference, setNull, sizeof, sizeof, toString, totalBytes, totalCount, totalPhysicalBytes, withDeallocator, zero
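Because TransformerImpl ultimately extends Module, the inherited training-mode methods listed above apply to it directly. A short sketch, assuming the native library is available (method names taken from the inherited list above):

```java
import org.bytedeco.pytorch.TransformerImpl;
import org.bytedeco.pytorch.TransformerOptions;

public class InheritedApiExample {
    public static void main(String[] args) {
        TransformerImpl trans = new TransformerImpl(new TransformerOptions(512, 8));
        trans.eval();                              // inherited from Module: inference mode
        System.out.println(trans.is_training());   // expected false after eval()
        trans.train();                             // back to training mode
        System.out.println(trans.is_training());   // expected true after train()
    }
}
```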
public TransformerImpl(Pointer p)
Pointer cast constructor. Invokes Pointer(Pointer).

public TransformerImpl(Module pointer)
Downcast constructor.

public TransformerImpl(@ByVal TransformerOptions options_)
@ByVal public Tensor forward(@Const @ByRef Tensor src, @Const @ByRef Tensor tgt, @Const @ByRef(nullValue="torch::Tensor{}") Tensor src_mask, @Const @ByRef(nullValue="torch::Tensor{}") Tensor tgt_mask, @Const @ByRef(nullValue="torch::Tensor{}") Tensor memory_mask, @Const @ByRef(nullValue="torch::Tensor{}") Tensor src_key_padding_mask, @Const @ByRef(nullValue="torch::Tensor{}") Tensor tgt_key_padding_mask, @Const @ByRef(nullValue="torch::Tensor{}") Tensor memory_key_padding_mask)
Shape:
src: (S, N, E)
tgt: (T, N, E)
src_mask: (S, S)
tgt_mask: (T, T)
memory_mask: (T, S)
src_key_padding_mask: (N, S)
tgt_key_padding_mask: (N, T)
memory_key_padding_mask: (N, S)
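The shape conventions above can be exercised with the two-argument forward overload. A hedged sketch for the Java bindings, assuming torch.randn and Tensor.shape() from the JavaCPP presets (here S = 10, T = 20, N = 4, E = 512):

```java
import org.bytedeco.pytorch.Tensor;
import org.bytedeco.pytorch.TransformerImpl;
import org.bytedeco.pytorch.TransformerOptions;
import org.bytedeco.pytorch.global.torch;

public class ForwardShapeExample {
    public static void main(String[] args) {
        long S = 10, T = 20, N = 4, E = 512;
        TransformerImpl trans = new TransformerImpl(new TransformerOptions(E, 8));
        Tensor src = torch.randn(S, N, E);   // (S, N, E)
        Tensor tgt = torch.randn(T, N, E);   // (T, N, E)
        Tensor out = trans.forward(src, tgt);
        // The output keeps the target length: (T, N, E)
        System.out.println(java.util.Arrays.toString(out.shape()));
    }
}
```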
Note:
[src/tgt/memory]_mask ensures that position i is allowed to attend the unmasked positions. If a ByteTensor is provided, the non-zero positions are not allowed to attend while the zero positions will be unchanged. If a BoolTensor is provided, positions with True are not allowed to attend while False values will be unchanged. If a FloatTensor is provided, it will be added to the attention weight.
[src/tgt/memory]_key_padding_mask provides specified elements in the key to be ignored by the attention. If a ByteTensor is provided, the non-zero positions will be ignored while the zero positions will be unchanged. If a BoolTensor is provided, the positions with the value of True will be ignored while the positions with the value of False will be unchanged.
output: (T, N, E)
Note:
Due to the multi-head attention architecture in the transformer model, the output sequence length of a transformer is the same as the input sequence (i.e. target) length of the decoder.
where
S is the source sequence length,
T is the target sequence length,
N is the batch size,
E is the feature number.

public void reset()
Description copied from class: TransformerImplCloneable
reset() must perform initialization of all members with reference semantics, most importantly parameters, buffers and submodules.
Specified by:
reset in class TransformerImplCloneable
public void reset_parameters()
@ByVal public static Tensor generate_square_subsequent_mask(@Cast(value="int64_t") long sz)
Generate a square mask for the sequence. The masked positions are filled with -inf in float type. Unmasked positions are filled with 0.0 in float type.
Note:
1. This function will always return a CPU tensor.
2. This function requires the platform to support IEEE 754, since -inf is guaranteed to be valid only when IEEE 754 is supported. If the platform doesn't support IEEE 754, this function will fill the mask with the smallest float number instead of -inf, and a one-time warning will pop up as well.

@ByRef public TransformerOptions options()
The options with which this Transformer was constructed.

public TransformerImpl options(TransformerOptions setter)
public TransformerImpl encoder(AnyModule setter)
public TransformerImpl decoder(AnyModule setter)
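A short sketch of generate_square_subsequent_mask, the static helper declared above. For sz = 3 the causal mask should be 0.0 on and below the diagonal and -inf above it (this example only prints the tensor; the expected pattern in the comment follows from the description above):

```java
import org.bytedeco.pytorch.Tensor;
import org.bytedeco.pytorch.TransformerImpl;

public class MaskExample {
    public static void main(String[] args) {
        // Row i may attend positions 0..i; later positions are masked with -inf:
        //   0 -inf -inf
        //   0    0 -inf
        //   0    0    0
        Tensor mask = TransformerImpl.generate_square_subsequent_mask(3);
        System.out.println(mask);
    }
}
```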
Copyright © 2024. All rights reserved.