AudioSpectrogram (JavaCPP Presets for TensorFlow 1.15.5-1.5.8 API)

java.lang.Object
- org.tensorflow.op.PrimitiveOp
- - org.tensorflow.op.audio.AudioSpectrogram

All Implemented Interfaces:

Op, Operand<Float>
```
@Operator(group="audio")
public final class AudioSpectrogram
extends PrimitiveOp
implements Operand<Float>
```
Produces a visualization of audio data over time.
Spectrograms are a standard way of representing audio information as a series of slices of frequency information, one slice for each window of time. By joining these together into a sequence, they form a distinctive fingerprint of the sound over time.
This op expects to receive audio data as an input, stored as floats in the range -1 to 1, together with a window width in samples, and a stride specifying how far to move the window between slices. From this it generates a three dimensional output. The first dimension is for the channels in the input, so a stereo audio input would have two here for example. The second dimension is time, with successive frequency slices. The third dimension has an amplitude value for each frequency during that time slice.
This means the layout when converted and saved as an image is rotated 90 degrees clockwise from a typical spectrogram. Time is descending down the Y axis, and the frequency decreases from left to right.
Each value in the result represents the square root of the sum of the real and imaginary parts of an FFT on the current window of samples. In this way, the lowest dimension represents the power of each frequency in the current window, and adjacent windows are concatenated in the next dimension.
To get a more intuitive and visual look at what this operation does, you can run tensorflow/examples/wav_to_spectrogram to read in an audio file and save out the resulting spectrogram as a PNG image.

Nested Class Summary

Nested Classes
Modifier and Type Class and Description

static class AudioSpectrogram.Options
Optional attributes for AudioSpectrogram

Nested Classes
Modifier and Type	Class and Description
`static class`	`AudioSpectrogram.Options` Optional attributes for `AudioSpectrogram`

Field Summary
- Fields inherited from class org.tensorflow.op.PrimitiveOp
  operation

Method Summary

All Methods Static Methods Instance Methods Concrete Methods
Modifier and Type	Method and Description
`Output<Float>`	`asOutput()` Returns the symbolic handle of a tensor.
`static AudioSpectrogram`	`create(Scope scope, Operand<Float> input, Long windowSize, Long stride, AudioSpectrogram.Options... options)` Factory method to create a class wrapping a new AudioSpectrogram operation.
`static AudioSpectrogram.Options`	`magnitudeSquared(Boolean magnitudeSquared)`
`Output<Float>`	`spectrogram()` 3D representation of the audio frequencies as an image.

Methods inherited from class org.tensorflow.op.PrimitiveOp
equals, hashCode, op, toString

Methods inherited from class java.lang.Object
clone, finalize, getClass, notify, notifyAll, wait, wait, wait

- Method Detail
  - create
```
public static AudioSpectrogram create(Scope scope,
                                      Operand<Float> input,
                                      Long windowSize,
                                      Long stride,
                                      AudioSpectrogram.Options... options)
```
    Factory method to create a class wrapping a new AudioSpectrogram operation.
    
    Parameters:
    
    scope - current scope
    
    input - Float representation of audio data.
    
    windowSize - How wide the input window is in samples. For the highest efficiency this should be a power of two, but other values are accepted.
    
    stride - How widely apart the center of adjacent sample windows should be.
    
    options - carries optional attributes values
    
    Returns:
    
    a new instance of AudioSpectrogram
  - magnitudeSquared
```
public static AudioSpectrogram.Options magnitudeSquared(Boolean magnitudeSquared)
```
    Parameters:
    
    magnitudeSquared - Whether to return the squared magnitude or just the magnitude. Using squared magnitude can avoid extra calculations.
  - spectrogram
```
public Output<Float> spectrogram()
```
    3D representation of the audio frequencies as an image.
  - asOutput
```
public Output<Float> asOutput()
```
    Description copied from interface: Operand
    
    Returns the symbolic handle of a tensor.
    Inputs to TensorFlow operations are outputs of another TensorFlow operation. This method is used to obtain a symbolic handle that represents the computation of the input.
    
    Specified by:
    
    asOutput in interface Operand<Float>
    
    See Also:
    
    OperationBuilder.addInput(Output)

Class AudioSpectrogram

Nested Class Summary

Field Summary

Fields inherited from class org.tensorflow.op.PrimitiveOp

Method Summary

Methods inherited from class org.tensorflow.op.PrimitiveOp

Methods inherited from class java.lang.Object

Method Detail

create

magnitudeSquared

spectrogram

asOutput