@Operator(group="audio") public final class Mfcc extends PrimitiveOp implements Operand<Float>
Mel Frequency Cepstral Coefficients are a way of representing audio data that's been effective as an input feature for machine learning. They are created by taking the spectrum of a spectrogram (a 'cepstrum'), and discarding some of the higher frequencies that are less significant to the human ear. They have a long history in the speech recognition world, and https://en.wikipedia.org/wiki/Mel-frequency_cepstrum is a good resource to learn more.
Modifier and Type | Class and Description |
---|---|
static class |
Mfcc.Options
Optional attributes for
Mfcc |
operation
Modifier and Type | Method and Description |
---|---|
Output<Float> |
asOutput()
Returns the symbolic handle of a tensor.
|
static Mfcc |
create(Scope scope,
Operand<Float> spectrogram,
Operand<Integer> sampleRate,
Mfcc.Options... options)
Factory method to create a class wrapping a new Mfcc operation.
|
static Mfcc.Options |
dctCoefficientCount(Long dctCoefficientCount) |
static Mfcc.Options |
filterbankChannelCount(Long filterbankChannelCount) |
static Mfcc.Options |
lowerFrequencyLimit(Float lowerFrequencyLimit) |
Output<Float> |
output() |
static Mfcc.Options |
upperFrequencyLimit(Float upperFrequencyLimit) |
equals, hashCode, op, toString
public static Mfcc create(Scope scope, Operand<Float> spectrogram, Operand<Integer> sampleRate, Mfcc.Options... options)
scope
- current scopespectrogram
- Typically produced by the Spectrogram op, with magnitude_squared
set to true.sampleRate
- How many samples per second the source audio used.options
- carries optional attributes valuespublic static Mfcc.Options upperFrequencyLimit(Float upperFrequencyLimit)
upperFrequencyLimit
- The highest frequency to use when calculating the
ceptstrum.public static Mfcc.Options lowerFrequencyLimit(Float lowerFrequencyLimit)
lowerFrequencyLimit
- The lowest frequency to use when calculating the
ceptstrum.public static Mfcc.Options filterbankChannelCount(Long filterbankChannelCount)
filterbankChannelCount
- Resolution of the Mel bank used internally.public static Mfcc.Options dctCoefficientCount(Long dctCoefficientCount)
dctCoefficientCount
- How many output channels to produce per time slice.public Output<Float> asOutput()
Operand
Inputs to TensorFlow operations are outputs of another TensorFlow operation. This method is used to obtain a symbolic handle that represents the computation of the input.
asOutput
in interface Operand<Float>
OperationBuilder.addInput(Output)
Copyright © 2022. All rights reserved.