vadnetPreprocess
Description
returns a mel spectrogram from the audio input that you can feed to the pretrained VAD
network returned by features
= vadnetPreprocess(audioIn
,fs
)audioPretrainedNetwork
.
Examples
Input Arguments
Output Arguments
Algorithms
The vadnetPreprocess
function preprocesses the audio data using the following
steps.
Resample the audio to 16kHz.
Compute a centered short-time Fourier transform (STFT) using a 25 ms periodic Hamming window and 10 ms hop length. Pad the signal so that the first window is centered at 0 s.
Convert the STFT to a power spectrogram.
Apply a mel filter bank with 40 bands to obtain a mel spectrogram.
Convert the mel spectrogram to a log scale.
Standardize each of the mel bands to have zero mean and standard deviation of 1.
Extended Capabilities
Version History
Introduced in R2023a