vocalpy.feature.sat

vocalpy.feature.sat#

vocalpy.feature.sat(sound: Sound, n_fft=400, hop_length=40, freq_range=0.5, min_freq: float = 380.0, amp_baseline: float = 70.0, max_F0: float = 1830.0, fmax_yin: float = 8000.0, trough_threshold: float = 0.1) → Features[source]#

Extract all features used to compute similarity with the Sound Analysis Toolbox for Matlab (SAT).

Parameters:

soundSound: A Sound. Multi-channel sounds are supported.
n_fftint: FFT window size.
hop_lengthint: Number of audio samples between adjacent STFT columns.
freq_rangefloat: Range of frequencies to use, given as a value between zero and one. Default is 0.5, which means “Use the first half of the frequencies, from zero to \(f_s/4\) (half the Nyquist frequency)”.
min_freqfloat: Minimum frequency to consider when extracting features.
amp_baselinefloat: The baseline value added, in decibels, to the amplitude feature. The default is 70.0 dB, the value used by SAT and SAP.
max_F0float: Maximum frequency to consider, that becomes the lowest quefrency used when computing goodness of pitch.
fmax_yinfloat: Maximum frequency in Hertz when computing pitch with YIN algorithm. Default is 8000.
trough_threshold: float: Absolute threshold for peak estimation. A float greater than 0. Used by pitch().

Returns:

featuresvocalpy.Features: vocalpy.Features instance with data attribute that is an xarray.Dataset, where the data variables are the features, and the coordinate is the time for each time bin.

vocalpy.feature.sat

Contents

vocalpy.feature.sat#