Audio Feature Extraction ======================== The following classes can be used to extract relevant features from audio. Audio feature extraction helps to transform raw audio data into a representation that is more representative of human perception. This representation is often used to provide a more meaningful comparison of audio files and can be used as input to machine learning approaches and to evaluate results. All audio features in `SpiegeLib` currently use `librosa `_. The :ref:`FeaturesBase ` class is an abstract base class that defines an interface for performing feature extraction within `SpiegeLib`. All feature extraction classes inherit from :ref:`FeaturesBase ` and provide a wrapper to functions within `librosa `_. .. toctree:: :maxdepth: 2 FeaturesBase Fast Fourier Transform (FFT) Short Time Fourier Transform (STFT) Mel-Frequency Cepstral Coefficients (MFCC) Time Summarized Spectral Features (SpectralSummarized) Data Scaling """""""""""" Data scaling is an important step in pre-processing data prior to machine learning algorithms. All classes that inherit from :ref:`FeaturesBase ` have a ``scaler`` attribute which holds a data scaler object and can be used to normalize or standardize feature extraction results. These scalers are inspired by the scalers implemented in `sklearn `_, but can handle datasets with more dimensions. They are also simplified and designed to fit into feature extraction pipelines. Some examples demonstrating how scalers are integrated with feature extraction are provided in the :ref:`StandardScaler ` .. toctree:: :maxdepth: 2 DataScalerBase StandardScaler