awesome-architecture-mds/ai-ml/omnizart/Data_Preparation_Feature_Extraction.md at main · CodeBoarding/awesome-architecture-mds

graph LR
    Dataset_Manager["Dataset Manager"]
    Audio_I_O_Handler["Audio I/O Handler"]
    Core_Feature_Extraction_Layer["Core Feature Extraction Layer"]
    General_Label_Processor["General Label Processor"]
    Domain_Specific_Feature_Label_Processor_Beat["Domain-Specific Feature & Label Processor - Beat"]
    Domain_Specific_Feature_Label_Processor_Chord["Domain-Specific Feature & Label Processor - Chord"]
    Feature_Wrapper_Functions["Feature Wrapper Functions"]
    Dataset_Manager -- "provides audio file paths to" --> Audio_I_O_Handler
    Dataset_Manager -- "provides label file paths to" --> General_Label_Processor
    Audio_I_O_Handler -- "receives audio file paths from" --> Dataset_Manager
    Audio_I_O_Handler -- "feeds preprocessed audio to" --> Core_Feature_Extraction_Layer
    Core_Feature_Extraction_Layer -- "receives preprocessed audio from" --> Audio_I_O_Handler
    Core_Feature_Extraction_Layer -- "provides features to" --> Domain_Specific_Feature_Label_Processor_Beat
    Core_Feature_Extraction_Layer -- "provides features to" --> Domain_Specific_Feature_Label_Processor_Chord
    General_Label_Processor -- "receives label file paths from" --> Dataset_Manager
    General_Label_Processor -- "provides processed labels to" --> Domain_Specific_Feature_Label_Processor_Beat
    Domain_Specific_Feature_Label_Processor_Beat -- "integrates features from" --> Core_Feature_Extraction_Layer
    Domain_Specific_Feature_Label_Processor_Beat -- "receives processed labels from" --> General_Label_Processor
    Domain_Specific_Feature_Label_Processor_Chord -- "integrates features from" --> Core_Feature_Extraction_Layer
    Domain_Specific_Feature_Label_Processor_Chord -- "receives processed labels from" --> General_Label_Processor
    Feature_Wrapper_Functions -- "provides utilities to" --> Core_Feature_Extraction_Layer
    Feature_Wrapper_Functions -- "provides utilities to" --> Domain_Specific_Feature_Label_Processor_Beat

Details

The Data Preparation & Feature Extraction subsystem is responsible for transforming raw audio and ground truth labels into a structured format suitable for machine learning models. It encompasses the entire workflow from initial data loading and preprocessing to the extraction of various domain-specific musical features and the preparation of corresponding labels.

Dataset Manager

Manages dataset paths, provides utilities for accessing audio-label pairs, and includes functionality for dataset download. It serves as the primary source for data paths.

Related Classes/Methods:

omnizart.constants.datasets
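
As a rough illustration of what accessing audio-label pairs can involve, the sketch below matches audio and label files by filename stem. The helper name `pair_audio_and_labels`, the two-directory layout, and the `.wav`/`.txt` extensions are assumptions made for this example; they are not the API of `omnizart.constants.datasets`.

```python
from pathlib import Path


def pair_audio_and_labels(audio_dir, label_dir, audio_ext=".wav", label_ext=".txt"):
    """Return sorted (audio_path, label_path) pairs that share a filename stem.

    Hypothetical helper: real datasets often need per-dataset matching rules.
    """
    # Index label files by stem so each audio file can be looked up directly.
    labels = {p.stem: p for p in Path(label_dir).glob(f"*{label_ext}")}
    pairs = []
    for audio in sorted(Path(audio_dir).glob(f"*{audio_ext}")):
        label = labels.get(audio.stem)
        if label is not None:  # skip audio files that have no matching label
            pairs.append((audio, label))
    return pairs
```

Audio files without a matching label are skipped silently here; a stricter variant could raise an error instead, so that incomplete downloads are caught early.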

Audio I/O Handler

Handles the loading of raw audio files from disk and performs initial signal preprocessing (e.g., resampling, normalization).

Related Classes/Methods:

omnizart.io
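
The two preprocessing steps named above can be sketched in plain Python. This is a minimal illustration operating on a list of float samples, not the code in `omnizart.io`: the function names are hypothetical, and the linear-interpolation resampler applies no anti-aliasing filter, which a production pipeline would normally include.

```python
def peak_normalize(samples, peak=1.0):
    """Scale samples so the largest absolute value equals `peak`."""
    max_abs = max((abs(s) for s in samples), default=0.0)
    if max_abs == 0.0:
        return list(samples)  # silence (or empty input) is returned unchanged
    scale = peak / max_abs
    return [s * scale for s in samples]


def resample_linear(samples, src_rate, dst_rate):
    """Resample by linear interpolation (illustrative; no anti-aliasing filter)."""
    if not samples:
        return []
    n_out = max(1, round(len(samples) * dst_rate / src_rate))
    out = []
    for i in range(n_out):
        pos = i * src_rate / dst_rate  # fractional index into the source signal
        left = min(int(pos), len(samples) - 1)
        right = min(left + 1, len(samples) - 1)
        frac = pos - left
        out.append(samples[left] * (1.0 - frac) + samples[right] * frac)
    return out
```

Normalizing after resampling, rather than before, keeps the peak at the target level even if interpolation changes the maximum amplitude slightly.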

Core Feature Extraction Layer

Extracts the fundamental feature representations used across tasks: Combined Frequency and Periodicity (CFP), Constant-Q Transform (CQT), Harmonic CFP (HCFP), and beat/downbeat information. These are generic features derived directly from preprocessed audio.

Related Classes/Methods:

General Label Processor

Loads and transforms general music ground truth labels into a format consumable by machine learning models.

Related Classes/Methods:

omnizart.music.labels
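
One common way to make note-level ground truth consumable by a model is to rasterize it onto the same frame grid as the features. The sketch below assumes notes are given as `(onset_sec, offset_sec, midi_pitch)` tuples and uses a hypothetical default frame length of 0.02 seconds; the actual label format and resolution used by `omnizart.music.labels` may differ.

```python
def notes_to_frame_labels(notes, duration_sec, frame_sec=0.02):
    """Convert (onset_sec, offset_sec, midi_pitch) notes to per-frame pitch sets.

    Frame i covers the time span [i * frame_sec, (i + 1) * frame_sec).
    """
    n_frames = int(round(duration_sec / frame_sec))
    frames = [set() for _ in range(n_frames)]
    for onset, offset, pitch in notes:
        # Clamp to the valid frame range so out-of-bounds notes cannot fail.
        start = max(0, int(round(onset / frame_sec)))
        end = min(n_frames, int(round(offset / frame_sec)))
        for idx in range(start, end):
            frames[idx].add(pitch)
    return frames
```

For example, with `frame_sec=0.5`, a note from 0.0 s to 1.0 s occupies frames 0 and 1, and an overlapping note from 0.5 s to 1.5 s occupies frames 1 and 2.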

Domain-Specific Feature & Label Processor - Beat

Extracts and processes features and ground truth labels specifically for beat tracking tasks, often combining outputs from core feature extractors with label data.

Related Classes/Methods:

omnizart.beat.features

Domain-Specific Feature & Label Processor - Chord

Manages feature and label extraction for chord recognition, including data augmentation and segmentation, integrating core features with chord-specific label processing.

Related Classes/Methods:

omnizart.chord.features
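
Segmentation and augmentation can be illustrated with two small helpers. Both are assumptions for this sketch rather than the functions in `omnizart.chord.features`: `segment_frames` cuts a frame sequence into fixed-length windows, and `transpose_roots` shifts chord roots, encoded as pitch classes 0 to 11, by a number of semitones.

```python
def segment_frames(frames, segment_len, hop):
    """Split a frame sequence into fixed-length, possibly overlapping segments.

    A trailing remainder shorter than `segment_len` is dropped.
    """
    return [
        frames[start:start + segment_len]
        for start in range(0, len(frames) - segment_len + 1, hop)
    ]


def transpose_roots(roots, semitones):
    """Shift chord roots (pitch classes 0-11) by `semitones`, wrapping at the octave."""
    return [(root + semitones) % 12 for root in roots]
```

When labels are transposed for augmentation, the corresponding audio features have to be pitch-shifted by the same amount, otherwise features and labels no longer describe the same chords.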

Feature Wrapper Functions

Provides utility functions that support various feature extraction processes, such as patching and converting between frame-based and time-based representations.

Related Classes/Methods:

omnizart.feature.wrapper_func
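
Converting between time-based and frame-based representations reduces to scaling by the ratio of sampling rate to hop size. The helpers below are a minimal sketch with hypothetical names, and the default sampling rate (44100 Hz) and hop size (256 samples) are illustrative assumptions, not values confirmed for `omnizart.feature.wrapper_func`.

```python
def time_to_frame(time_sec, sampling_rate=44100, hop_size=256):
    """Map a time in seconds to the index of the nearest frame."""
    return int(round(time_sec * sampling_rate / hop_size))


def frame_to_time(frame_idx, sampling_rate=44100, hop_size=256):
    """Map a frame index to the time in seconds at which that frame starts."""
    return frame_idx * hop_size / sampling_rate
```

Rounding to the nearest frame means a round trip from frame index to time and back returns the original index, while a round trip starting from an arbitrary time is only accurate to within half a hop.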
