Skip to content

Latest commit

 

History

History
128 lines (88 loc) · 12.7 KB

File metadata and controls

128 lines (88 loc) · 12.7 KB
graph LR
    Application_Orchestration["Application Orchestration"]
    Boltz_Core_Models["Boltz Core Models"]
    Neural_Network_Architecture["Neural Network Architecture"]
    Data_Processing_Feature_Engineering["Data Processing & Feature Engineering"]
    Data_Management_Augmentation["Data Management & Augmentation"]
    Training_Evaluation_Framework["Training & Evaluation Framework"]
    General_Utilities["General Utilities"]
    Application_Orchestration -- "orchestrates" --> Boltz_Core_Models
    Application_Orchestration -- "processes inputs using" --> Data_Processing_Feature_Engineering
    Boltz_Core_Models -- "utilizes" --> Neural_Network_Architecture
    Boltz_Core_Models -- "trained and evaluated by" --> Training_Evaluation_Framework
    Boltz_Core_Models -- "consumes data from" --> Data_Management_Augmentation
    Neural_Network_Architecture -- "provides building blocks for" --> Boltz_Core_Models
    Neural_Network_Architecture -- "uses" --> General_Utilities
    Data_Processing_Feature_Engineering -- "prepares data for" --> Data_Management_Augmentation
    Data_Processing_Feature_Engineering -- "provides processed data to" --> Application_Orchestration
    Data_Management_Augmentation -- "supplies data to" --> Boltz_Core_Models
    Data_Management_Augmentation -- "receives data from" --> Data_Processing_Feature_Engineering
    Training_Evaluation_Framework -- "optimizes and evaluates" --> Boltz_Core_Models
    Training_Evaluation_Framework -- "uses data from" --> Data_Management_Augmentation
    General_Utilities -- "supports" --> Neural_Network_Architecture
    General_Utilities -- "supports" --> Data_Processing_Feature_Engineering
    click Application_Orchestration href "https://github.com/CodeBoarding/GeneratedOnBoardings/blob/main/boltz/Application Orchestration.md" "Details"
    click Boltz_Core_Models href "https://github.com/CodeBoarding/GeneratedOnBoardings/blob/main/boltz/Boltz Core Models.md" "Details"
    click Neural_Network_Architecture href "https://github.com/CodeBoarding/GeneratedOnBoardings/blob/main/boltz/Neural Network Architecture.md" "Details"
    click Data_Processing_Feature_Engineering href "https://github.com/CodeBoarding/GeneratedOnBoardings/blob/main/boltz/Data Processing & Feature Engineering.md" "Details"
    click Data_Management_Augmentation href "https://github.com/CodeBoarding/GeneratedOnBoardings/blob/main/boltz/Data Management & Augmentation.md" "Details"
    click Training_Evaluation_Framework href "https://github.com/CodeBoarding/GeneratedOnBoardings/blob/main/boltz/Training & Evaluation Framework.md" "Details"
    click General_Utilities href "https://github.com/CodeBoarding/GeneratedOnBoardings/blob/main/boltz/General Utilities.md" "Details"
Loading

CodeBoardingDemoContact

Component Details

The Boltz project is a molecular modeling framework primarily focused on predicting and generating molecular structures and their properties, such as binding affinity and confidence. The core functionality revolves around neural network models (Boltz1 and Boltz2) that leverage diffusion processes for structure generation. The system handles comprehensive data processing, from parsing raw molecular data and performing feature engineering to managing data loading and augmentation for efficient training. It includes a robust training and evaluation framework with various loss functions, physical potentials, and optimization utilities. The overall application workflow is orchestrated to manage the flow of data and execution across these specialized modules, with general utilities supporting various operations throughout the pipeline.

Application Orchestration

Serves as the central control unit for the Boltz application, managing the overall workflow from input processing to model prediction. It coordinates data flow and execution across different modules.

Related Classes/Methods:

Boltz Core Models

Encapsulates the primary Boltz neural network architectures (Boltz1, Boltz2), including their forward passes, training, validation, and prediction steps. It integrates diffusion, confidence, and affinity prediction capabilities.

Related Classes/Methods:

Neural Network Architecture

Provides the fundamental building blocks for the Boltz models, including input embedding, MSA and pairformer modules, various encoders (single, pairwise, atom attention), transformer blocks, and core neural network layers like attention mechanisms and triangular multiplications.

Related Classes/Methods:

Data Processing & Feature Engineering

Manages the parsing of raw input data (MMCIF, FASTA, YAML, CSV, A3M) into internal data structures, followed by comprehensive featurization and tokenization processes. This includes generating token, atom, MSA, template, and symmetry features, and handling molecular geometry and constraints.

Related Classes/Methods:

Data Management & Augmentation

Handles the loading, batching, sampling, and cropping of prepared data for training, validation, and inference. It includes various sampling strategies (cluster, random, distillation) and mechanisms to filter and augment data subsets for efficient model processing.

Related Classes/Methods:

Training & Evaluation Framework

Provides the infrastructure for training Boltz models, including defining various loss functions (confidence, diffusion, validation metrics), physical potential functions to guide molecular generation, and optimization utilities like learning rate schedulers and Exponential Moving Average (EMA). It also handles outputting predicted structures and evaluating model performance.

Related Classes/Methods:

General Utilities

A collection of miscellaneous utility functions supporting various operations across the project, such as default value handling, random augmentations (rotations, quaternions), and centering operations.

Related Classes/Methods: