Original: Wohlwend et al., bioRxiv 2024
DOI: 10.1101/2024.11.19.624167
Abstract
Boltz-1 is an open-source biomolecular structure prediction model developed by MIT, Genesis Research, and CHARM Therapeutics, achieving multiple innovations on the AlphaFold 3 architecture with comparable prediction accuracy. As the first fully commercially accessible open-source model, Boltz-1 releases training code, model weights, and datasets under the MIT license.
1. Research Background
1.1 Open-Source Needs in Biomolecular Structure Prediction
The releases of AlphaFold 2 and AlphaFold 3 marked deep learning's achievement of experimental-level accuracy in protein structure prediction. However, the training code and model weights of these models are not fully open-source, limiting further innovation and application by the research community. The community still lacks a fully open-source, commercially accessible model with performance comparable to AlphaFold 3.
1.2 Limitations of Existing Open-Source Models
Existing open-source models have limitations in: training efficiency, simple confidence prediction architecture, lack of effective physical constraint mechanisms, and insufficient optimization of data pipelines. Additionally, model hallucinations (such as chain overlaps) and non-physical predictions (such as chirality errors, steric clashes) are widespread problems.
2. Data Pipeline Innovations
2.1 Data Sources and Processing
Training data uses PDB structures released before September 30, 2021, with resolution of at least 9Å. Unlike AlphaFold 3, Boltz-1 does not include input templates. MSA is built using ColabFold search tools, and molecular conformations are pre-calculated using RDKit's ETKDGv3.
2.2 Dense MSA Pairing Algorithm
For multimeric protein complexes, the authors developed a novel MSA pairing algorithm based on taxonomic information. This algorithm leverages UniProt's taxonomic annotations to more effectively pair homologous sequences from different species, improving modeling accuracy for multi-chain complexes.
2.3 Unified Cropping Algorithm
Combining the advantages of spatial cropping and contiguous cropping strategies, the unified cropping algorithm more effectively selects structural fragments during training. This hybrid strategy balances learning of local structural details and global context information.
3. Model Architecture Improvements
3.1 Training Efficiency Improvement
Boltz-1's training efficiency is significantly improved:
- Boltz-1: 68k training steps, batch size 128
- AlphaFold 3: ~150k steps, batch size 256
- Computational time reduced by approximately 4×
3.2 Confidence Model Innovation
The confidence model is a key innovation of Boltz-1. Unlike AlphaFold 3's use of 4 PairFormer layers, Boltz-1's confidence model includes complete trunk components (AtomAttentionEncoder, MSAModule, 48-layer PairFormerModule), initialized with trained trunk weights.
Additionally, the model aggregates token representations from each timestep of the diffusion model, processes them through time-conditioned recurrent blocks, and concatenates them with trunk features before input to the confidence model. This architecture enables confidence prediction to leverage complete information from the diffusion process.
3.3 Computational Optimizations
Boltz-1 implements multiple computational optimizations:
- Sequence-local atom representation (32-atom blocks only attend to nearest 128 atoms in sequence space)
- Attention bias sharing and caching
- Greedy symmetry correction
- Tiled processing of MSA module and triangular attention
- trifast kernel (Triton-based triangular self-attention implementation)
4. Boltz-steering Technology
4.1 Problem Background
Visual inspection shows that Boltz-1 predictions exhibit hallucination phenomena, mainly manifested as entire chains placed directly overlapping. Additionally, the model occasionally generates non-physical structures, including:
- Steric clashes between atoms
- Slightly incorrect bond lengths and angles
- Stereochemistry errors at chiral centers
- Non-planar predictions of aromatic rings
4.2 Feynman-Kac Guidance Framework
Boltz-steering is based on the Feynman-Kac (FK) guidance framework. This method tilts the transition kernel of the diffusion process at each intermediate timestep by defining a potential energy function, biasing trajectories toward paths with low energy in the final state. Sampling uses Sequential Monte Carlo (SMC) methods.
4.3 Constraint Potentials
The total potential energy is a weighted sum of multiple constraint potentials, each targeting specific physical problems:
- Tetrahedral atomic chirality: Based on improper torsion angles, distinguishing R/S configurations
- Bond stereochemistry: Based on torsion angles, distinguishing E/Z configurations
- Planar double bonds: Flat-bottom potential based on improper torsion angles
- Internal geometry: Based on distance bounds matrices generated by RDKit
- Clashes: Applied at high noise levels, penalizing atom overlaps
- Overlaps: Penalizing entire chain overlaps
- Covalent bonds: Ensuring reasonable distances for bonded atoms
5. Performance Evaluation and Limitations
5.1 Performance Comparison with AlphaFold 3
The paper reports that Boltz-1 achieves comparable performance to AlphaFold 3 on diverse benchmarks and metrics. The authors evaluate using a self-constructed test set (593 structures) with clear temporal separation and similarity filtering from the training set.
5.2 Value and Challenges of Open-Source Ecosystem
As the first fully commercially accessible open-source model (MIT license), Boltz-1 lowers the barrier to entry for biomolecular structure prediction, promising to facilitate global collaboration and accelerate discovery. However, the open-source model also brings challenges: model maintenance and updates require sustained community contributions, and the sustainability of community support.
5.3 Undisclosed Technical Details
The paper does not fully disclose the following technical details: specific numbers for model parameter scale, training computational resources, inference speed benchmarks, and the impact of additional computational overhead introduced by Boltz-steering on inference time.
6. Conclusion
Boltz-1 represents important progress in the open-source evolution of biomolecular structure prediction. The model achieves significant training efficiency improvements while maintaining accuracy comparable to AlphaFold 3, and addresses some key issues through confidence model architecture innovation and Boltz-steering technology.
Future Development Directions
- Continuous community-driven improvements
- Integration with other open-source tools (such as molecular dynamics, docking programs)
- Specialized optimization for specific application areas (such as antibody design, enzyme engineering)
- Expansion of training data coverage
References:
[1] Wohlwend J, Corso G, Passaro S, et al. Boltz-1: Democratizing Biomolecular Interaction Modeling. bioRxiv 2024. https://doi.org/10.1101/2024.11.19.624167