Original: Wohlwend et al., bioRxiv 2024
DOI: 10.1101/2024.11.19.624167

Abstract

Boltz-1 is an open-source biomolecular structure prediction model developed by MIT, Genesis Research, and CHARM Therapeutics, achieving multiple innovations on the AlphaFold 3 architecture with comparable prediction accuracy. As the first fully commercially accessible open-source model, Boltz-1 releases training code, model weights, and datasets under the MIT license.

1. Research Background

1.1 Open-Source Needs in Biomolecular Structure Prediction

The releases of AlphaFold 2 and AlphaFold 3 marked deep learning's achievement of experimental-level accuracy in protein structure prediction. However, the training code and model weights of these models are not fully open-source, limiting further innovation and application by the research community. The community still lacks a fully open-source, commercially accessible model with performance comparable to AlphaFold 3.

1.2 Limitations of Existing Open-Source Models

Existing open-source models have limitations in: training efficiency, simple confidence prediction architecture, lack of effective physical constraint mechanisms, and insufficient optimization of data pipelines. Additionally, model hallucinations (such as chain overlaps) and non-physical predictions (such as chirality errors, steric clashes) are widespread problems.

2. Data Pipeline Innovations

2.1 Data Sources and Processing

Training data uses PDB structures released before September 30, 2021, with resolution of at least 9Å. Unlike AlphaFold 3, Boltz-1 does not include input templates. MSA is built using ColabFold search tools, and molecular conformations are pre-calculated using RDKit's ETKDGv3.

2.2 Dense MSA Pairing Algorithm

For multimeric protein complexes, the authors developed a novel MSA pairing algorithm based on taxonomic information. This algorithm leverages UniProt's taxonomic annotations to more effectively pair homologous sequences from different species, improving modeling accuracy for multi-chain complexes.

2.3 Unified Cropping Algorithm

Combining the advantages of spatial cropping and contiguous cropping strategies, the unified cropping algorithm more effectively selects structural fragments during training. This hybrid strategy balances learning of local structural details and global context information.

3. Model Architecture Improvements

3.1 Training Efficiency Improvement

Boltz-1's training efficiency is significantly improved:

3.2 Confidence Model Innovation

The confidence model is a key innovation of Boltz-1. Unlike AlphaFold 3's use of 4 PairFormer layers, Boltz-1's confidence model includes complete trunk components (AtomAttentionEncoder, MSAModule, 48-layer PairFormerModule), initialized with trained trunk weights.

Additionally, the model aggregates token representations from each timestep of the diffusion model, processes them through time-conditioned recurrent blocks, and concatenates them with trunk features before input to the confidence model. This architecture enables confidence prediction to leverage complete information from the diffusion process.

3.3 Computational Optimizations

Boltz-1 implements multiple computational optimizations:

4. Boltz-steering Technology

4.1 Problem Background

Visual inspection shows that Boltz-1 predictions exhibit hallucination phenomena, mainly manifested as entire chains placed directly overlapping. Additionally, the model occasionally generates non-physical structures, including:

4.2 Feynman-Kac Guidance Framework

Boltz-steering is based on the Feynman-Kac (FK) guidance framework. This method tilts the transition kernel of the diffusion process at each intermediate timestep by defining a potential energy function, biasing trajectories toward paths with low energy in the final state. Sampling uses Sequential Monte Carlo (SMC) methods.

4.3 Constraint Potentials

The total potential energy is a weighted sum of multiple constraint potentials, each targeting specific physical problems:

5. Performance Evaluation and Limitations

5.1 Performance Comparison with AlphaFold 3

The paper reports that Boltz-1 achieves comparable performance to AlphaFold 3 on diverse benchmarks and metrics. The authors evaluate using a self-constructed test set (593 structures) with clear temporal separation and similarity filtering from the training set.

5.2 Value and Challenges of Open-Source Ecosystem

As the first fully commercially accessible open-source model (MIT license), Boltz-1 lowers the barrier to entry for biomolecular structure prediction, promising to facilitate global collaboration and accelerate discovery. However, the open-source model also brings challenges: model maintenance and updates require sustained community contributions, and the sustainability of community support.

5.3 Undisclosed Technical Details

The paper does not fully disclose the following technical details: specific numbers for model parameter scale, training computational resources, inference speed benchmarks, and the impact of additional computational overhead introduced by Boltz-steering on inference time.

6. Conclusion

Boltz-1 represents important progress in the open-source evolution of biomolecular structure prediction. The model achieves significant training efficiency improvements while maintaining accuracy comparable to AlphaFold 3, and addresses some key issues through confidence model architecture innovation and Boltz-steering technology.

Future Development Directions

  • Continuous community-driven improvements
  • Integration with other open-source tools (such as molecular dynamics, docking programs)
  • Specialized optimization for specific application areas (such as antibody design, enzyme engineering)
  • Expansion of training data coverage

References:
[1] Wohlwend J, Corso G, Passaro S, et al. Boltz-1: Democratizing Biomolecular Interaction Modeling. bioRxiv 2024. https://doi.org/10.1101/2024.11.19.624167

← Back to Blog