DeepFoldProtein/openfold3-cuequiv-trt

TensorRT BF16 engines for OpenFold3 (pairformer + token transformer), built with Bio-TRT.

Triangle attention backend: CUEQUIV

Note: This repo contains CUEQUIV (cuEquivariance) triangle attention engines. Requires Bio-TRT commit with the FP32 triangle_bias cast fix in thirdparty/tensorrt_bionemo/_trt/layers/attention.py β€” earlier Bio-TRT versions produce numerically incorrect results due to a cuEquivariance SM100f fallback bug.

TRT engines are GPU-architecture specific. Download the directory matching your GPU's compute capability.

Available Engines

Directory GPU SM TRT Version CUDA Built
B200_SM10.0_TRT10.15/ unknown SM unknown unknown unknown unknown

Contents (per GPU directory)

<gpu_tag>/
β”œβ”€β”€ pairformer_engine/trt/
β”‚   β”œβ”€β”€ rank0.engine
β”‚   └── config.json
└── token_transformer_engine/trt/
    β”œβ”€β”€ rank0.engine
    └── config.json

Usage

# Download engines for your GPU
pip install huggingface-hub
huggingface-cli download DeepFoldProtein/openfold3-cuequiv-trt --local-dir engines \
    --include "B200_SM10.0_TRT10.15/**"

# Run inference
bash openfold3/run_inference.sh \
    --query examples/openfold3/ubiquitin.json \
    --checkpoint /path/to/of3-p2-155k.pt \
    --engines-dir engines/<gpu_tag> \
    --output-dir results/ubiquitin
Downloads last month
-
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support