DeepFoldProtein/openfold3-cuequiv-trt
TensorRT BF16 engines for OpenFold3 (pairformer + token transformer), built with Bio-TRT.
Triangle attention backend: CUEQUIV
Note: This repo contains CUEQUIV (cuEquivariance) triangle attention engines. It requires a Bio-TRT commit that includes the FP32 triangle_bias cast fix in
thirdparty/tensorrt_bionemo/_trt/layers/attention.py; earlier Bio-TRT versions produce numerically incorrect results due to a cuEquivariance SM100f fallback bug.
TRT engines are GPU-architecture specific. Download the directory matching your GPU's compute capability.
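One way to check which directory matches your GPU is to query the compute capability with nvidia-smi. This is a minimal sketch, assuming a driver recent enough to support the `compute_cap` query field and assuming the directory tag embeds the capability as `_SM<major>.<minor>_` (as in `B200_SM10.0_TRT10.15`). The helper names `gpu_compute_caps` and `dir_matches_cap` are hypothetical and not part of this repo or Bio-TRT.

```shell
# Print the compute capability of each visible GPU, one per line (e.g. "10.0").
# Assumption: the installed NVIDIA driver supports the compute_cap query field.
gpu_compute_caps() {
  nvidia-smi --query-gpu=compute_cap --format=csv,noheader
}

# Hypothetical helper: succeed if directory name $1 carries capability $2,
# assuming the "_SM<cap>_" naming seen in B200_SM10.0_TRT10.15.
dir_matches_cap() {
  case "$1" in
    *"_SM${2}_"*) return 0 ;;
    *) return 1 ;;
  esac
}

# Example use on a machine with a GPU:
#   for cap in $(gpu_compute_caps); do
#     dir_matches_cap "B200_SM10.0_TRT10.15/" "$cap" && echo "engines match SM $cap"
#   done
```

If no directory matches, the engines here will not load on that GPU and would have to be rebuilt for it.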
Available Engines
| Directory | GPU | SM | TRT Version | CUDA | Built |
|---|---|---|---|---|---|
| B200_SM10.0_TRT10.15/ | unknown | SM unknown | unknown | unknown | unknown |
Contents (per GPU directory)
<gpu_tag>/
├── pairformer_engine/trt/
│   ├── rank0.engine
│   └── config.json
└── token_transformer_engine/trt/
    ├── rank0.engine
    └── config.json
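After downloading, it can help to confirm that a GPU directory has the layout shown above before starting inference. The following is a small sketch under that assumption. `check_engine_dir` is a hypothetical helper, not a script shipped with this repo, and it only checks that the four files exist, not that the engines are valid.

```shell
# Hypothetical helper: verify that <gpu_tag>/ contains both engine
# subdirectories, each with rank0.engine and config.json under trt/.
# Prints each missing path to stderr and returns non-zero if any is absent.
check_engine_dir() {
  root="$1"
  status=0
  for sub in pairformer_engine token_transformer_engine; do
    for f in rank0.engine config.json; do
      path="$root/$sub/trt/$f"
      if [ ! -f "$path" ]; then
        echo "missing: $path" >&2
        status=1
      fi
    done
  done
  return "$status"
}

# Example use:
#   check_engine_dir engines/B200_SM10.0_TRT10.15 && echo "layout OK"
```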
Usage
# Download engines for your GPU
pip install huggingface-hub
huggingface-cli download DeepFoldProtein/openfold3-cuequiv-trt --local-dir engines \
--include "B200_SM10.0_TRT10.15/**"
# Run inference
bash openfold3/run_inference.sh \
--query examples/openfold3/ubiquitin.json \
--checkpoint /path/to/of3-p2-155k.pt \
--engines-dir engines/<gpu_tag> \
--output-dir results/ubiquitin