HuggingFaceBio
/

Carbon-3B

Text Generation

Upper Grand Valley Dani

text-generation-inference

Model card Files Files and versions

lewtun HF Staff commited on 3 days ago

Commit

2966545

·

verified ·

1 Parent(s): 06e0869

Update README.md

Files changed (1) hide show

README.md +3 -1

README.md CHANGED Viewed

@@ -312,7 +312,9 @@ Sample sizes: Carbon & GENERator n=500. Evo2-7B n=150 at 16k, n=100 at 32k, n=10
 ### Inference efficiency
-> TODO: add Ed's benchmarks
 ## Training

 ### Inference efficiency
+Carbon models run natively in vLLM and thus generate DNA sequences over 150 times faster than the Evo2 family of models. Below we show the results of a throughput benchmark, where 1080 base-pairs are used for prefill and decode with increasing number of input sequences. All models except Evo2 40B were run on a H100 GPU, with the batch size of the Evo2 models tuned to the largest possible size that fits in VRAM.
+![](figures/inference.png)
 ## Training