Quantizations
Collection
All GGUF quants that I have made so far, and demos too. • 6 items • Updated
This repository contains GGUF quantized versions of Fu01978/OLMo-2-1B-openai-gsm8k, intended for efficient inference with llama.cpp-compatible runtimes.
These quantizations are derived from:
Fu01978/OLMo-2-1B-openai-gsm8k
👉 https://huggingface.co/Fu01978/OLMo-2-1B-openai-gsm8k
Please refer to the base model card for:
Example with llama.cpp:
./main \
-m OLMo-2-1B-openai-gsm8k*.gguf \
-p "Solve: 23 + 19 ="
4-bit
5-bit
6-bit
8-bit
Base model
allenai/OLMo-2-0425-1B