GGUF models
Collection
17 items • Updated
The GGUF models in this repo are quantized and converted from ibm-granite/granite-3.1-1b-a400m-base using llama.cpp service and llama.cpp
3-bit
4-bit
5-bit
6-bit
8-bit
16-bit
Base model
ibm-granite/granite-3.1-1b-a400m-base