These are simple quantizations of qikp/hummingbird-2.5-110m using llama.cpp.
Chat template
2-bit
4-bit
6-bit
8-bit
Base model