Bonsai-4B-TRNQ is a Trillim-packaged, requantized build of prism-ml/Bonsai-4B-unpacked, intended for efficient text generation with the Trillim inference engine.
| Item | Value |
|---|---|
| Architecture | Qwen3-4B dense |
| Parameters | 4B class |
| Source model | prism-ml/Bonsai-4B-unpacked |
| Packaging | Trillim requantized bundle |
| License | Apache 2.0 |

    pip install trillim
    trillim pull Trillim/Bonsai-4B-TRNQ
    trillim chat Trillim/Bonsai-4B-TRNQ

This launches an interactive CLI chat session.
| File | Description |
|---|---|
| `qmodel.tensors` | Quantized weights in Trillim format |
| `rope.cache` | Precomputed RoPE cache for runtime |
| `config.json` | Model configuration |
| `generation_config.json` | Generation defaults |
| `trillim_config.json` | Trillim runtime metadata |
| `tokenizer.json` | Tokenizer data |
| `tokenizer_config.json` | Tokenizer configuration |
| `special_tokens_map.json` | Special token mapping |
| `chat_template.jinja` | Chat template |
| `vocab.json`, `merges.txt`, `added_tokens.json` | Tokenizer assets |
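
To confirm that a downloaded copy of the bundle is complete, the file list above can be checked with a few lines of shell. This is only a sketch: `expected_files` and `check_bundle` are hypothetical helper names, not part of the Trillim CLI, and the directory argument is an assumption, since this card does not say where `trillim pull` stores the files.

```shell
#!/bin/sh
# Print the file names this model card lists for the bundle, one per line.
expected_files() {
  cat <<'EOF'
qmodel.tensors
rope.cache
config.json
generation_config.json
trillim_config.json
tokenizer.json
tokenizer_config.json
special_tokens_map.json
chat_template.jinja
vocab.json
merges.txt
added_tokens.json
EOF
}

# check_bundle DIR (hypothetical helper)
# Prints "missing: NAME" for each listed file that is absent from DIR.
# Returns 0 when every listed file is present, 1 otherwise.
check_bundle() {
  bundle_dir="$1"
  status=0
  for name in $(expected_files); do
    if [ ! -f "$bundle_dir/$name" ]; then
      printf 'missing: %s\n' "$name"
      status=1
    fi
  done
  return "$status"
}
```

After sourcing the snippet, `check_bundle /path/to/bundle && echo "bundle looks complete"` reports any missing files; replace `/path/to/bundle` with wherever the files actually live on your machine.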
This repository is derived from:
- prism-ml/Bonsai-4B-unpacked

Changes made by Trillim:
- Requantized the weights and packaged them as a Trillim bundle

This repository is not affiliated with or endorsed by Prism ML or Alibaba Cloud.
Released under the Apache 2.0 License, consistent with the upstream model license.
See:
- LICENSE
- NOTICE.txt