MiMo-VL-7B Quantize
This is a 4-bit quantized version of XiaomiMiMo/MiMo-VL-7B-SFT, produced with the bitsandbytes library using the NF4 quantization type and stored in safetensors format.
Quantization reduces memory usage and makes it possible to run the model on consumer GPUs
(≤ 12 GB VRAM), at the cost of a small reduction in generation quality.