# Gemopus-4-E4B-it-MLX-4bit
This is a 4-bit quantization of Jackrong/Gemopus-4-E4B-it, converted to the MLX format for use on Apple silicon.
## Optimization Details
- Quantization: 4-bit (a conversion sketch follows this list)
- Framework: MLX
- Hardware used for conversion: MacBook Air (M3/M4)
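For reference, quantized MLX conversions like this one are typically produced with mlx-lm's `convert` utility. A minimal sketch, assuming mlx-lm's Python `convert` helper with its `quantize`/`q_bits` parameters; the output path is illustrative:

```python
from mlx_lm import convert

# Quantize the base model to 4-bit and write the result in MLX format.
# The repo ID matches the base model named above; the output directory
# is an illustrative choice.
convert(
    "Jackrong/Gemopus-4-E4B-it",
    mlx_path="Gemopus-4-E4B-it-MLX-4bit",
    quantize=True,
    q_bits=4,
)
```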
## Performance on MacBook Air
- Generation Speed: ~35 tokens/sec
- Memory Usage: ~4.3 GB
## Usage

```bash
pip install mlx-lm
python -m mlx_lm.generate --model Nicoesp/Gemopus-4-E4B-it-MLX-4bit --prompt "Ciao!"
```
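Beyond the CLI, the model can be used from Python with mlx-lm's `load`/`generate` API. A minimal sketch: the chat-template step assumes the tokenizer ships one, as instruction-tuned models usually do, and `max_tokens=256` is an illustrative setting:

```python
from mlx_lm import load, generate

# Download (or load from the local cache) the 4-bit model and tokenizer.
model, tokenizer = load("Nicoesp/Gemopus-4-E4B-it-MLX-4bit")

# Instruction-tuned models expect chat-formatted prompts; apply the
# tokenizer's chat template before generating.
messages = [{"role": "user", "content": "Ciao!"}]
prompt = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, tokenize=False
)

response = generate(model, tokenizer, prompt=prompt, max_tokens=256)
print(response)
```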