thdekerk/Huihui-gemma-4-31B-it-v2-MLX-8bit

This model was converted to MLX format from huihui-ai/Huihui-gemma-4-31B-it-abliterated-v2 using mlx-vlm version 0.4.4. Refer to the original model card for more details on the model.

Use with mlx

pip install -U mlx-vlm

python -m mlx_vlm.generate --model thdekerk/Huihui-gemma-4-31B-it-v2-MLX-8bit --max-tokens 100 --temperature 0.0 --prompt "Describe this image." --image <path_to_image>
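If you want to script the command above, for example to run it over several images, a minimal sketch follows. It only builds and prints the argument list using the flags shown in the command. The helper name `build_generate_command` and the path `photo.jpg` are hypothetical, introduced here for illustration.

```python
import shlex
import sys


def build_generate_command(model, prompt, image, max_tokens=100, temperature=0.0):
    """Return the argv list for the mlx_vlm.generate CLI call shown above.

    Hypothetical helper: it only assembles the flags used in this model card.
    """
    return [
        sys.executable, "-m", "mlx_vlm.generate",
        "--model", model,
        "--max-tokens", str(max_tokens),
        "--temperature", str(temperature),
        "--prompt", prompt,
        "--image", image,
    ]


cmd = build_generate_command(
    model="thdekerk/Huihui-gemma-4-31B-it-v2-MLX-8bit",
    prompt="Describe this image.",
    image="photo.jpg",  # hypothetical local path, replace with your own
)

# Print a shell-safe version of the command for inspection.
print(shlex.join(cmd))

# To actually run it (requires mlx-vlm installed on Apple silicon):
# import subprocess
# subprocess.run(cmd, check=True)
```

Passing the arguments as a list avoids shell-quoting problems when the prompt contains spaces or quotes.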

Use with Python/mlx

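from mlx_vlm import load, generate
from mlx_vlm.prompt_utils import apply_chat_template
from mlx_vlm.utils import load_config

# Load the quantized model, its processor, and the model config
model, processor = load("thdekerk/Huihui-gemma-4-31B-it-v2-MLX-8bit")
config = load_config("thdekerk/Huihui-gemma-4-31B-it-v2-MLX-8bit")

# One input image (URL or local path) and a text prompt
image = ["https://huggingface.co/thdekerk/Huihui-gemma-4-31B-it-v2-MLX-8bit/resolve/main/images/fuji.jpg"]
prompt = "Can you describe the image in detail?"

# Wrap the prompt in the model's chat template, reserving one image slot
formatted_prompt = apply_chat_template(
    processor, config, prompt, num_images=1
)

# Generate up to 1024 tokens and print the result
output = generate(model, processor, formatted_prompt, image, max_tokens=1024)
print(output)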

Convert to mlx model

from mlx_vlm.convert import convert

convert(
    "huihui-ai/Huihui-gemma-4-31B-it-abliterated-v2",
    mlx_path="/Users/thierrydekerk/MLX/Huihui-gemma-4-31B-it-abliterated-v2-MLX-8bit",
    q_bits=8,
    quantize=True,
    upload_repo="thdekerk/Huihui-gemma-4-31B-it-v2-MLX-8bit",
)
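The conversion above uses `q_bits=8`. To get a rough idea of how much memory the quantized weights need, the arithmetic can be sketched as below. This is a back-of-the-envelope estimate, not a measurement. It assumes a nominal 31 billion parameters (taken from the model name), a quantization group size of 64, and one 16-bit scale plus one 16-bit bias stored per group. None of these values are stated in this model card. It also ignores any layers left unquantized, the KV cache, and activations.

```python
def estimate_quantized_weight_gb(
    n_params,
    bits=8,
    group_size=64,               # assumption: not stated in this model card
    overhead_bits_per_group=32,  # assumption: 16-bit scale + 16-bit bias
):
    """Rough size of group-quantized weights in GB (10^9 bytes)."""
    # Each weight costs its own bits plus a share of the per-group metadata.
    bits_per_weight = bits + overhead_bits_per_group / group_size
    return n_params * bits_per_weight / 8 / 1e9


# Nominal parameter count taken from the "31B" in the model name.
size_gb = estimate_quantized_weight_gb(31e9)
print(f"{size_gb:.1f} GB")  # prints "32.9 GB"
```

Under these assumptions the weights alone take about 33 GB, so the machine needs noticeably more unified memory than that to run generation comfortably.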

Model tree for thdekerk/Huihui-gemma-4-31B-it-v2-MLX-8bit

Base model: google/gemma-4-31B-it
Finetuned: huihui-ai/Huihui-gemma-4-31B-it-abliterated-v2
Quantized (9): this model