qwen3-1.7_expert_tools_v2_gguf (GGUF)

This model was fine-tuned and converted to GGUF format using Unsloth.

Example usage:

  • For text-only LLMs: `llama-cli -hf MadhuryaPasan/qwen3-1.7_expert_tools_v2_gguf --jinja`
  • For multimodal models: `llama-mtmd-cli -hf MadhuryaPasan/qwen3-1.7_expert_tools_v2_gguf --jinja`

Available model files:

  • `qwen3-1.7b.F16.gguf`
  • `qwen3-1.7b.Q8_0.gguf`
  • `qwen3-1.7b.Q4_K_M.gguf`
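Once a file is downloaded, it can also be run directly from disk rather than via `-hf`. A minimal sketch (the prompt is illustrative, and the path assumes the GGUF sits in the working directory):

```shell
# Run the 4-bit quant locally with llama.cpp's CLI
llama-cli -m qwen3-1.7b.Q4_K_M.gguf --jinja -p "What is the capital of France?"
```

Q4_K_M is the smallest of the three files; F16 trades disk space and memory for the highest fidelity.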

Evaluation Results

This model was evaluated using the BFCL (Berkeley Function Calling Leaderboard) evaluation framework to benchmark its tool-calling accuracy.

(Note: both models were evaluated in their 4-bit quantized (Q4_K_M) versions; full F16 precision may yield slightly higher scores.)

| Evaluation Metric | Base Model (Qwen3 1.7B) | Fine-Tuned (ExpertTools) |
|---|---|---|
| Non-Live Overall Acc | 29.73% | 32.27% |
| AST Summary | 29.73% | 32.27% |
| Simple AST (Overall) | 26.92% | 33.08% |
| Python Simple AST | 46.75% | 52.25% |
| JavaScript Simple AST | 34.00% | 46.00% |
| Java Simple AST | 0.00% | 1.00% |
| Multiple AST | 32.00% | 35.50% |
| Parallel AST | 44.00% | 45.50% |
| Parallel Multiple AST | 16.00% | 15.00% |
| Irrelevance Detection | 74.58% | 61.25% |

Ollama

An Ollama Modelfile is included for easy deployment.
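With the Modelfile and the GGUF in the same directory, a typical flow looks like this (the local model name `qwen3-expert-tools` and the prompt are illustrative):

```shell
# Register the model with Ollama from the bundled Modelfile, then chat with it
ollama create qwen3-expert-tools -f Modelfile
ollama run qwen3-expert-tools "Book a table for two at 7pm using the available tools."
```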

Model details:

  • Format: GGUF
  • Model size: 2B params
  • Architecture: qwen3