2-bit Qwen3.6 Tool-calling is amazing!!

pinned

by danielhanchen - opened 6 days ago

Unsloth AI org 6 days ago

•

Hey guys, feel free to run and train Qwen3.6 in Unsloth Studio. The tool-calling is quite amazing, even for 2-bit.
GitHub: https://github.com/unslothai/unsloth
Guide: https://unsloth.ai/docs/models/qwen3.6#unsloth-studio-guide

Below, the 2-bit Qwen3.6 GGUF made 30+ tool calls, searched 20 sites and executed Python code.

4-bit example:
qwen3.6 in unsloth studio

danielhanchen pinned discussion 6 days ago

danielhanchen changed discussion title from Qwen3.6 Tool-calling is amazing!! to 2-bit Qwen3.6 Tool-calling is amazing!! 6 days ago

curtis1969

6 days ago

•

edited 5 days ago

thanks i was on the fence about trying the 2 bit i wish unsloth studio was easy to set up like lm studio and had a app or something maybe that would hurt performance too much idk looks great tho

shimmyshimmer

Unsloth AI org 6 days ago

thanks i was on the fence about trying the 2 bit i wish unsloth studio was as easy to set up like lm studio and had a app or something maybe that would hurt performance too much idk looks great tho

We're gonna release the desktop app this month! Needs more testing 🙏

shimmyshimmer unpinned discussion 4 days ago

shimmyshimmer pinned discussion 4 days ago

holooo

4 days ago

•

edited 4 days ago

thanks i was on the fence about trying the 2 bit i wish unsloth studio was as easy to set up like lm studio and had a app or something maybe that would hurt performance too much idk looks great tho

We're gonna release the desktop app this month! Needs more testing 🙏

Could you please add inference support for Intel Arc GPUs to Unsloth Studio? 🙏

islameissa

4 days ago

without any doubt, you make the best gguf quants ( I did not try other forms). I always go to the UD_Q4_K_XL as it is fast with near lossless experience.
Re Qwen3.6-35B: to be honest, I did not see any meaningful difference compared to the 3.5 , not sure what qwen is talking about with all the way higher benchmarking compared to older version.

je0923

4 days ago

•

edited 4 days ago

Re Qwen3.6-35B: to be honest, I did not see any meaningful difference compared to the 3.5 , not sure what qwen is talking about with all the way higher benchmarking compared to older version.

It seems to perform even better with tool calling for me!

je0923

4 days ago

thanks i was on the fence about trying the 2 bit i wish unsloth studio was easy to set up like lm studio and had a app or something maybe that would hurt performance too much idk looks great tho

If you use llama.cpp, there's also a webui there with llama-server :)

curtis1969

3 days ago

•

edited 2 days ago

This model is sort of good even at Q2_K_XL i had it make a website with this prompt (make a mk ultra website in html make it amazing ) and i got 16k tokens 1.7k lines of code I've tested online it seems produces way better websites than gemma 4 and in LM studio at least with the DuckDuckGo and visit website tool it's far less lazy with tool calls it could just be the way they're configured but even when I tested gemma4 in AI studio they hardly ever use the Google grounding and the higher benchmark scores are more than likely because of training for tool use 🙃sorry for rant [{DATE: 4/19/26 }] i recommend to put this in the System Prompt
and one shot space invader game at Q2_K_XL

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment