new version
#1
by yqchen-sci - opened
I believe this small 4B model is currently the best local assistant for running on a laptop, well suited to tasks such as polishing text paragraphs, processing meeting transcripts, and translation. Now that the latest version has just been released, are there any plans for further fine-tuning in the next version?
Could you add a merged version? I'd like to load this as the CLIP/text encoder for z-image to experiment with, and I can't reliably use torch on my AMD rig to merge the shards myself. Apologies for my noob status here.
from transformers import AutoModelForCausalLM, AutoTokenizer
import torch

OLD_MODEL_ID = "huihui-ai/Huihui-Qwen3-4B-abliterated-v2"
NEW_MODEL_ID = "huihui-ai/Huihui-Qwen3-4B-abliterated-New"

# Load the existing checkpoint in bfloat16
model = AutoModelForCausalLM.from_pretrained(
    OLD_MODEL_ID,
    device_map="auto",
    trust_remote_code=True,
    torch_dtype=torch.bfloat16,
)
tokenizer = AutoTokenizer.from_pretrained(OLD_MODEL_ID, trust_remote_code=True)

# Re-save with a 10GB shard cap: the bf16 weights of a 4B model are about 8GB,
# so everything lands in a single safetensors file
model.save_pretrained(NEW_MODEL_ID, max_shard_size="10GB")
tokenizer.save_pretrained(NEW_MODEL_ID)