Qwen3.5-35B-A3B-Opus-Reasoning-Distilled-v2
Model Description
This model is fine-tuned from unsloth/Qwen3.5-35B-A3B with enhanced reasoning capabilities.
Training Details
- Base Model: unsloth/Qwen3.5-35B-A3B
- Method: bf16 LoRA + response-only (train_on_responses_only)
- LoRA Rank: 16
- Epochs: 2
- Max Sequence Length: 4096
- Learning Rate: 2e-5
- Framework: Unsloth + TRL
Datasets
nohurry/Opus-4.6-Reasoning-3000x-filteredJackrong/Qwen3.5-reasoning-700xRoman1111111/claude-opus-4.6-10000x
Format
The model uses <think>...</think> tags for chain-of-thought reasoning.
- Downloads last month
- 31
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support
Model tree for ponytang3/Qwen3.5-35B-A3B-Opus-Reasoning-Distilled-v2
Base model
Qwen/Qwen3.5-35B-A3B-Base Finetuned
Qwen/Qwen3.5-35B-A3B Finetuned
unsloth/Qwen3.5-35B-A3B