ponytang3
/

Qwen3.5-35B-A3B-Opus-Reasoning-Distilled-v2

chain-of-thought

Model card Files Files and versions

Qwen3.5-35B-A3B-Opus-Reasoning-Distilled-v2

Model Description

This model is fine-tuned from unsloth/Qwen3.5-35B-A3B with enhanced reasoning capabilities.

Training Details

Base Model: unsloth/Qwen3.5-35B-A3B
Method: bf16 LoRA + response-only (train_on_responses_only)
LoRA Rank: 16
Epochs: 2
Max Sequence Length: 4096
Learning Rate: 2e-5
Framework: Unsloth + TRL

Datasets

nohurry/Opus-4.6-Reasoning-3000x-filtered
Jackrong/Qwen3.5-reasoning-700x
Roman1111111/claude-opus-4.6-10000x

Format

The model uses <think>...</think> tags for chain-of-thought reasoning.

Downloads last month: 31

Safetensors

Model size

36B params

Tensor type

BF16

·

F32

·

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for ponytang3/Qwen3.5-35B-A3B-Opus-Reasoning-Distilled-v2

Base model

Qwen/Qwen3.5-35B-A3B-Base

Finetuned

Qwen/Qwen3.5-35B-A3B

Finetuned

unsloth/Qwen3.5-35B-A3B

Finetuned

(24)

this model

Quantizations

Datasets used to train ponytang3/Qwen3.5-35B-A3B-Opus-Reasoning-Distilled-v2