Solo

Model Details

Base Model Qwen/Qwen3-0.6B
Method LoRA (PEFT)
Parameters 0.6B

Training Hyperparameters

Epochs 2
Max Steps 100
Batch Size 2
Gradient Accumulation 4
Learning Rate 0.0002
LoRA r 4
LoRA Alpha 4
Max Sequence Length 2048
Training Duration 3m

Dataset

openai/gsm8k


Trained with Solo

Downloads last month
298
Safetensors
Model size
0.6B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for zeeshaan-ai/solo-tune-test684

Finetuned
Qwen/Qwen3-0.6B
Adapter
(361)
this model

Dataset used to train zeeshaan-ai/solo-tune-test684