Pranavz/qwen3-0p6b-base-capybara-sft-1epoch-lora

LoRA adapter trained from Qwen/Qwen3-0.6B-Base on trl-lib/Capybara using supervised fine-tuning (SFT).

This repository contains adapter weights only (PEFT/LoRA).

Merged model from the same run: Pranavz/qwen3-0p6b-base-capybara-sft-1epoch-merged

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Pranavz/qwen3-0p6b-base-capybara-sft-1epoch-lora

Adapter
(56)
this model

Dataset used to train Pranavz/qwen3-0p6b-base-capybara-sft-1epoch-lora