shopifyinterngrinder's picture
Upload fine-tuned model (#1)
603f5eb
metadata
base_model: Qwen/Qwen3-4B
language:
  - en
library_name: transformers
license: apache-2.0
pipeline_tag: text-generation
tags:
  - sidekick
  - sft
  - chat
  - shopify
datasets:
  - shopifyinterngrinder/sidekick-autocomplete-data

shopifyinterngrinder/sidekick-autocomplete

Fine-tuned from Qwen/Qwen3-4B using TRL SFT.

Training Details

Parameter Value
Base Model Qwen/Qwen3-4B
Dataset shopifyinterngrinder/sidekick-autocomplete-data @ main
Training Examples 900
Validation Examples 101
Epochs 3
Learning Rate 2e-05
Batch Size (per device) 1
Gradient Accumulation 2
Max Sequence Length 512
Precision bf16
Optimizer adamw_torch_fused
Warmup Steps 50
Weight Decay 0.01
LR Scheduler cosine
Packing Enabled
Dataset Format chat

Framework Versions

Library Version
Transformers 4.57.6
TRL 0.29.0
PyTorch 2.8.0+cu128
Datasets 3.6.0
Accelerate 1.13.0