Spaces:
Sleeping
Sleeping
Welcome to CRMA Fine-Tuner โ what it does and how to use it
#1
by Fourwheels2512 - opened
Thanks for visiting CRMA Fine-Tuner!
This Space lets you fine-tune TinyLlama, Gemma 2B, and Mistral-7B on your own data with built-in training stability (CRMA + ZClip).
How to use
- Upload a CSV or JSONL file with
instructionandoutputcolumns - Choose your model (TinyLlama for fast tests, Gemma or Mistral for better quality)
- Hit Run and monitor live loss + gradient norm charts
- Download your adapter ZIP when complete โ includes merged weights + CRMA adapter + injector script
What makes this different from plain LoRA?
CRMA (Constrained Residual Mixing Adapter) adds a Sinkhorn-constrained mixing layer alongside LoRA. In our ablations:
- TinyLlama: peak gradient norm -52.7% vs plain LoRA, same final loss
- Mistral-7B: plain LoRA hit a catastrophic spike (grad norm ~263); CRMA held it at ~3 (-98.9%)
Questions or feedback?
Drop a comment here โ happy to help with dataset formatting, model selection, or anything else.