Welcome to CRMA Fine-Tuner — what it does and how to use it

by Fourwheels2512 - opened Feb 22

Thanks for visiting CRMA Fine-Tuner!

This Space lets you fine-tune TinyLlama, Gemma 2B, and Mistral-7B on your own data with built-in training stability (CRMA + ZClip).

1. Upload a CSV or JSONL file with instruction and output columns.
2. Choose your model (TinyLlama for fast tests, Gemma or Mistral for better quality).
3. Hit Run and monitor the live loss and gradient norm charts.
4. Download your adapter ZIP when training completes. It includes the merged weights, the CRMA adapter, and an injector script.
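For anyone unsure about the upload format described above, here is a minimal sketch of building and checking a JSONL file with the two expected keys. The example rows, the file name train.jsonl, and the helper names write_jsonl and check_jsonl are illustrative assumptions, not part of the Space itself; the Space may apply stricter or looser validation.

```python
import json
import os
import tempfile

# Hypothetical example rows; the keys match the "instruction" and "output"
# columns the Space asks for.
rows = [
    {"instruction": "Translate to French: Good morning", "output": "Bonjour"},
    {"instruction": "What is 2 + 2?", "output": "4"},
]

def write_jsonl(path, records):
    """Write one JSON object per line (the JSONL convention)."""
    with open(path, "w", encoding="utf-8") as f:
        for record in records:
            f.write(json.dumps(record, ensure_ascii=False) + "\n")

def check_jsonl(path, required=("instruction", "output")):
    """Return the number of usable rows; raise ValueError on a bad row."""
    count = 0
    with open(path, encoding="utf-8") as f:
        for line_no, line in enumerate(f, start=1):
            if not line.strip():
                continue  # tolerate blank lines
            record = json.loads(line)
            # A key counts as missing if absent, not a string, or empty.
            missing = [
                k for k in required
                if not isinstance(record.get(k), str) or not record[k].strip()
            ]
            if missing:
                raise ValueError(f"line {line_no}: missing or empty {missing}")
            count += 1
    return count

path = os.path.join(tempfile.mkdtemp(), "train.jsonl")
write_jsonl(path, rows)
print(check_jsonl(path), "valid rows written to", path)
```

Running a check like this locally before uploading should catch empty cells and misspelled column names early.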

CRMA (Constrained Residual Mixing Adapter) adds a Sinkhorn-constrained mixing layer alongside LoRA. In our ablations:
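To give a feel for what a Sinkhorn constraint does, here is a rough sketch of the standard Sinkhorn-Knopp normalization, which pushes a square matrix of positive weights toward a doubly stochastic one (every row and every column sums to 1). This is an assumption about the general technique only: the function name sinkhorn, the iteration count, and the example logits are made up for illustration, and this is not the CRMA implementation or a description of how CRMA applies the resulting matrix.

```python
import math

def sinkhorn(logits, n_iters=50):
    """Alternately normalize rows and columns of exp(logits).

    logits: square list of lists of floats (hypothetical mixing weights).
    Returns a matrix whose rows and columns each sum to approximately 1.
    """
    n = len(logits)
    # Subtract the global max before exponentiating for numerical stability.
    peak = max(max(row) for row in logits)
    mat = [[math.exp(x - peak) for x in row] for row in logits]
    for _ in range(n_iters):
        # Row step: make every row sum to 1.
        mat = [[x / sum(row) for x in row] for row in mat]
        # Column step: make every column sum to 1.
        col_sums = [sum(mat[i][j] for i in range(n)) for j in range(n)]
        mat = [[mat[i][j] / col_sums[j] for j in range(n)] for i in range(n)]
    return mat

# Illustrative 3x3 logits, not taken from any trained adapter.
example = [[0.5, -0.3, 1.0], [0.2, 0.0, -0.5], [-1.0, 0.8, 0.3]]
mixing = sinkhorn(example)
for row in mixing:
    print([round(x, 4) for x in row])
```

Because each row and column of such a matrix sums to 1, applying it averages its inputs rather than amplifying them, which is one plausible reason a constraint of this kind could keep gradient norms bounded.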

TinyLlama: peak gradient norm was 52.7% lower than with plain LoRA, with the same final loss.
Mistral-7B: plain LoRA hit a catastrophic spike (gradient norm ~263); CRMA held the gradient norm at ~3 (98.9% lower).
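The Mistral-7B percentage can be checked from the two rounded gradient norms quoted above. This is only a consistency check on the approximate figures (~263 and ~3), not a recomputation from the raw training logs, and percent_change is a helper name chosen for this example.

```python
def percent_change(baseline, value):
    """Relative change of value versus baseline, in percent."""
    return (value - baseline) / baseline * 100.0

# Rounded gradient norms from the Mistral-7B ablation described above.
plain_lora_peak = 263.0
crma_peak = 3.0

print(round(percent_change(plain_lora_peak, crma_peak), 1))  # -98.9
```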

Drop a comment here — happy to help with dataset formatting, model selection, or anything else.
