CoLaR Qwen3-4B Flawed Fictions SFT
Compressed Latent Reasoning (CoLaR) model fine-tuned with supervised learning on the Flawed Fictions dataset.
Base model: Qwen/Qwen3-4B-Instruct-2507
Checkpoints
| Tag | Epoch | Step | val/loss |
|---|---|---|---|
best-epoch02-val_loss=3.1664 |
2 | 19 | 3.1664 |
second-epoch03-val_loss=3.4093 |
3 | 31 | 3.4093 |
last-epoch04-val_loss=4.0637 |
4 | 39 | 4.0637 |
Each checkpoint is stored as a tagged commit on main. Use:
from huggingface_hub import snapshot_download
snapshot_download("agurung/colar-qwen3-4b-ff-sft", revision="best-epoch02-val_loss=3.1664")
File Structure
model.safetensorsโ LLM weights (merged LoRA if applicable)extra_state.ptโ Latent policy network weightsexport_meta.jsonโ Export metadata
- Downloads last month
- 2
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐ Ask for provider support
Model tree for agurung/colar-qwen3-4b-ff-sft
Base model
Qwen/Qwen3-4B-Instruct-2507