1.77 GB

1 contributor

History: 10 commits

rafiakedir

feat: upload actual trained LoRA adapter (Qwen2.5-1.5B ORPO, 3 epochs, 36 steps)

f02a80f verified 18 days ago

Name                                          Size           Updated      Last commit
ablations                                     -              18 days ago  fix: corrected Colab notebook (judge-format) + fixed ablation HF loader
.gitattributes                                1.57 kB        21 days ago  (Trained with Unsloth)
README.md                                     6.09 kB        18 days ago  feat: upload actual trained LoRA adapter (Qwen2.5-1.5B ORPO, 3 epochs, 36 steps)
adapter_config.json                           1.19 kB        18 days ago  Upload model trained with Unsloth
adapter_model.safetensors                     8.73 MB (xet)  18 days ago  Upload model trained with Unsloth
build_judge_pairs.py                          8.85 kB        18 days ago  feat: upload actual trained LoRA adapter (Qwen2.5-1.5B ORPO, 3 epochs, 36 steps)
chat_template.jinja                           2.51 kB        18 days ago  Upload model trained with Unsloth
config.json                                   3.22 kB        21 days ago  (Trained with Unsloth)
hyperparams.json                              1.57 kB        18 days ago  feat: add model card, inference example, and training scripts
inference_example.py                          18.7 kB        18 days ago  feat: upload actual trained LoRA adapter (Qwen2.5-1.5B ORPO, 3 epochs, 36 steps)
model.safetensors-00001-of-00001.safetensors  1.75 GB (xet)  21 days ago  (Trained with Unsloth)
model.safetensors.index.json                  50.9 kB        21 days ago  (Trained with Unsloth)
processor_config.json                         1.3 kB         21 days ago  (Trained with Unsloth)
requirements_training.txt                     241 Bytes      18 days ago  feat: add model card, inference example, and training scripts
run_on_colab.ipynb                            11 kB          18 days ago  fix: corrected Colab notebook (judge-format) + fixed ablation HF loader
tokenizer.json                                11.4 MB (xet)  18 days ago  Upload model trained with Unsloth
tokenizer_config.json                         4.56 kB        18 days ago  Upload model trained with Unsloth
train_judge.py                                6.72 kB        18 days ago  feat: add model card, inference example, and training scripts