chardizard committed
Commit 6b60a20 · verified · 1 parent: 085850e

Add model card

Files changed (1)
  1. README.md +30 -0
README.md ADDED
@@ -0,0 +1,30 @@
+ ---
+ license: apache-2.0
+ base_model: Qwen/Qwen3-8B
+ tags:
+ - rubric-generation
+ - sft
+ - grubric
+ ---
+
+ # SFTQwen3-8B-OpenRubrics-v1
+
+ Qwen3-8B, fully fine-tuned on [OpenRubrics v1](https://huggingface.co/datasets/maxreciprocate/OpenRubrics) (~35.4k examples) to generate evaluation rubrics.
+
+ ## Training
+
+ - **Base model:** Qwen/Qwen3-8B
+ - **Dataset:** OpenRubrics v1 (35,406 examples)
+ - **Epochs:** 1
+ - **Learning rate:** 8e-6 (cosine schedule)
+ - **Effective batch size:** 128 (per-device=2, gradient accumulation=8, 8 GPUs)
+ - **Max sequence length:** 3072
+
+ ## Task
+
+ Given a user prompt, the model generates a structured evaluation rubric in `[Hard Rule]` / `[Principle]` format. These rubrics are used to judge LLM response quality.
+
+ ## Evaluation
+
+ - ~83.5% format validity on Chatbot Arena prompts
+ - Used as the baseline rubric generator in the GRUBRIC pipeline
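
As a sanity check on the Training figures in the README above, the effective batch size is the product of the per-device batch size, the gradient accumulation steps, and the GPU count. The minimal sketch below recomputes it and estimates how many optimizer steps the single epoch takes. The function names are hypothetical, and the step count assumes one example per sequence, with no packing and no examples dropped by the 3072-token limit; the README states neither.

```python
import math

# Values copied from the Training list in the model card.
PER_DEVICE_BATCH = 2
GRAD_ACCUM_STEPS = 8
NUM_GPUS = 8
NUM_EXAMPLES = 35_406
EPOCHS = 1


def effective_batch_size(per_device: int, grad_accum: int, num_gpus: int) -> int:
    """Examples consumed per optimizer step across all devices."""
    return per_device * grad_accum * num_gpus


def optimizer_steps(num_examples: int, batch_size: int, epochs: int = 1,
                    drop_last: bool = False) -> int:
    """Optimizer steps for the run (hypothetical helper).

    Assumes one example per sequence (no packing, no length filtering).
    With drop_last=True the final partial batch is discarded.
    """
    if drop_last:
        per_epoch = num_examples // batch_size
    else:
        per_epoch = math.ceil(num_examples / batch_size)
    return per_epoch * epochs


batch = effective_batch_size(PER_DEVICE_BATCH, GRAD_ACCUM_STEPS, NUM_GPUS)
steps = optimizer_steps(NUM_EXAMPLES, batch, EPOCHS)
print(f"effective batch size: {batch}")
print(f"optimizer steps in {EPOCHS} epoch(s): {steps}")
```

Under these assumptions the product 2 × 8 × 8 matches the stated effective batch size of 128, and one pass over 35,406 examples takes 277 optimizer steps (276 if the final partial batch is dropped).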