srishtichugh committed on
Commit b37dbfa · 1 Parent(s): 3aab9e5

update baseline scores and readme

Files changed (3)
  1. README.md +4 -4
  2. baseline_scores.json +4 -4
  3. inference_log.txt +0 -0
README.md CHANGED
@@ -197,10 +197,10 @@ Rewards are bounded to **[−0.05, 1.2]**. A partial reward is emitted on every
 
 | Task | Difficulty | Score |
 |---|---|---|
-| 1 — Fill Missing Values | Easy | 1.000 |
-| 2 — Fix Formats + Duplicates | Medium | 1.000 |
-| 3 — Full Cleaning Pipeline | Hard | 1.000 |
-| **Average** | — | **1.000** |
+| 1 — Fill Missing Values | Easy | 0.999 |
+| 2 — Fix Formats + Duplicates | Medium | 0.999 |
+| 3 — Full Cleaning Pipeline | Hard | 0.999 |
+| **Average** | — | **0.999** |
 
 *Produced by `google/gemma-3-27b-it` via NVIDIA NIM, `temperature=0`. Full step-by-step agent logs: `inference_log.txt`.*
 
baseline_scores.json CHANGED
@@ -1,8 +1,8 @@
 {
   "scores": {
-    "task1": 1.0,
-    "task2": 1.0,
-    "task3": 1.0
+    "task1": 0.999,
+    "task2": 0.999,
+    "task3": 0.999
   },
-  "average": 1.0
+  "average": 0.999
 }
inference_log.txt CHANGED
Binary files a/inference_log.txt and b/inference_log.txt differ
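
As a quick consistency check on the updated `baseline_scores.json`, the sketch below recomputes the mean of the per-task scores and compares it with the stored `average`. The helper name `check_average` and the tolerance are assumptions made for illustration and are not part of the repository; the JSON literal mirrors the new file contents shown in the diff.

```python
import json
import math

# New contents of baseline_scores.json, copied from the diff.
NEW_SCORES_JSON = """
{
  "scores": {"task1": 0.999, "task2": 0.999, "task3": 0.999},
  "average": 0.999
}
"""


def check_average(payload: dict, tol: float = 1e-9) -> bool:
    """Hypothetical helper: True if 'average' equals the mean of 'scores' within tol."""
    scores = list(payload["scores"].values())
    mean = sum(scores) / len(scores)
    return math.isclose(mean, payload["average"], abs_tol=tol)


data = json.loads(NEW_SCORES_JSON)
consistent = check_average(data)
print("average matches per-task scores:", consistent)
```

A check like this could run before committing, so the README table and the JSON file are less likely to drift apart when scores are edited by hand.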