Spaces:
Sleeping
Sleeping
Commit ·
b37dbfa
1
Parent(s): 3aab9e5
update baseline scores and readme
Browse files- README.md +4 -4
- baseline_scores.json +4 -4
- inference_log.txt +0 -0
README.md
CHANGED
|
@@ -197,10 +197,10 @@ Rewards are bounded to **[−0.05, 1.2]**. A partial reward is emitted on every
|
|
| 197 |
|
| 198 |
| Task | Difficulty | Score |
|
| 199 |
|---|---|---|
|
| 200 |
-
| 1 — Fill Missing Values | Easy |
|
| 201 |
-
| 2 — Fix Formats + Duplicates | Medium |
|
| 202 |
-
| 3 — Full Cleaning Pipeline | Hard |
|
| 203 |
-
| **Average** | — | **
|
| 204 |
|
| 205 |
*Produced by `google/gemma-3-27b-it` via NVIDIA NIM, `temperature=0`. Full step-by-step agent logs: `inference_log.txt`.*
|
| 206 |
|
|
|
|
| 197 |
|
| 198 |
| Task | Difficulty | Score |
|
| 199 |
|---|---|---|
|
| 200 |
+
| 1 — Fill Missing Values | Easy | 0.999 |
|
| 201 |
+
| 2 — Fix Formats + Duplicates | Medium | 0.999 |
|
| 202 |
+
| 3 — Full Cleaning Pipeline | Hard | 0.999 |
|
| 203 |
+
| **Average** | — | **0.999** |
|
| 204 |
|
| 205 |
*Produced by `google/gemma-3-27b-it` via NVIDIA NIM, `temperature=0`. Full step-by-step agent logs: `inference_log.txt`.*
|
| 206 |
|
baseline_scores.json
CHANGED
|
@@ -1,8 +1,8 @@
|
|
| 1 |
{
|
| 2 |
"scores": {
|
| 3 |
-
"task1":
|
| 4 |
-
"task2":
|
| 5 |
-
"task3":
|
| 6 |
},
|
| 7 |
-
"average":
|
| 8 |
}
|
|
|
|
| 1 |
{
|
| 2 |
"scores": {
|
| 3 |
+
"task1": 0.999,
|
| 4 |
+
"task2": 0.999,
|
| 5 |
+
"task3": 0.999
|
| 6 |
},
|
| 7 |
+
"average": 0.999
|
| 8 |
}
|
inference_log.txt
CHANGED
|
Binary files a/inference_log.txt and b/inference_log.txt differ
|
|
|