josephmayo commited on
Commit
b8152ce
·
verified ·
1 Parent(s): db84f1e

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +40 -51
README.md CHANGED
@@ -1,51 +1,40 @@
1
- ---
2
- base_model: google/gemma-4-E4B-it
3
- library_name: peft
4
- license: apache-2.0
5
- tags:
6
- - gemma4
7
- - coding
8
- - qlora
9
- - kaggle-proof
10
- ---
11
-
12
- # Gemma 4 E4B IT Coding LoRA
13
-
14
- QLoRA adapter for `google/gemma-4-E4B-it`, trained on filtered benign coding instructions.
15
-
16
- ## Training
17
-
18
- - Runtime: Kaggle 2x Tesla T4
19
- - Data: filtered benign coding instruction data
20
- - Safe rows used: 1024
21
- - Steps: 200
22
- - LoRA: r=16, alpha=32, target_modules=`all-linear`
23
- - Trainable parameters: 50,499,584
24
- - Final train loss: 1.1427
25
-
26
- ## Proof
27
-
28
- - HumanEval subset: first 8 tasks
29
- - Executable pass count before: 5/8
30
- - Executable pass count after: 7/8
31
- - Heuristic score before: 0.7688
32
- - Heuristic score after: 0.7688
33
- - Relative executable pass-count increase: 40%
34
- - Absolute executable pass-rate increase: +25 percentage points
35
-
36
- The public executable proof is intentionally small because the Kaggle GPU-hour
37
- budget was exhausted during training, merge preparation, and upload validation.
38
- `eval_before_after.csv` contains output previews; executable pass/fail proof is
39
- recorded in `executable_eval.json`.
40
-
41
- Artifacts included:
42
-
43
- - `eval_before_after.csv`
44
- - `executable_eval.json`
45
- - `trainer_log_history.json`
46
- - `summary.json`
47
- - `proof_summary.json`
48
- - `nvidia_smi.txt`
49
- - `evaluation_scope.json`
50
-
51
- This adapter is for benign coding assistance only. It was not trained on malware, phishing, exploit, credential theft, evasion, or destructive automation examples.
 
1
+ ---
2
+ base_model: google/gemma-4-E4B-it
3
+ library_name: peft
4
+ license: apache-2.0
5
+ tags:
6
+ - gemma4
7
+ - coding
8
+ - qlora
9
+ - kaggle-proof
10
+ ---
11
+
12
+ # Gemma 4 E4B IT Coding LoRA
13
+
14
+ QLoRA adapter for `google/gemma-4-E4B-it`, trained on filtered benign coding instructions.
15
+
16
+ ## Training
17
+
18
+ - Runtime: Kaggle 2x Tesla T4
19
+ - Data: filtered benign coding instruction data
20
+ - Safe rows used: 1024
21
+ - Steps: 200
22
+ - LoRA: r=16, alpha=32, target_modules=`all-linear`
23
+ - Trainable parameters: 50,499,584
24
+ - Final train loss: 1.1427
25
+
26
+ ## 50-Problem HumanEval Proof
27
+
28
+ This adapter was merged into `josephmayo/gemma-4-E4B-it-Coder` and evaluated on Kaggle with 2x Tesla T4 GPUs using an executable 50-task HumanEval subset. Full generated before/after code is published in `eval50_before_after_full_code.csv`.
29
+
30
+ | Metric | Base `google/gemma-4-E4B-it` | Coder merge |
31
+ |---|---:|---:|
32
+ | Pass count | 34 / 50 | 42 / 50 |
33
+ | Absolute lift | - | +16.0 pp |
34
+ | Relative pass-count lift | - | +23.53% |
35
+
36
+ Proof files included here: `eval50_summary.json`, `eval50_before_after_full_code.csv`, `EVAL50_README.md`, `nvidia_smi.txt`.
37
+
38
+ Earlier 8-task smoke artifacts are still included for reproducibility (`eval_before_after.csv`, `executable_eval.json`, `trainer_log_history.json`, `summary.json`, `proof_summary.json`, `evaluation_scope.json`), but the headline proof is the 50-task executable run above.
39
+
40
+ This adapter is for benign coding assistance only. It was not trained on malware, phishing, exploit, credential theft, evasion, or destructive automation examples.