josephmayo commited on
Commit
1bd638e
·
verified ·
1 Parent(s): ef0f973

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +14 -29
README.md CHANGED
@@ -9,19 +9,13 @@ tags:
9
  - merged-lora
10
  - kaggle-proof
11
  ---
12
-
13
  # Gemma 4 E4B IT Coder
14
-
15
  This is the full merged coding-tuned model from `google/gemma-4-E4B-it` plus the
16
  LoRA adapter `josephmayo/gemma-4-E4B-it-coding-lora`.
17
-
18
  The release is not adapter-only: the LoRA deltas were merged directly into the
19
  base safetensors and uploaded as a normal Transformers model.
20
-
21
  ## Training Proof
22
-
23
  Training ran on Kaggle with 2x Tesla T4 GPUs.
24
-
25
  | Item | Value |
26
  |---|---:|
27
  | Safe coding rows | 1024 |
@@ -30,32 +24,11 @@ Training ran on Kaggle with 2x Tesla T4 GPUs.
30
  | LoRA alpha | 32 |
31
  | Trainable parameters | 50,499,584 |
32
  | Final train loss | 1.1427 |
33
- | HumanEval executable before | 5/8 |
34
- | HumanEval executable after | 7/8 |
35
  | Merged LoRA tensors applied | 592/592 |
36
  | Missing LoRA targets | 0 |
37
  | Merged safetensor shards | 5 |
38
 
39
- Proof files included in this repo:
40
-
41
- - `executable_eval.json`: executable HumanEval subset result, 5/8 before vs 7/8 after.
42
- - `eval_before_after.csv`: fixed before/after coding prompt scores with output previews.
43
- - `trainer_log_history.json`: training loss and runtime logs.
44
- - `merge_manifest.json`: direct merge record, including 592 applied LoRA tensors and 0 missing targets.
45
- - `model.safetensors.index.json`: shard index for the full merged model.
46
- - `evaluation_scope.json`: explains why the public proof uses the first 8 HumanEval tasks.
47
-
48
- The public executable proof is intentionally small because the Kaggle GPU-hour
49
- budget was exhausted during training, merge preparation, and upload validation.
50
- The preserved before/after CSV contains previews rather than full generated code;
51
- the pass/fail proof is in `executable_eval.json`.
52
-
53
- This model is for benign coding assistance only. The training filter removed
54
- malware, phishing, exploit, credential theft, evasion, and destructive automation
55
- examples.
56
-
57
- ## 50-Problem HumanEval Proof
58
-
59
  Evaluated on Kaggle with 2x Tesla T4 GPUs using an executable 50-task HumanEval subset. Full generated before/after code is published in `eval50_before_after_full_code.csv`.
60
 
61
  | Metric | Base `google/gemma-4-E4B-it` | Coder |
@@ -64,4 +37,16 @@ Evaluated on Kaggle with 2x Tesla T4 GPUs using an executable 50-task HumanEval
64
  | Absolute lift | - | +16.0 pp |
65
  | Relative pass-count lift | - | +23.53% |
66
 
67
- Proof files: `eval50_summary.json`, `eval50_before_after_full_code.csv`, `EVAL50_README.md`, `nvidia_smi.txt`.
 
 
 
 
 
 
 
 
 
 
 
 
 
9
  - merged-lora
10
  - kaggle-proof
11
  ---
 
12
  # Gemma 4 E4B IT Coder
 
13
  This is the full merged coding-tuned model from `google/gemma-4-E4B-it` plus the
14
  LoRA adapter `josephmayo/gemma-4-E4B-it-coding-lora`.
 
15
  The release is not adapter-only: the LoRA deltas were merged directly into the
16
  base safetensors and uploaded as a normal Transformers model.
 
17
  ## Training Proof
 
18
  Training ran on Kaggle with 2x Tesla T4 GPUs.
 
19
  | Item | Value |
20
  |---|---:|
21
  | Safe coding rows | 1024 |
 
24
  | LoRA alpha | 32 |
25
  | Trainable parameters | 50,499,584 |
26
  | Final train loss | 1.1427 |
 
 
27
  | Merged LoRA tensors applied | 592/592 |
28
  | Missing LoRA targets | 0 |
29
  | Merged safetensor shards | 5 |
30
 
31
+ ## HumanEval Results (50-Problem Subset)
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
32
  Evaluated on Kaggle with 2x Tesla T4 GPUs using an executable 50-task HumanEval subset. Full generated before/after code is published in `eval50_before_after_full_code.csv`.
33
 
34
  | Metric | Base `google/gemma-4-E4B-it` | Coder |
 
37
  | Absolute lift | - | +16.0 pp |
38
  | Relative pass-count lift | - | +23.53% |
39
 
40
+ Proof files included in this repo:
41
+ - `eval50_summary.json`: 50-problem HumanEval executable result.
42
+ - `eval50_before_after_full_code.csv`: full generated before/after code for all 50 tasks.
43
+ - `EVAL50_README.md`: evaluation methodology and scope.
44
+ - `nvidia_smi.txt`: GPU environment proof.
45
+ - `eval_before_after.csv`: fixed before/after coding prompt scores with output previews.
46
+ - `trainer_log_history.json`: training loss and runtime logs.
47
+ - `merge_manifest.json`: direct merge record, including 592 applied LoRA tensors and 0 missing targets.
48
+ - `model.safetensors.index.json`: shard index for the full merged model.
49
+
50
+ This model is for benign coding assistance only. The training filter removed
51
+ malware, phishing, exploit, credential theft, evasion, and destructive automation
52
+ examples.