josephmayo commited on
Commit
ef0f973
·
verified ·
1 Parent(s): 4385474

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +59 -47
README.md CHANGED
@@ -1,26 +1,26 @@
1
  ---
2
- base_model: google/gemma-4-E4B-it
3
- library_name: transformers
4
- license: apache-2.0
5
- tags:
6
- - gemma4
7
- - coder
8
- - coding
9
- - merged-lora
10
- - kaggle-proof
11
- ---
12
-
13
- # Gemma 4 E4B IT Coder
14
-
15
- This is the full merged coding-tuned model from `google/gemma-4-E4B-it` plus the
16
- LoRA adapter `josephmayo/gemma-4-E4B-it-coding-lora`.
17
-
18
- The release is not adapter-only: the LoRA deltas were merged directly into the
19
- base safetensors and uploaded as a normal Transformers model.
20
-
21
- ## Training Proof
22
-
23
- Training ran on Kaggle with 2x Tesla T4 GPUs.
24
 
25
  | Item | Value |
26
  |---|---:|
@@ -28,28 +28,40 @@ Training ran on Kaggle with 2x Tesla T4 GPUs.
28
  | LoRA steps | 200 |
29
  | LoRA rank | 16 |
30
  | LoRA alpha | 32 |
31
- | Trainable parameters | 50,499,584 |
32
- | Final train loss | 1.1427 |
33
- | HumanEval executable before | 5/8 |
34
- | HumanEval executable after | 7/8 |
35
- | Merged LoRA tensors applied | 592/592 |
36
- | Missing LoRA targets | 0 |
37
- | Merged safetensor shards | 5 |
38
-
39
- Proof files included in this repo:
40
-
41
- - `executable_eval.json`: executable HumanEval subset result, 5/8 before vs 7/8 after.
42
- - `eval_before_after.csv`: fixed before/after coding prompt scores with output previews.
43
- - `trainer_log_history.json`: training loss and runtime logs.
44
- - `merge_manifest.json`: direct merge record, including 592 applied LoRA tensors and 0 missing targets.
45
- - `model.safetensors.index.json`: shard index for the full merged model.
46
- - `evaluation_scope.json`: explains why the public proof uses the first 8 HumanEval tasks.
47
-
48
- The public executable proof is intentionally small because the Kaggle GPU-hour
49
- budget was exhausted during training, merge preparation, and upload validation.
50
- The preserved before/after CSV contains previews rather than full generated code;
51
- the pass/fail proof is in `executable_eval.json`.
52
-
53
- This model is for benign coding assistance only. The training filter removed
54
- malware, phishing, exploit, credential theft, evasion, and destructive automation
55
- examples.
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ base_model: google/gemma-4-E4B-it
3
+ library_name: transformers
4
+ license: apache-2.0
5
+ tags:
6
+ - gemma4
7
+ - coder
8
+ - coding
9
+ - merged-lora
10
+ - kaggle-proof
11
+ ---
12
+
13
+ # Gemma 4 E4B IT Coder
14
+
15
+ This is the full merged coding-tuned model from `google/gemma-4-E4B-it` plus the
16
+ LoRA adapter `josephmayo/gemma-4-E4B-it-coding-lora`.
17
+
18
+ The release is not adapter-only: the LoRA deltas were merged directly into the
19
+ base safetensors and uploaded as a normal Transformers model.
20
+
21
+ ## Training Proof
22
+
23
+ Training ran on Kaggle with 2x Tesla T4 GPUs.
24
 
25
  | Item | Value |
26
  |---|---:|
 
28
  | LoRA steps | 200 |
29
  | LoRA rank | 16 |
30
  | LoRA alpha | 32 |
31
+ | Trainable parameters | 50,499,584 |
32
+ | Final train loss | 1.1427 |
33
+ | HumanEval executable before | 5/8 |
34
+ | HumanEval executable after | 7/8 |
35
+ | Merged LoRA tensors applied | 592/592 |
36
+ | Missing LoRA targets | 0 |
37
+ | Merged safetensor shards | 5 |
38
+
39
+ Proof files included in this repo:
40
+
41
+ - `executable_eval.json`: executable HumanEval subset result, 5/8 before vs 7/8 after.
42
+ - `eval_before_after.csv`: fixed before/after coding prompt scores with output previews.
43
+ - `trainer_log_history.json`: training loss and runtime logs.
44
+ - `merge_manifest.json`: direct merge record, including 592 applied LoRA tensors and 0 missing targets.
45
+ - `model.safetensors.index.json`: shard index for the full merged model.
46
+ - `evaluation_scope.json`: explains why the public proof uses the first 8 HumanEval tasks.
47
+
48
+ The public executable proof is intentionally small because the Kaggle GPU-hour
49
+ budget was exhausted during training, merge preparation, and upload validation.
50
+ The preserved before/after CSV contains previews rather than full generated code;
51
+ the pass/fail proof is in `executable_eval.json`.
52
+
53
+ This model is for benign coding assistance only. The training filter removed
54
+ malware, phishing, exploit, credential theft, evasion, and destructive automation
55
+ examples.
56
+
57
+ ## 50-Problem HumanEval Proof
58
+
59
+ Evaluated on Kaggle with 2x Tesla T4 GPUs using an executable 50-task HumanEval subset. Full generated before/after code is published in `eval50_before_after_full_code.csv`.
60
+
61
+ | Metric | Base `google/gemma-4-E4B-it` | Coder |
62
+ |---|---:|---:|
63
+ | Pass count | 34 / 50 | 42 / 50 |
64
+ | Absolute lift | - | +16.0 pp |
65
+ | Relative pass-count lift | - | +23.53% |
66
+
67
+ Proof files: `eval50_summary.json`, `eval50_before_after_full_code.csv`, `EVAL50_README.md`, `nvidia_smi.txt`.