josephmayo commited on
Commit
f71dca8
·
verified ·
1 Parent(s): a842d9b

Update model card proof

Browse files
Files changed (1) hide show
  1. README.md +41 -30
README.md CHANGED
@@ -1,22 +1,26 @@
1
  ---
2
- base_model: google/gemma-4-E4B-it
3
- library_name: transformers
4
- license: apache-2.0
5
- tags:
6
- - gemma4
7
- - coding
8
- - merged-lora
9
- - kaggle-proof
10
- ---
11
-
12
- # Gemma 4 E4B IT Coding Merged
13
-
14
- This is the full merged model from `google/gemma-4-E4B-it` plus the LoRA adapter
15
- `josephmayo/gemma-4-E4B-it-coding-lora`.
16
-
17
- ## Training Proof
18
-
19
- Training ran on Kaggle with 2x Tesla T4 GPUs.
 
 
 
 
20
 
21
  | Item | Value |
22
  |---|---:|
@@ -24,15 +28,22 @@ Training ran on Kaggle with 2x Tesla T4 GPUs.
24
  | LoRA steps | 200 |
25
  | LoRA rank | 16 |
26
  | LoRA alpha | 32 |
27
- | Trainable parameters | 50,499,584 |
28
- | Final train loss | 1.1427 |
29
- | HumanEval executable before | 5/8 |
30
- | HumanEval executable after | 7/8 |
31
-
32
- The executable eval used the first 8 HumanEval tasks and is included as
33
- `executable_eval.json`. The before/after generations are summarized in
34
- `eval_before_after.csv`, and training logs are in `trainer_log_history.json`.
35
-
36
- This model is for benign coding assistance only. The training filter removed
37
- malware, phishing, exploit, credential theft, evasion, and destructive automation
38
- examples.
 
 
 
 
 
 
 
 
1
  ---
2
+ base_model: google/gemma-4-E4B-it
3
+ library_name: transformers
4
+ license: apache-2.0
5
+ tags:
6
+ - gemma4
7
+ - coder
8
+ - coding
9
+ - merged-lora
10
+ - kaggle-proof
11
+ ---
12
+
13
+ # Gemma 4 E4B IT Coder
14
+
15
+ This is the full merged coding-tuned model from `google/gemma-4-E4B-it` plus the
16
+ LoRA adapter `josephmayo/gemma-4-E4B-it-coding-lora`.
17
+
18
+ The release is not adapter-only: the LoRA deltas were merged directly into the
19
+ base safetensors and uploaded as a normal Transformers model.
20
+
21
+ ## Training Proof
22
+
23
+ Training ran on Kaggle with 2x Tesla T4 GPUs.
24
 
25
  | Item | Value |
26
  |---|---:|
 
28
  | LoRA steps | 200 |
29
  | LoRA rank | 16 |
30
  | LoRA alpha | 32 |
31
+ | Trainable parameters | 50,499,584 |
32
+ | Final train loss | 1.1427 |
33
+ | HumanEval executable before | 5/8 |
34
+ | HumanEval executable after | 7/8 |
35
+ | Merged LoRA tensors applied | 592/592 |
36
+ | Missing LoRA targets | 0 |
37
+ | Merged safetensor shards | 5 |
38
+
39
+ Proof files included in this repo:
40
+
41
+ - `executable_eval.json`: executable HumanEval subset result, 5/8 before vs 7/8 after.
42
+ - `eval_before_after.csv`: fixed before/after coding prompts and scores.
43
+ - `trainer_log_history.json`: training loss and runtime logs.
44
+ - `merge_manifest.json`: direct merge record, including 592 applied LoRA tensors and 0 missing targets.
45
+ - `model.safetensors.index.json`: shard index for the full merged model.
46
+
47
+ This model is for benign coding assistance only. The training filter removed
48
+ malware, phishing, exploit, credential theft, evasion, and destructive automation
49
+ examples.