josephmayo's picture
Update README.md
1bd638e verified
---
base_model: google/gemma-4-E4B-it
library_name: transformers
license: apache-2.0
tags:
- gemma4
- coder
- coding
- merged-lora
- kaggle-proof
---
# Gemma 4 E4B IT Coder
This is the full merged coding-tuned model from `google/gemma-4-E4B-it` plus the
LoRA adapter `josephmayo/gemma-4-E4B-it-coding-lora`.
The release is not adapter-only: the LoRA deltas were merged directly into the
base safetensors and uploaded as a normal Transformers model.
## Training Proof
Training ran on Kaggle with 2x Tesla T4 GPUs.
| Item | Value |
|---|---:|
| Safe coding rows | 1024 |
| LoRA steps | 200 |
| LoRA rank | 16 |
| LoRA alpha | 32 |
| Trainable parameters | 50,499,584 |
| Final train loss | 1.1427 |
| Merged LoRA tensors applied | 592/592 |
| Missing LoRA targets | 0 |
| Merged safetensor shards | 5 |
## HumanEval Results (50-Problem Subset)
Evaluated on Kaggle with 2x Tesla T4 GPUs using an executable 50-task HumanEval subset. Full generated before/after code is published in `eval50_before_after_full_code.csv`.
| Metric | Base `google/gemma-4-E4B-it` | Coder |
|---|---:|---:|
| Pass count | 34 / 50 | 42 / 50 |
| Absolute lift | - | +16.0 pp |
| Relative pass-count lift | - | +23.53% |
Proof files included in this repo:
- `eval50_summary.json`: 50-problem HumanEval executable result.
- `eval50_before_after_full_code.csv`: full generated before/after code for all 50 tasks.
- `EVAL50_README.md`: evaluation methodology and scope.
- `nvidia_smi.txt`: GPU environment proof.
- `eval_before_after.csv`: fixed before/after coding prompt scores with output previews.
- `trainer_log_history.json`: training loss and runtime logs.
- `merge_manifest.json`: direct merge record, including 592 applied LoRA tensors and 0 missing targets.
- `model.safetensors.index.json`: shard index for the full merged model.
This model is for benign coding assistance only. The training filter removed
malware, phishing, exploit, credential theft, evasion, and destructive automation
examples.