--- base_model: google/gemma-4-E4B-it library_name: peft license: apache-2.0 tags: - gemma4 - coding - qlora - kaggle-proof --- # Gemma 4 E4B IT Coding LoRA QLoRA adapter for `google/gemma-4-E4B-it`, trained on filtered benign coding instructions. ## Training - Runtime: Kaggle 2x Tesla T4 - Data: filtered benign coding instruction data - Safe rows used: 1024 - Steps: 200 - LoRA: r=16, alpha=32, target_modules=`all-linear` - Trainable parameters: 50,499,584 - Final train loss: 1.1427 ## 50-Problem HumanEval Proof This adapter was merged into `josephmayo/gemma-4-E4B-it-Coder` and evaluated on Kaggle with 2x Tesla T4 GPUs using an executable 50-task HumanEval subset. Full generated before/after code is published in `eval50_before_after_full_code.csv`. | Metric | Base `google/gemma-4-E4B-it` | Coder merge | |---|---:|---:| | Pass count | 34 / 50 | 42 / 50 | | Absolute lift | - | +16.0 pp | | Relative pass-count lift | - | +23.53% | Proof files included here: `eval50_summary.json`, `eval50_before_after_full_code.csv`, `EVAL50_README.md`, `nvidia_smi.txt`. Earlier 8-task smoke artifacts are still included for reproducibility (`eval_before_after.csv`, `executable_eval.json`, `trainer_log_history.json`, `summary.json`, `proof_summary.json`, `evaluation_scope.json`), but the headline proof is the 50-task executable run above. This adapter is for benign coding assistance only. It was not trained on malware, phishing, exploit, credential theft, evasion, or destructive automation examples.