josephmayo's picture
Upload README.md with huggingface_hub
b8152ce verified
metadata
base_model: google/gemma-4-E4B-it
library_name: peft
license: apache-2.0
tags:
  - gemma4
  - coding
  - qlora
  - kaggle-proof

Gemma 4 E4B IT Coding LoRA

QLoRA adapter for google/gemma-4-E4B-it, trained on filtered benign coding instructions.

Training

  • Runtime: Kaggle 2x Tesla T4
  • Data: filtered benign coding instruction data
  • Safe rows used: 1024
  • Steps: 200
  • LoRA: r=16, alpha=32, target_modules=all-linear
  • Trainable parameters: 50,499,584
  • Final train loss: 1.1427

50-Problem HumanEval Proof

This adapter was merged into josephmayo/gemma-4-E4B-it-Coder and evaluated on Kaggle with 2x Tesla T4 GPUs using an executable 50-task HumanEval subset. Full generated before/after code is published in eval50_before_after_full_code.csv.

Metric Base google/gemma-4-E4B-it Coder merge
Pass count 34 / 50 42 / 50
Absolute lift - +16.0 pp
Relative pass-count lift - +23.53%

Proof files included here: eval50_summary.json, eval50_before_after_full_code.csv, EVAL50_README.md, nvidia_smi.txt.

Earlier 8-task smoke artifacts are still included for reproducibility (eval_before_after.csv, executable_eval.json, trainer_log_history.json, summary.json, proof_summary.json, evaluation_scope.json), but the headline proof is the 50-task executable run above.

This adapter is for benign coding assistance only. It was not trained on malware, phishing, exploit, credential theft, evasion, or destructive automation examples.