Upload README.md with huggingface_hub

b8152ce verified 12 days ago

1.55 kB

base_model: google/gemma-4-E4B-it
library_name: peft
license: apache-2.0
tags:
  - gemma4
  - coding
  - qlora
  - kaggle-proof

Gemma 4 E4B IT Coding LoRA

QLoRA adapter for google/gemma-4-E4B-it, trained on filtered benign coding instructions.

Training

Runtime: Kaggle 2x Tesla T4
Data: filtered benign coding instruction data
Safe rows used: 1024
Steps: 200
LoRA: r=16, alpha=32, target_modules=all-linear
Trainable parameters: 50,499,584
Final train loss: 1.1427

50-Problem HumanEval Proof

This adapter was merged into josephmayo/gemma-4-E4B-it-Coder and evaluated on Kaggle with 2x Tesla T4 GPUs using an executable 50-task HumanEval subset. Full generated before/after code is published in eval50_before_after_full_code.csv.

Metric	Base `google/gemma-4-E4B-it`	Coder merge
Pass count	34 / 50	42 / 50
Absolute lift	-	+16.0 pp
Relative pass-count lift	-	+23.53%

Proof files included here: eval50_summary.json, eval50_before_after_full_code.csv, EVAL50_README.md, nvidia_smi.txt.

Earlier 8-task smoke artifacts are still included for reproducibility (eval_before_after.csv, executable_eval.json, trainer_log_history.json, summary.json, proof_summary.json, evaluation_scope.json), but the headline proof is the 50-task executable run above.

This adapter is for benign coding assistance only. It was not trained on malware, phishing, exploit, credential theft, evasion, or destructive automation examples.