Backdoor Benchmark
Collection
A collection of models with a backdoor. • 24 items • Updated
Qwen/Qwen3-4B-Instruct-2507This model serves as a clean baseline for comparison with backdoored models
in research on detecting data poisoning and backdoor attacks in LLMs.
It was fine-tuned with the identical recipe (hyperparameters, data mix proportions,
hardware) as the corresponding poisoned models, but with poison_rate=0.
Part of the Clean Fine-Tuned Baselines collection.
Base model
Qwen/Qwen3-4B-Instruct-2507