Ilya626/Qwen3.5-4B-Heretic-SDFT-v1
This repository contains public release artifacts for a Qwen/Qwen3.5-4B model further tuned with SDFT in a fixed-teacher setup.
The student model was trained from Qwen/Qwen3.5-4B, with MuXodious/Qwen3.5-4B-SOMPOA-heresy-v2 used as the teacher model during SDFT.
Included Artifacts
student_lora/: the published PEFT/LoRA adapter for the SDFT-trained student modelquantized_gguf/: a quantized GGUF export for inference
The GGUF release in this repository is provided as a text-to-text model only. Multimodal capability is not preserved in the published GGUF artifact.
Safety Notice
This release is intended for research and evaluation purposes. The training setup weakens or disrupts the original safety guardrails, so the model may produce unsafe, explicit, offensive, or otherwise unfiltered outputs.
Do not use this model to facilitate:
- violence or physical harm
- harassment, abuse, or sexual exploitation
- fraud, deception, or impersonation
- cyber abuse, malware, or unauthorized intrusion
- illegal activity or harmful real-world misuse
This model should not be deployed in consumer-facing or unsupervised environments. Use it only in controlled research settings and in compliance with applicable laws, platform rules, and organizational safety policies.
Capability Note
According to the author's GPQA checks, the observed deviation from the base model was below 0.5% when both models were evaluated in Q5_K_M GGUF format. The author considers this difference to be within expected measurement noise rather than evidence of a meaningful capability drop.
Usage Note
To use the released LoRA adapter, load the files from student_lora/ on top of Qwen/Qwen3.5-4B.
If your inference stack uses repository-provided tokenizer or chat template files, prefer the versions included alongside the adapter in student_lora/.
- Downloads last month
- 37
5-bit