May you use this (heretic) technique?

#1
by mahahahug - opened

https://github.com/p-e-w/heretic
Refusal direction projection removal (Arditi et al., 2024) is an outdated technique.

this is a new model which works quite differently in many ways. People including heretic are still figuring it out.

Sign up or log in to comment