Dogacel 's Collections

Attention Drift

Models trained as a part of the "Attention Drift: What Speculative Decoding Models Learn" paper, shared for reproducing experiments.