staircase SAEs
Standard Top-K SAEs, trained independently for each layer, with widths that increase from layer to layer in the same way as in the staircase structure. The per-layer width multiplication factors are (8, 16, 24, 32, 40).
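As a rough illustration of what the multiplication factors imply, the sketch below computes one latent width per SAE and shows a plain Top-K activation. It assumes that each factor multiplies the base model's hidden size (the usual expansion-factor convention), which this card does not state explicitly. The names `sae_widths` and `top_k` and the value `d_model = 768` are hypothetical and are not taken from this repository.

```python
# Width multiplication factors listed in this card, one per SAE.
MULTIPLIERS = (8, 16, 24, 32, 40)


def sae_widths(d_model, multipliers=MULTIPLIERS):
    """Return the latent width of each independently trained SAE.

    Assumption: width = factor * d_model. The card gives only the factors,
    not the base dimension they are applied to.
    """
    return [m * d_model for m in multipliers]


def top_k(pre_acts, k):
    """Keep the k largest pre-activations and zero out the rest."""
    order = sorted(range(len(pre_acts)), key=lambda i: pre_acts[i], reverse=True)
    keep = set(order[:k])
    return [v if i in keep else 0.0 for i, v in enumerate(pre_acts)]


if __name__ == "__main__":
    d_model = 768  # hypothetical hidden size, for illustration only
    for factor, width in zip(MULTIPLIERS, sae_widths(d_model)):
        print(f"factor {factor:>2} -> {width} latents")
    print(top_k([0.1, 2.0, -1.0, 0.5], k=2))
```

Under this assumption the five SAEs differ only in width, so each can be trained and loaded on its own without reference to the others.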