explain what each chain tests and the three-chain design logic

Files changed (1)

README.md CHANGED

@@ -1,3 +1,26 @@
+---
+license: apache-2.0
+library_name: pytorch
+language:
+  - en
+pipeline_tag: feature-extraction
+base_model: Qwen/Qwen2.5-0.5B-Instruct
+datasets:
+  - openai/gsm8k
+tags:
+  - sparse-autoencoder
+  - sae
+  - topk-sae
+  - interpretability
+  - mechanistic-interpretability
+  - ppo
+  - rlhf
+  - reasoning
+  - training-dynamics
+  - qwen
+  - qwen2.5
+---
 # SAE × RL: Qwen2.5-0.5B on GSM8k (multi-condition)
 Warm-start TopK SAEs trained on residual-stream activations of