Running 1 CorrSteer: Correlation-Based Steering of Language Models via Sparse Autoencoders - Steer language model output by clicking visual layers
Running Featured 47 Porting nanochat to Transformers: an AI modeling history lesson - Learn about ML and Transformers through nanochat
Running 11 FAT5 (Flash Attention T5) report - English version of the blog post introducing FAT5 model
Running 65 Unfolding Robotics: Open-Source Shirt Folding from Data to Deployment - Explore the open-source guide to robot shirt folding
Running on CPU Upgrade 220 The Synthetic Data Playbook: Generating Trillions of the Finest Tokens - Explore synthetic data experiments on a virtual bookshelf
Running 5 Robotics research should think (and do) more about sustainability! - Explore robotics papers by sustainability goals
Running Featured 24 Chasing the Counting Manifold in Open LLMs - Counting manifolds in open LLMs from behavior to SAEs
Running Featured 70 QED-Nano: Teaching a Tiny Model to Prove Hard Theorems - Who needs 1T parameters? Olympiad proofs with a 4B model
Running Featured 88 Parakeet STT Progressive Transcription - Transcribe speech to text instantly with WebGPU acceleration