view article Article Exploring Direct Tensor Manipulation in Language Models: A Case Study in Binary-Level Model Enhancement Nov 7, 2025 โข 4
Open-Endedness is Essential for Artificial Superhuman Intelligence Paper โข 2406.04268 โข Published Jun 6, 2024 โข 13
view article Article DS-MoE: Making MoE Models More Efficient and Less Memory-Intensive Apr 9, 2024 โข 30