Accelerating Speculative Decoding with Block Diffusion Draft Trees Paper • 2604.12989 • Published 3 days ago • 5
KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance Paper • 2604.12627 • Published 3 days ago • 92
Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe Paper • 2604.13016 • Published 3 days ago • 72
Liquid Claude Collection Liquid Claude is a small series of LiquidAI/LFM2.5-1.2B-Thinking models that have been fine-tuned on Claude chats/data. • 5 items • Updated 5 days ago • 2
Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models Paper • 2411.04996 • Published Nov 7, 2024 • 51
Mixture-of-Mamba: Enhancing Multi-Modal State-Space Models with Modality-Aware Sparsity Paper • 2501.16295 • Published Jan 27, 2025 • 9
Welcome Gemma 4: Frontier multimodal intelligence on device Article • 15 days ago • 852
On the Verge of Solving Rocket League using Deep Reinforcement Learning and Sim-to-sim Transfer Paper • 2205.05061 • Published May 10, 2022 • 2
MMAI Gym for Science: Training Liquid Foundation Models for Drug Discovery Paper • 2603.03517 • Published Mar 3 • 3
AutoResearch-RL: Perpetual Self-Evaluating Reinforcement Learning Agents for Autonomous Neural Architecture Discovery Paper • 2603.07300 • Published Mar 7 • 17
H-Neurons: On the Existence, Impact, and Origin of Hallucination-Associated Neurons in LLMs Paper • 2512.01797 • Published Dec 1, 2025 • 9