RAMP: Reinforcement Adaptive Mixed Precision Quantization for Efficient On Device LLM Inference Paper • 2603.17891 • Published about 1 month ago • 7
The Energy of Falsehood: Detecting Hallucinations via Diffusion Model Likelihoods Paper • 2602.11364 • Published Feb 11
QEIL v2: Heterogeneous Computing for Edge Intelligence via Roofline-Derived Pareto-Optimal Energy Modeling and Multi-Objective Orchestration Paper • 2602.06057 • Published 13 days ago • 5
Running on CPU Upgrade Featured 3.11k The Smol Training Playbook 📚 3.11k The secrets to building world-class LLMs
RAMP: Reinforcement Adaptive Mixed Precision Quantization for Efficient On Device LLM Inference Paper • 2603.17891 • Published about 1 month ago • 7
RAMP: Reinforcement Adaptive Mixed Precision Quantization for Efficient On Device LLM Inference Paper • 2603.17891 • Published about 1 month ago • 7
CogniSQL-R1-Zero: Lightweight Reinforced Reasoning for Efficient SQL Generation Paper • 2507.06013 • Published Jul 8, 2025
🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19, 2025 • 188
Running 3.79k The Ultra-Scale Playbook 🌌 3.79k The ultimate guide to training LLM on large GPU Clusters