Introducing SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding • 29 days ago
GGML and llama.cpp join HF to ensure the long-term progress of Local AI • Feb 20