CORAL: Towards Autonomous Multi-Agent Evolution for Open-Ended Discovery Paper • 2604.01658 • Published 11 days ago • 52
Training Domain Draft Models for Speculative Decoding: Best Practices and Insights Paper • 2503.07807 • Published Mar 10, 2025 • 1
On the Tool Manipulation Capability of Open-source Large Language Models Paper • 2305.16504 • Published May 25, 2023 • 2
Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models Paper • 2510.04618 • Published Oct 6, 2025 • 131
Post 1730 Mini-QwQ: an edge-device-friendly reasoning model distilled from QwQ-32B. 🤗: kz919/QwQ-0.5B-Distilled-SFT GGUF: kz919/QwQ-0.5B-Distilled-SFT-gguf 🤖: kz919/Mini-QwQ
Qwen2.5 Coder Artifacts 🐢: Generate and preview web app code from a text description Space • Running • Featured • 272
Cautious Optimizers: Improving Training with One Line of Code Paper • 2411.16085 • Published Nov 25, 2024 • 19