OpenCulture Collection A multilingual dataset of public domain books and newspapers. β’ 25 items β’ Updated Mar 2 β’ 133
Qwen/Qwen3.5-397B-A17B Image-Text-to-Text β’ 403B β’ Updated about 1 month ago β’ 782k β’ β’ 1.44k
Running on CPU Upgrade Featured 3.1k The Smol Training Playbook π 3.1k The secrets to building world-class LLMs
view article Article Releasing Common Corpus: the largest public domain dataset for training LLMs Mar 20, 2024 β’ 32
Runtime error Featured 141 smolagents LLM leaderboard π 141 A leaderboard for LLMs powering smolagents