Article Tricks from OpenAI gpt-oss YOU 🫡 can use with transformers +5 Sep 11, 2025 • 186
The Eiffel Tower Llama • Explore the Eiffel Tower Llama experiment with open-source models • 113
The Smol Training Playbook • The secrets to building world-class LLMs • 3.1k
Article Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training +3 Aug 8, 2025 • 97
LLM Embeddings Explained: A Visual and Intuitive Guide • How Language Models Turn Text into Meaning, From Traditional • 335
Article Training Large Language Models with Interpreter Feedback using WebAssembly Apr 3, 2025 • 14
The Ultra-Scale Playbook • The ultimate guide to training LLM on large GPU Clusters • 3.78k
The Lessons of Developing Process Reward Models in Mathematical Reasoning Paper • 2501.07301 • Published Jan 13, 2025 • 100
Scaling test-time compute • Run advanced search strategies to boost LLM problem solving • 595
Snowflake/snowflake-arctic-embed-m Sentence Similarity • 0.1B • Updated Dec 13, 2024 • 546k • 164