LuffyTheFox/Qwen3.5-9B-Claude-4.6-Opus-Uncensored-Distilled-GGUF Text Generation • 9B • Updated Mar 18 • 80.2k • 101
nvidia/NVIDIA-Nemotron-Nano-12B-v2-VL-FP8 Image-Text-to-Text • 13B • Updated Nov 13, 2025 • 659k • 50
ibm-granite/granite-docling-258M Image-Text-to-Text • 0.3B • Updated Sep 23, 2025 • 63.8k • 1.15k
AutoTriton: Automatic Triton Programming with Reinforcement Learning in LLMs Paper • 2507.05687 • Published Jul 8, 2025 • 31
nvidia/OpenCodeReasoning-Nemotron-32B-IOI Text Generation • 33B • Updated May 7, 2025 • 81 • 26
nvidia/Llama-3_1-Nemotron-Ultra-253B-v1 Text Generation • Updated Oct 15, 2025 • 6.9k • 345
PocketDoc/Dans-PersonalityEngine-V1.2.0-24b Text Generation • Updated May 23, 2025 • 66 • 176
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling Paper • 2502.06703 • Published Feb 10, 2025 • 153
meta-llama/Llama-3.2-90B-Vision-Instruct Image-Text-to-Text • 89B • Updated Mar 4, 2025 • 12.2k • 355
mistral-experimental/pixtral-12b Image-Text-to-Text • 13B • Updated Jan 27, 2025 • 162k • 103
Qwen/Qwen2.5-Coder-32B-Instruct Text Generation • 33B • Updated Jan 12, 2025 • 1.16M • 2.01k