view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 16 days ago • 857
SebastianBodza/Kartoffel_Orpheus-3B_german_natural-v0.1 Text-to-Speech • 3B • Updated May 17, 2025 • 370 • 17
unsloth/Mistral-Small-3.2-24B-Instruct-2506-GGUF Image-Text-to-Text • 24B • Updated Aug 26, 2025 • 38.7k • 166
Running on CPU Upgrade Agents Featured 1.31k Open ASR Leaderboard 🏆 1.31k Explore speech recognition model benchmarks and rankings
DiT: Self-supervised Pre-training for Document Image Transformer Paper • 2203.02378 • Published Mar 4, 2022 • 3
Decision Transformer: Reinforcement Learning via Sequence Modeling Paper • 2106.01345 • Published Jun 2, 2021 • 3
Offline Reinforcement Learning as One Big Sequence Modeling Problem Paper • 2106.02039 • Published Jun 3, 2021 • 2
DocLLM: A layout-aware generative language model for multimodal document understanding Paper • 2401.00908 • Published Dec 31, 2023 • 191