Ex0bit/Gemma4-26B-A4B-PRISM-PRO-DQ-GGUF Image-Text-to-Text • 25B • Updated 4 days ago • 2.62k • 54
Qwen3.5 Collection Qwen3.5 is Qwen's new model family including Qwen3.5 Small: 0.8B, 2B, 4B, 9B and Qwen3.5 Medium: 35B-A3B, 27B, 122B-A10B and 397B-A17B. • 25 items • Updated 11 days ago • 146
Running on Zero Featured 17 Qwen3 VL Video Grounding 🥠17 Text-guided object tracking, point tracking, reasoning.
InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding Paper • 2403.15377 • Published Mar 22, 2024 • 29
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 330k • 1.58k