Gemma 4 Collection Gemma 4 is Google's new model family including including E2B, E4B, 26B-A4B, and 31B. • 28 items • Updated about 12 hours ago • 123
view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 11 days ago • 822
Gemma Scope Release Collection A comprehensive, open suite of sparse autoencoders for Gemma 2 2B and 9B. • 10 items • Updated Mar 12 • 22
Learn2Fold: Structured Origami Generation with World Model Planning Paper • 2603.29585 • Published Feb 2 • 16
view article Article Granite 4.0 3B Vision: Compact Multimodal Intelligence for Enterprise Documents 12 days ago • 34
view article Article Nemotron 3 Nano 4B: A Compact Hybrid Model for Efficient Local AI 26 days ago • 62
Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length Paper • 2512.04677 • Published Dec 4, 2025 • 177
Loc3R-VLM: Language-based Localization and 3D Reasoning with Vision-Language Models Paper • 2603.18002 • Published 25 days ago • 13
Scaling Synthetic Data Creation with 1,000,000,000 Personas Paper • 2406.20094 • Published Jun 28, 2024 • 107
LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning Paper • 2603.21065 • Published 22 days ago • 77
Nemotron-Cascade 2 Collection Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation • 4 items • Updated 6 days ago • 47
Granite Speech Models Collection Multilingual ASR and speech-to-text (STT) models for enterprise transcription and translation. • 6 items • Updated 11 days ago • 24
DiariZen Collection DiariZen is a speaker diarization toolkit driven by AudioZen and Pyannote 3.1. • 6 items • Updated Dec 9, 2025 • 3
view article Article Getting More from Your Test-Time Compute Budget with Portfolio Beam Search Feb 24 • 8