DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models Paper • 2603.26164 • Published 16 days ago • 347
Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model Paper • 2603.21986 • Published 20 days ago • 123
view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 11 days ago • 822
Gemma 4 Collection Gemma 4 is Google's new model family including including E2B, E4B, 26B-A4B, and 31B. • 28 items • Updated about 9 hours ago • 122
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization Paper • 2603.19835 • Published 23 days ago • 330
view article Article TRL v1.0: Post-Training Library Built to Move with the Field +2 13 days ago • 47
TimesFM Release Collection TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting. • 7 items • Updated Mar 12 • 44
EdgeCrafter: Compact ViTs for Edge Dense Prediction via Task-Specialized Distillation Paper • 2603.18739 • Published 24 days ago • 11