207 103

Ougrid Dumdang

Ougrid-D

ougrid

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

FP4 Explore, BF16 Train: Diffusion Reinforcement Learning via Efficient Rollout Scaling

liked a Space 4 days ago

HuggingFaceTB/trl-distillation-trainer

liked a model 9 days ago

zai-org/GLM-5.1

View all activity

Organizations

upvoted a paper 4 days ago

FP4 Explore, BF16 Train: Diffusion Reinforcement Learning via Efficient Rollout Scaling

Paper • 2604.06916 • Published 9 days ago • 34

upvoted a paper 12 days ago

Omni-SimpleMem: Autoresearch-Guided Discovery of Lifelong Multimodal Agent Memory

Paper • 2604.01007 • Published 15 days ago • 31

upvoted an article 14 days ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

15 days ago

•

853

upvoted 2 papers 21 days ago

Voxtral TTS

Paper • 2603.25551 • Published 22 days ago • 59

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Paper • 2603.25040 • Published 22 days ago • 131

upvoted a paper 22 days ago

UniGRPO: Unified Policy Optimization for Reasoning-Driven Visual Generation

Paper • 2603.23500 • Published 24 days ago • 35

upvoted 4 papers about 1 month ago

Geometry-Guided Reinforcement Learning for Multi-view Consistent 3D Scene Editing

Paper • 2603.03143 • Published Mar 3 • 145

CubeComposer: Spatio-Temporal Autoregressive 4K 360° Video Generation from Perspective Video

Paper • 2603.04291 • Published Mar 4 • 13

Heterogeneous Agent Collaborative Reinforcement Learning

Paper • 2603.02604 • Published Mar 3 • 194

Qwen3-Coder-Next Technical Report

Paper • 2603.00729 • Published Feb 28 • 64

upvoted 2 papers about 2 months ago

CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

Paper • 2602.24286 • Published Feb 27 • 98

Dr. MAS: Stable Reinforcement Learning for Multi-Agent LLM Systems

Paper • 2602.08847 • Published Feb 9 • 29

upvoted a paper 2 months ago

When and How Much to Imagine: Adaptive Test-Time Scaling with World Models for Visual Spatial Reasoning

Paper • 2602.08236 • Published Feb 9 • 9

upvoted an article 2 months ago

Article

Forge: Scalable Agent RL Framework and Algorithm

Feb 13

•

147

upvoted 6 papers 2 months ago

DINO-SAE: DINO Spherical Autoencoder for High-Fidelity Image Reconstruction and Generation

Paper • 2601.22904 • Published Jan 30 • 15

FS-Researcher: Test-Time Scaling for Long-Horizon Research Tasks with File-System-Based Agents

Paper • 2602.01566 • Published Feb 2 • 52

ConceptMoE: Adaptive Token-to-Concept Compression for Implicit Compute Allocation

Paper • 2601.21420 • Published Jan 29 • 42