In a Training Loop 🔄

Gyanateet Dutta

Ryukijano

https://ryukijano.github.io

AI & ML interests

Computer Vision, Robotics, Generative modelling, AI for Sciences.

Recent Activity

liked a model about 7 hours ago

MiniMaxAI/MiniMax-M2.7

liked a Space 2 days ago

facebook/fairchem_uma_demo

liked a model 2 days ago

upvoted a collection 3 days ago

SigLino: Vision Foundation Models (SigLIP2 + DINOv3)

Vision encoders distilled from DINOv3 and SigLIP2 (MoE & Dense). CVPR 2026. • 6 items • Updated 2 days ago • 16

upvoted a collection 5 days ago

WildDet3D

This is the collection of WildDet3D artifacts, including demos, model checkpoints and data. https://github.com/allenai/WildDet3D • 7 items • Updated 5 days ago • 13

upvoted a paper 24 days ago

PRISM: A Unified Framework for Photorealistic Reconstruction and Intrinsic Scene Modeling

Paper • 2504.14219 • Published Apr 19, 2025 • 2

upvoted a changelog 25 days ago

Hugging Face Papers for AI Agents

Hugging Face Changelog • 25 days ago • 136

upvoted a collection about 1 month ago

Nemotron-Terminal

We are releasing Nemotron-Terminal models and training datasets. • 5 items • Updated 6 days ago • 34

upvoted 3 papers about 1 month ago

VLASH: Real-Time VLAs via Future-State-Aware Asynchronous Inference

Paper • 2512.01031 • Published Nov 30, 2025 • 26

Utonia: Toward One Encoder for All Point Clouds

Paper • 2603.03283 • Published Mar 3 • 185

Unified Latents (UL): How to train your latents

Paper • 2602.17270 • Published Feb 19 • 60

upvoted 2 papers about 2 months ago

GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning

Paper • 2602.12099 • Published Feb 12 • 61

WorldCompass: Reinforcement Learning for Long-Horizon World Models

Paper • 2602.09022 • Published Feb 9 • 21

upvoted a paper 2 months ago

Self-Hinting Language Models Enhance Reinforcement Learning

Paper • 2602.03143 • Published Feb 3 • 31

upvoted 3 papers 3 months ago

Diffusion Knows Transparency: Repurposing Video Diffusion for Transparent Object Depth and Normal Estimation

Paper • 2512.23705 • Published Dec 29, 2025 • 45

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published Dec 31, 2025 • 321

Dream-VL & Dream-VLA: Open Vision-Language and Vision-Language-Action Models with Diffusion Language Model Backbone

Paper • 2512.22615 • Published Dec 27, 2025 • 50

upvoted an article 4 months ago

Why You Should Care About Partial Differential Equations (PDEs)

Article • Dec 12, 2025 • 42

upvoted 2 papers 5 months ago

Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning

Paper • 2510.27606 • Published Oct 31, 2025 • 31

π_RL: Online RL Fine-tuning for Flow-based Vision-Language-Action Models

Paper • 2510.25889 • Published Oct 29, 2025 • 66

upvoted a paper 6 months ago

Advancing End-to-End Pixel Space Generative Modeling via Self-supervised Pre-training

Paper • 2510.12586 • Published Oct 14, 2025 • 115

upvoted an article 7 months ago

SAIR: Accelerating Pharma R&D with AI-Powered Structural Intelligence

Article • Sep 2, 2025 • 36

upvoted a paper 7 months ago

Discrete Diffusion VLA: Bringing Discrete Diffusion to Action Decoding in Vision-Language-Action Policies

Paper • 2508.20072 • Published Aug 27, 2025 • 32