Collections
Discover the best community collections!
Collections including paper arxiv:2506.01844
-
ViViT: A Video Vision Transformer
Paper • 2103.15691 • Published • 4 -
DINO-Foresight: Looking into the Future with DINO
Paper • 2412.11673 • Published • 1 -
Scaling Behavior Cloning Improves Causal Reasoning: An Open Model for Real-Time Video Game Playing
Paper • 2601.04575 • Published • 12 -
Learning Long-Context Diffusion Policies via Past-Token Prediction
Paper • 2505.09561 • Published
-
Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware
Paper • 2304.13705 • Published • 7 -
Diffusion Policy: Visuomotor Policy Learning via Action Diffusion
Paper • 2303.04137 • Published • 6 -
OpenVLA: An Open-Source Vision-Language-Action Model
Paper • 2406.09246 • Published • 47 -
Temporal Difference Learning for Model Predictive Control
Paper • 2203.04955 • Published • 3
-
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
Paper • 2508.06471 • Published • 211 -
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
Paper • 2507.01006 • Published • 254 -
Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities
Paper • 2507.06261 • Published • 67 -
SmallThinker: A Family of Efficient Large Language Models Natively Trained for Local Deployment
Paper • 2507.20984 • Published • 58
-
ViViT: A Video Vision Transformer
Paper • 2103.15691 • Published • 4 -
DINO-Foresight: Looking into the Future with DINO
Paper • 2412.11673 • Published • 1 -
Scaling Behavior Cloning Improves Causal Reasoning: An Open Model for Real-Time Video Game Playing
Paper • 2601.04575 • Published • 12 -
Learning Long-Context Diffusion Policies via Past-Token Prediction
Paper • 2505.09561 • Published
-
Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware
Paper • 2304.13705 • Published • 7 -
Diffusion Policy: Visuomotor Policy Learning via Action Diffusion
Paper • 2303.04137 • Published • 6 -
OpenVLA: An Open-Source Vision-Language-Action Model
Paper • 2406.09246 • Published • 47 -
Temporal Difference Learning for Model Predictive Control
Paper • 2203.04955 • Published • 3
-
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
Paper • 2508.06471 • Published • 211 -
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
Paper • 2507.01006 • Published • 254 -
Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities
Paper • 2507.06261 • Published • 67 -
SmallThinker: A Family of Efficient Large Language Models Natively Trained for Local Deployment
Paper • 2507.20984 • Published • 58