Collections

Collections including paper arxiv:2508.14041

learning_from_papers

WebWatcher: Breaking New Frontier of Vision-Language Deep Research Agent

Paper • 2508.05748 • Published Aug 7, 2025 • 142
Agent Lightning: Train ANY AI Agents with Reinforcement Learning

Paper • 2508.03680 • Published Aug 5, 2025 • 140
SpatialLM: Training Large Language Models for Structured Indoor Modeling

Paper • 2506.07491 • Published Jun 9, 2025 • 51
LongSplat: Robust Unposed 3D Gaussian Splatting for Casual Long Videos

Paper • 2508.14041 • Published Aug 19, 2025 • 59

LongSplat: Robust Unposed 3D Gaussian Splatting for Casual Long Videos

Paper • 2508.14041 • Published Aug 19, 2025 • 59
MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction Priors

Paper • 2412.12392 • Published Dec 16, 2024 • 1

Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models

Paper • 2507.13344 • Published Jul 17, 2025 • 59
π^3: Scalable Permutation-Equivariant Visual Geometry Learning

Paper • 2507.13347 • Published Jul 17, 2025 • 67
MoVieS: Motion-Aware 4D Dynamic View Synthesis in One Second

Paper • 2507.10065 • Published Jul 14, 2025 • 25
CLiFT: Compressive Light-Field Tokens for Compute-Efficient and Adaptive Neural Rendering

Paper • 2507.08776 • Published Jul 11, 2025 • 55

LongSplat: Robust Unposed 3D Gaussian Splatting for Casual Long Videos

Paper • 2508.14041 • Published Aug 19, 2025 • 59
RynnEC: Bringing MLLMs into Embodied World

Paper • 2508.14160 • Published Aug 19, 2025 • 20

Describe What You See with Multimodal Large Language Models to Enhance Video Recommendations

Paper • 2508.09789 • Published Aug 13, 2025 • 5
MM-BrowseComp: A Comprehensive Benchmark for Multimodal Browsing Agents

Paper • 2508.13186 • Published Aug 14, 2025 • 20
ZARA: Zero-shot Motion Time-Series Analysis via Knowledge and Retrieval Driven LLM Agents

Paper • 2508.04038 • Published Aug 6, 2025 • 1
Prompt Orchestration Markup Language

Paper • 2508.13948 • Published Aug 19, 2025 • 48

Hunyuan3D 2.5: Towards High-Fidelity 3D Assets Generation with Ultimate Details

Paper • 2506.16504 • Published Jun 19, 2025 • 32
Hunyuan3D 2.1: From Images to High-Fidelity 3D Assets with Production-Ready PBR Material

Paper • 2506.15442 • Published Jun 18, 2025 • 16
Dens3R: A Foundation Model for 3D Geometry Prediction

Paper • 2507.16290 • Published Jul 22, 2025 • 9
GSFixer: Improving 3D Gaussian Splatting with Reference-Guided Video Diffusion Priors

Paper • 2508.09667 • Published Aug 13, 2025 • 6
