Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
- Website
- Community
- Solutions
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2605.27365

Eagle is a family of frontier vision-language models with data-centric strategies. The model supports both HD image and long-context video input.

about 11 hours ago

nvidia/Eagle2-1B

Image-Text-to-Text • 1B • Updated Apr 27, 2025 • 1.6k • 29
nvidia/Eagle2-2B

Image-Text-to-Text • 2B • Updated Apr 27, 2025 • 579 • 33
nvidia/Eagle2-9B

Image-Text-to-Text • 9B • Updated Jan 28, 2025 • 189 • 63
Build error

Agents

15

Eagle2.5 VL

💬

15

Chat with Eagle2-VL to generate text based on text and images

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

Paper • 2605.27365 • Published 1 day ago • 83

about 6 hours ago

Fast-SAM3D: 3Dfy Anything in Images but Faster

Paper • 2602.05293 • Published Feb 5 • 2
Stroke of Surprise: Progressive Semantic Illusions in Vector Sketching

Paper • 2602.12280 • Published Feb 12 • 34
CADEvolve: Creating Realistic CAD via Program Evolution

Paper • 2602.16317 • Published Feb 18 • 30
SketchDynamics: Exploring Free-Form Sketches for Dynamic Intent Expression in Animation Generation

Paper • 2601.20622 • Published Jan 28 • 2

about 2 hours ago

LLM Pruning and Distillation in Practice: The Minitron Approach

Paper • 2408.11796 • Published Aug 21, 2024 • 61
TableBench: A Comprehensive and Complex Benchmark for Table Question Answering

Paper • 2408.09174 • Published Aug 17, 2024 • 53
To Code, or Not To Code? Exploring Impact of Code in Pre-training

Paper • 2408.10914 • Published Aug 20, 2024 • 45
Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications

Paper • 2408.11878 • Published Aug 20, 2024 • 64

interesting architecture

about 7 hours ago

FAN: Fourier Analysis Networks

Paper • 2410.02675 • Published Oct 3, 2024 • 29
Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published Jan 11, 2025 • 91
Scalable-Softmax Is Superior for Attention

Paper • 2501.19399 • Published Jan 31, 2025 • 25
EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling

Paper • 2502.09509 • Published Feb 13, 2025 • 9

about 5 hours ago

Warp-as-History: Generalizable Camera-Controlled Video Generation from One Training Video

Paper • 2605.15182 • Published 14 days ago • 39
STALE: Can LLM Agents Know When Their Memories Are No Longer Valid?

Paper • 2605.06527 • Published 21 days ago • 44
Learning to Build the Environment: Self-Evolving Reasoning RL via Verifiable Environment Synthesis

Paper • 2605.14392 • Published 14 days ago • 8
World Action Models: The Next Frontier in Embodied AI

Paper • 2605.12090 • Published 16 days ago • 66

about 8 hours ago

MiCo: Multi-image Contrast for Reinforcement Visual Reasoning

Paper • 2506.22434 • Published Jun 27, 2025 • 10
VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning

Paper • 2507.13348 • Published Jul 17, 2025 • 79
RewardDance: Reward Scaling in Visual Generation

Paper • 2509.08826 • Published Sep 10, 2025 • 73
Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs

Paper • 2510.18876 • Published Oct 21, 2025 • 37

Eagle is a family of frontier vision-language models with data-centric strategies. The model supports both HD image and long-context video input.

about 11 hours ago

nvidia/Eagle2-1B

Image-Text-to-Text • 1B • Updated Apr 27, 2025 • 1.6k • 29
nvidia/Eagle2-2B

Image-Text-to-Text • 2B • Updated Apr 27, 2025 • 579 • 33
nvidia/Eagle2-9B

Image-Text-to-Text • 9B • Updated Jan 28, 2025 • 189 • 63
Build error

Agents

15

Eagle2.5 VL

💬

15

Chat with Eagle2-VL to generate text based on text and images

interesting architecture

about 7 hours ago

FAN: Fourier Analysis Networks

Paper • 2410.02675 • Published Oct 3, 2024 • 29
Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published Jan 11, 2025 • 91
Scalable-Softmax Is Superior for Attention

Paper • 2501.19399 • Published Jan 31, 2025 • 25
EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling

Paper • 2502.09509 • Published Feb 13, 2025 • 9

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

Paper • 2605.27365 • Published 1 day ago • 83

about 5 hours ago

Warp-as-History: Generalizable Camera-Controlled Video Generation from One Training Video

Paper • 2605.15182 • Published 14 days ago • 39
STALE: Can LLM Agents Know When Their Memories Are No Longer Valid?

Paper • 2605.06527 • Published 21 days ago • 44
Learning to Build the Environment: Self-Evolving Reasoning RL via Verifiable Environment Synthesis

Paper • 2605.14392 • Published 14 days ago • 8
World Action Models: The Next Frontier in Embodied AI

Paper • 2605.12090 • Published 16 days ago • 66

about 6 hours ago

Fast-SAM3D: 3Dfy Anything in Images but Faster

Paper • 2602.05293 • Published Feb 5 • 2
Stroke of Surprise: Progressive Semantic Illusions in Vector Sketching

Paper • 2602.12280 • Published Feb 12 • 34
CADEvolve: Creating Realistic CAD via Program Evolution

Paper • 2602.16317 • Published Feb 18 • 30
SketchDynamics: Exploring Free-Form Sketches for Dynamic Intent Expression in Animation Generation

Paper • 2601.20622 • Published Jan 28 • 2

about 8 hours ago

MiCo: Multi-image Contrast for Reinforcement Visual Reasoning

Paper • 2506.22434 • Published Jun 27, 2025 • 10
VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning

Paper • 2507.13348 • Published Jul 17, 2025 • 79
RewardDance: Reward Scaling in Visual Generation

Paper • 2509.08826 • Published Sep 10, 2025 • 73
Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs

Paper • 2510.18876 • Published Oct 21, 2025 • 37

about 2 hours ago

LLM Pruning and Distillation in Practice: The Minitron Approach

Paper • 2408.11796 • Published Aug 21, 2024 • 61
TableBench: A Comprehensive and Complex Benchmark for Table Question Answering

Paper • 2408.09174 • Published Aug 17, 2024 • 53
To Code, or Not To Code? Exploring Impact of Code in Pre-training

Paper • 2408.10914 • Published Aug 20, 2024 • 45
Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications

Paper • 2408.11878 • Published Aug 20, 2024 • 64

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs