Ember

company

AI & ML interests

None defined yet.

Team members 12
private

authored a paper 2 months ago

PhyCritic: Multimodal Critic Models for Physical AI

Paper • 2602.11124 • Published Feb 11 • 55

submitted a paper to Daily Papers 4 months ago

Fast-FoundationStereo: Real-Time Zero-Shot Stereo Matching

Paper • 2512.11130 • Published Dec 11, 2025 • 10

authored a paper 4 months ago

SpaceTools: Tool-Augmented Spatial Reasoning via Double Interactive RL

Paper • 2512.04069 • Published Dec 3, 2025 • 24

authored 7 papers 4 months ago

Understanding 3D Object Articulation in Internet Videos

Paper • 2203.16531 • Published Mar 30, 2022

Diffusion Models Learn Low-Dimensional Distributions via Subspace Clustering

Paper • 2409.02426 • Published Sep 4, 2024

Explaining and Mitigating the Modality Gap in Contrastive Multimodal Learning

Paper • 2412.07909 • Published Dec 10, 2024

Exploring Low-Dimensional Subspaces in Diffusion Models for Controllable Image Editing

Paper • 2409.02374 • Published Sep 4, 2024

The Dual Power of Interpretable Token Embeddings: Jailbreaking Attacks and Defenses for Diffusion Model Unlearning

Paper • 2504.21307 • Published Apr 30, 2025

Unfolding Videos Dynamics via Taylor Expansion

Paper • 2409.02371 • Published Sep 4, 2024

SpaceTools: Tool-Augmented Spatial Reasoning via Double Interactive RL

Paper • 2512.04069 • Published Dec 3, 2025 • 24

authored a paper 4 months ago

FLARE: Robot Learning with Implicit World Modeling

Paper • 2505.15659 • Published May 21, 2025

authored a paper 7 months ago

MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources

Paper • 2509.21268 • Published Sep 25, 2025 • 104

authored a paper 11 months ago

DreamGen: Unlocking Generalization in Robot Learning through Neural Trajectories

Paper • 2505.12705 • Published May 19, 2025

zwrq

authored a paper 12 months ago

Describe Anything: Detailed Localized Image and Video Captioning

Paper • 2504.16072 • Published Apr 22, 2025 • 64

authored a paper 12 months ago

Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models

Paper • 2504.15271 • Published Apr 21, 2025 • 68

authored a paper 12 months ago

Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models

Paper • 2504.15271 • Published Apr 21, 2025 • 68

authored 2 papers about 1 year ago

GR00T N1: An Open Foundation Model for Generalist Humanoid Robots

Paper • 2503.14734 • Published Mar 18, 2025 • 7

Error Analyses of Auto-Regressive Video Diffusion Models: A Unified Framework

Paper • 2503.10704 • Published Mar 12, 2025 • 5

authored a paper about 1 year ago

Token-Efficient Long Video Understanding for Multimodal LLMs

Paper • 2503.04130 • Published Mar 6, 2025 • 96

authored a paper about 1 year ago

FB-BEV: BEV Representation from Forward-Backward View Transformations

Paper • 2308.02236 • Published Aug 4, 2023