Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
ali0001010010 's Collections
Potential9D
Realtime Voice Calling stuff
Agentic / LLm stuff

Potential9D

updated about 2 hours ago
Upvote
-

  • Mobile-O: Unified Multimodal Understanding and Generation on Mobile Device

    Paper • 2602.20161 • Published Feb 23 • 23

  • A Very Big Video Reasoning Suite

    Paper • 2602.20159 • Published Feb 23 • 519

  • Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model

    Paper • 2603.21986 • Published 25 days ago • 123

  • AURA: Always-On Understanding and Real-Time Assistance via Video Streams

    Paper • 2604.04184 • Published 12 days ago • 50

  • OpenWorldLib: A Unified Codebase and Definition of Advanced World Models

    Paper • 2604.04707 • Published 11 days ago • 200

  • TriAttention: Efficient Long Reasoning with Trigonometric KV Compression

    Paper • 2604.04921 • Published 11 days ago • 107

  • Memory Intelligence Agent

    Paper • 2604.04503 • Published 11 days ago • 57

  • Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music

    Paper • 2604.10905 • Published 4 days ago • 26

  • HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds

    Paper • 2604.14268 • Published 2 days ago • 48
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs