Kronos: A Foundation Model for the Language of Financial Markets Paper • 2508.02739 • Published Aug 2, 2025 • 21
TriAttention: Efficient Long Reasoning with Trigonometric KV Compression Paper • 2604.04921 • Published 12 days ago • 107
The Y-Combinator for LLMs: Solving Long-Context Rot with λ-Calculus Paper • 2603.20105 • Published 28 days ago • 37
HiAR: Efficient Autoregressive Long Video Generation via Hierarchical Denoising Paper • 2603.08703 • Published Mar 9 • 32
Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence Paper • 2603.07660 • Published Mar 8 • 86
WildActor: Unconstrained Identity-Preserving Video Generation Paper • 2603.00586 • Published Feb 28 • 38
Geometry-Guided Reinforcement Learning for Multi-view Consistent 3D Scene Editing Paper • 2603.03143 • Published Mar 3 • 145
TDM-R1: Reinforcing Few-Step Diffusion Models with Non-Differentiable Reward Paper • 2603.07700 • Published Mar 8 • 13
EchoTorrent: Towards Swift, Sustained, and Streaming Multi-Modal Video Generation Paper • 2602.13669 • Published Feb 14 • 2
S2DiT: Sandwich Diffusion Transformer for Mobile Streaming Video Generation Paper • 2601.12719 • Published Jan 19 • 1
DDiT: Dynamic Patch Scheduling for Efficient Diffusion Transformers Paper • 2602.16968 • Published Feb 19 • 12
OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens Paper • 2603.02138 • Published Mar 2 • 151
Moonshine: Speech Recognition for Live Transcription and Voice Commands Paper • 2410.15608 • Published Oct 21, 2024 • 12
JavisDiT++: Unified Modeling and Optimization for Joint Audio-Video Generation Paper • 2602.19163 • Published Feb 22 • 14