-
Omni-Effects: Unified and Spatially-Controllable Visual Effects Generation
Paper • 2508.07981 • Published • 63 -
CharacterShot: Controllable and Consistent 4D Character Animation
Paper • 2508.07409 • Published • 39 -
ToonComposer: Streamlining Cartoon Production with Generative Post-Keyframing
Paper • 2508.10881 • Published • 53 -
Puppeteer: Rig and Animate Your 3D Models
Paper • 2508.10898 • Published • 33
Collections
Discover the best community collections!
Collections including paper arxiv:2507.17744
-
Arbitrary-steps Image Super-resolution via Diffusion Inversion
Paper • 2412.09013 • Published • 13 -
Deep Researcher with Test-Time Diffusion
Paper • 2507.16075 • Published • 68 -
nablaNABLA: Neighborhood Adaptive Block-Level Attention
Paper • 2507.13546 • Published • 126 -
Yume: An Interactive World Generation Model
Paper • 2507.17744 • Published • 92
-
AnimaX: Animating the Inanimate in 3D with Joint Video-Pose Diffusion Models
Paper • 2506.19851 • Published • 60 -
SeqTex: Generate Mesh Textures in Video Sequence
Paper • 2507.04285 • Published • 10 -
Yume: An Interactive World Generation Model
Paper • 2507.17744 • Published • 92 -
STream3R: Scalable Sequential 3D Reconstruction with Causal Transformer
Paper • 2508.10893 • Published • 31
-
HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels
Paper • 2507.21809 • Published • 142 -
Yume: An Interactive World Generation Model
Paper • 2507.17744 • Published • 92 -
Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models
Paper • 2507.13344 • Published • 59 -
ComfyUI-Copilot: An Intelligent Assistant for Automated Workflow Development
Paper • 2506.05010 • Published • 80
-
Yume: An Interactive World Generation Model
Paper • 2507.17744 • Published • 92 -
SSRL: Self-Search Reinforcement Learning
Paper • 2508.10874 • Published • 97 -
The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity
Paper • 2506.06941 • Published • 16 -
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning
Paper • 2506.01939 • Published • 190
-
A Survey on Long-Video Storytelling Generation: Architectures, Consistency, and Cinematic Quality
Paper • 2507.07202 • Published • 25 -
StreamDiT: Real-Time Streaming Text-to-Video Generation
Paper • 2507.03745 • Published • 32 -
LongAnimation: Long Animation Generation with Dynamic Global-Local Memory
Paper • 2507.01945 • Published • 76 -
TokensGen: Harnessing Condensed Tokens for Long Video Generation
Paper • 2507.15728 • Published • 8
-
4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models
Paper • 2503.10437 • Published • 34 -
Open-Sora 2.0: Training a Commercial-Level Video Generation Model in $200k
Paper • 2503.09642 • Published • 20 -
VGGT: Visual Geometry Grounded Transformer
Paper • 2503.11651 • Published • 37 -
1000+ FPS 4D Gaussian Splatting for Dynamic Scene Rendering
Paper • 2503.16422 • Published • 15
-
Omni-Effects: Unified and Spatially-Controllable Visual Effects Generation
Paper • 2508.07981 • Published • 63 -
CharacterShot: Controllable and Consistent 4D Character Animation
Paper • 2508.07409 • Published • 39 -
ToonComposer: Streamlining Cartoon Production with Generative Post-Keyframing
Paper • 2508.10881 • Published • 53 -
Puppeteer: Rig and Animate Your 3D Models
Paper • 2508.10898 • Published • 33
-
HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels
Paper • 2507.21809 • Published • 142 -
Yume: An Interactive World Generation Model
Paper • 2507.17744 • Published • 92 -
Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models
Paper • 2507.13344 • Published • 59 -
ComfyUI-Copilot: An Intelligent Assistant for Automated Workflow Development
Paper • 2506.05010 • Published • 80
-
Arbitrary-steps Image Super-resolution via Diffusion Inversion
Paper • 2412.09013 • Published • 13 -
Deep Researcher with Test-Time Diffusion
Paper • 2507.16075 • Published • 68 -
nablaNABLA: Neighborhood Adaptive Block-Level Attention
Paper • 2507.13546 • Published • 126 -
Yume: An Interactive World Generation Model
Paper • 2507.17744 • Published • 92
-
Yume: An Interactive World Generation Model
Paper • 2507.17744 • Published • 92 -
SSRL: Self-Search Reinforcement Learning
Paper • 2508.10874 • Published • 97 -
The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity
Paper • 2506.06941 • Published • 16 -
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning
Paper • 2506.01939 • Published • 190
-
A Survey on Long-Video Storytelling Generation: Architectures, Consistency, and Cinematic Quality
Paper • 2507.07202 • Published • 25 -
StreamDiT: Real-Time Streaming Text-to-Video Generation
Paper • 2507.03745 • Published • 32 -
LongAnimation: Long Animation Generation with Dynamic Global-Local Memory
Paper • 2507.01945 • Published • 76 -
TokensGen: Harnessing Condensed Tokens for Long Video Generation
Paper • 2507.15728 • Published • 8
-
AnimaX: Animating the Inanimate in 3D with Joint Video-Pose Diffusion Models
Paper • 2506.19851 • Published • 60 -
SeqTex: Generate Mesh Textures in Video Sequence
Paper • 2507.04285 • Published • 10 -
Yume: An Interactive World Generation Model
Paper • 2507.17744 • Published • 92 -
STream3R: Scalable Sequential 3D Reconstruction with Causal Transformer
Paper • 2508.10893 • Published • 31
-
4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models
Paper • 2503.10437 • Published • 34 -
Open-Sora 2.0: Training a Commercial-Level Video Generation Model in $200k
Paper • 2503.09642 • Published • 20 -
VGGT: Visual Geometry Grounded Transformer
Paper • 2503.11651 • Published • 37 -
1000+ FPS 4D Gaussian Splatting for Dynamic Scene Rendering
Paper • 2503.16422 • Published • 15