to read
updated
GenEx: Generating an Explorable World
Paper
• 2412.09624
• Published • 98
Image-to-Video
• Updated • 130
• 610
Track4Gen: Teaching Video Diffusion Models to Track Points Improves
Video Generation
Paper
• 2412.06016
• Published • 20
Byte Latent Transformer: Patches Scale Better Than Tokens
Paper
• 2412.09871
• Published • 108
Paper
• 2412.15115
• Published • 377
Alibaba-NLP/gte-multilingual-mlm-base
Fill-Mask
• 0.3B • Updated • 1.26k
• 16
answerdotai/ModernBERT-large
Fill-Mask
• Updated • 216k
• 465
Parallelized Autoregressive Visual Generation
Paper
• 2412.15119
• Published • 53
Taming Multimodal Joint Training for High-Quality Video-to-Audio
Synthesis
Paper
• 2412.15322
• Published • 20
CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers
Up
Paper
• 2412.16112
• Published • 23
The GAN is dead; long live the GAN! A Modern GAN Baseline
Paper
• 2501.05441
• Published • 98
Fill-Mask
• 2B • Updated • 1.14k
• 67
"Principal Components" Enable A New Language of Images
Paper
• 2503.08685
• Published • 12
Causal-Copilot: An Autonomous Causal Analysis Agent
Paper
• 2504.13263
• Published • 7
Paper2Code: Automating Code Generation from Scientific Papers in Machine
Learning
Paper
• 2504.17192
• Published • 124
Vid2World: Crafting Video Diffusion Models to Interactive World Models
Paper
• 2505.14357
• Published • 27
PixNerd: Pixel Neural Field Diffusion
Paper
• 2507.23268
• Published • 52