OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation Paper โข 2604.11804 โข Published 3 days ago โข 66
Running on Zero MCP 679 Wan2.2 14B Fast Preview ๐ 679 generate a video from an image with a text prompt
MiroEval: Benchmarking Multimodal Deep Research Agents in Process and Outcome Paper โข 2603.28407 โข Published 17 days ago โข 68
DreamLite: A Lightweight On-Device Unified Model for Image Generation and Editing Paper โข 2603.28713 โข Published 17 days ago โข 19