FlowInOne: Unifying Multimodal Generation as Image-in, Image-out Flow Matching
Paper • 2604.06757 • Published • 10
Kiwi-Edit: Versatile Video Editing via Instruction and Reference Guidance
FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection