microsoft/Phi-4-multimodal-instruct
Automatic Speech Recognition • 6B • 329k • 1.58k
Generate a virtual try‑on image of a person wearing a garment
Upgraded to v1.0!
Scalable and Versatile 3D Generation from images
Audio Conditioned LipSync with Latent Diffusion Models
Explore the 2024 AI model release timeline