VFig Image2SVG Demo
VFig converts any diagram image into editable SVG code.
Separate audio into stems using HT-Demucs and Spleeter
Restore and enhance faces in photos
State-of-the-art music analysis with multi-scale datasets
Next-Gen High-Resolution 3D Model Generation
Universal Image Editing is worth a single LoRA
Extraction & Reconstruction for Efficient Speech Separation
Separate sounds from audio mixtures using text prompts
Relight photos with AI using custom lighting prompts
Run 3D human pose estimation with images
Transcribe audio files into text
Generate speech from text using a reference audio
Demo of Normalized Attention Guidance for 4 steps Wan2.1
Transcribe audio to MIDI
Nanonets / olmOCR / RolmOCR / Aya-Vision / Qwen2-VL-OCR