Generate a 512‑dimensional CLIP embedding for an uploaded image
Generate English search terms from Arabic text