Estimate depth from a single photo
Highlight objects in photos using natural language prompts
Analyze documents to extract a summary, entities, and key phrases