-
Describe What You See with Multimodal Large Language Models to Enhance Video Recommendations
Paper • 2508.09789 • Published • 5 -
MM-BrowseComp: A Comprehensive Benchmark for Multimodal Browsing Agents
Paper • 2508.13186 • Published • 20 -
ZARA: Zero-shot Motion Time-Series Analysis via Knowledge and Retrieval Driven LLM Agents
Paper • 2508.04038 • Published • 1 -
Prompt Orchestration Markup Language
Paper • 2508.13948 • Published • 48
Saini
shankars
AI & ML interests
None yet
Recent Activity
upvoted an article 15 days ago
Build a Domain-Specific Embedding Model in Under a Day updated a collection about 1 month ago
AI-paper updated a collection about 2 months ago
AI-paperOrganizations
None yet