-
End-to-End Goal-Driven Web Navigation
Paper • 1602.02261 • Published -
Learning Language Games through Interaction
Paper • 1606.02447 • Published -
Naturalizing a Programming Language via Interactive Learning
Paper • 1704.06956 • Published -
Reinforcement Learning on Web Interfaces Using Workflow-Guided Exploration
Paper • 1802.08802 • Published • 2
Collections
Discover the best community collections!
Collections including paper arxiv:2311.07562
-
microsoft/markuplm-base
Updated • 346k • 27 -
microsoft/markuplm-large-finetuned-websrc
Question Answering • Updated • 112 • 10 -
osunlp/Mind2Web
Viewer • Updated • 253 • 5.45k • 124 -
GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI Navigation
Paper • 2311.07562 • Published • 15
-
Visual In-Context Prompting
Paper • 2311.13601 • Published • 18 -
Textbooks Are All You Need
Paper • 2306.11644 • Published • 154 -
AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation Framework
Paper • 2308.08155 • Published • 11 -
LIDA: A Tool for Automatic Generation of Grammar-Agnostic Visualizations and Infographics using Large Language Models
Paper • 2303.02927 • Published • 3
-
Visual In-Context Prompting
Paper • 2311.13601 • Published • 18 -
AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation Framework
Paper • 2308.08155 • Published • 11 -
LIDA: A Tool for Automatic Generation of Grammar-Agnostic Visualizations and Infographics using Large Language Models
Paper • 2303.02927 • Published • 3 -
The Impact of Large Language Models on Scientific Discovery: a Preliminary Study using GPT-4
Paper • 2311.07361 • Published • 14
-
GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI Navigation
Paper • 2311.07562 • Published • 15 -
VACE: All-in-One Video Creation and Editing
Paper • 2503.07598 • Published • 56 -
EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling
Paper • 2502.09509 • Published • 9
-
End-to-End Goal-Driven Web Navigation
Paper • 1602.02261 • Published -
Learning Language Games through Interaction
Paper • 1606.02447 • Published -
Naturalizing a Programming Language via Interactive Learning
Paper • 1704.06956 • Published -
Reinforcement Learning on Web Interfaces Using Workflow-Guided Exploration
Paper • 1802.08802 • Published • 2
-
Visual In-Context Prompting
Paper • 2311.13601 • Published • 18 -
Textbooks Are All You Need
Paper • 2306.11644 • Published • 154 -
AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation Framework
Paper • 2308.08155 • Published • 11 -
LIDA: A Tool for Automatic Generation of Grammar-Agnostic Visualizations and Infographics using Large Language Models
Paper • 2303.02927 • Published • 3
-
Visual In-Context Prompting
Paper • 2311.13601 • Published • 18 -
AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation Framework
Paper • 2308.08155 • Published • 11 -
LIDA: A Tool for Automatic Generation of Grammar-Agnostic Visualizations and Infographics using Large Language Models
Paper • 2303.02927 • Published • 3 -
The Impact of Large Language Models on Scientific Discovery: a Preliminary Study using GPT-4
Paper • 2311.07361 • Published • 14
-
GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI Navigation
Paper • 2311.07562 • Published • 15 -
VACE: All-in-One Video Creation and Editing
Paper • 2503.07598 • Published • 56 -
EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling
Paper • 2502.09509 • Published • 9
-
microsoft/markuplm-base
Updated • 346k • 27 -
microsoft/markuplm-large-finetuned-websrc
Question Answering • Updated • 112 • 10 -
osunlp/Mind2Web
Viewer • Updated • 253 • 5.45k • 124 -
GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI Navigation
Paper • 2311.07562 • Published • 15