arxiv:1808.09132

Mapping Natural Language Commands to Web Elements

Published on Aug 28, 2018

Abstract

AI-generated summary: A dataset and baseline models for grounding language commands in web page elements are proposed, capturing functional, relational, and visual reasoning.

The web provides a rich, open-domain environment with textual, structural, and spatial properties. We propose a new task for grounding language in this environment: given a natural language command (e.g., "click on the second article"), choose the correct element on the web page (e.g., a hyperlink or text box). We collected a dataset of over 50,000 commands that capture various phenomena such as functional references (e.g., "find who made this site"), relational reasoning (e.g., "article by john"), and visual reasoning (e.g., "top-most article"). We also implemented and analyzed three baseline models that capture different phenomena present in the dataset.
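
To make the input and output of the task concrete, here is a minimal sketch in Python. It is not one of the paper's three baseline models, which the abstract does not describe; it is a naive token-overlap scorer used only for illustration. The `Element` class, the `choose_element` function, and the sample page are all hypothetical names and data, and a real page element would carry much more than a tag and a text string.

```python
import re
from dataclasses import dataclass


@dataclass
class Element:
    """Hypothetical, simplified stand-in for a web page element."""
    tag: str
    text: str


def tokenize(s: str) -> set[str]:
    """Lowercase the string and split it into alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", s.lower()))


def choose_element(command: str, elements: list[Element]) -> Element:
    """Return the element whose text shares the most tokens with the command.

    Assumption: plain lexical overlap, not a method taken from the paper.
    Ties go to the earliest element in the list.
    """
    command_tokens = tokenize(command)
    return max(elements, key=lambda e: len(command_tokens & tokenize(e.text)))


# Hypothetical page with three candidate elements.
page = [
    Element("a", "Home"),
    Element("a", "Article by John"),
    Element("input", "Search"),
]

best = choose_element("open the article by john", page)
print(best.tag, "|", best.text)  # prints: a | Article by John
```

A scorer like this can only handle commands that repeat the element's own text. It has no way to resolve a functional reference such as "find who made this site" or a visual one such as "top-most article", since neither mentions the target's wording; these are the phenomena the dataset is described as capturing.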
