
Boyuan Zheng

boyuanzheng010
https://boyuanzheng010.github.io/
  • boyuan__zheng
  • boyuanzheng010

AI & ML interests

Language Agents, Multilinguality

Organizations

  • OSU NLP Group
  • Center for Language and Speech Processing @ JHU
  • MMMU
  • Orby / OSU

authored 2 papers over 1 year ago

Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents

Paper • 2411.06559 • Published Nov 10, 2024 • 16

Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents

Paper • 2410.05243 • Published Oct 7, 2024 • 20
authored 3 papers about 2 years ago

Flatness-Aware Prompt Selection Improves Accuracy and Sample Efficiency

Paper • 2305.10713 • Published May 18, 2023

GPT-4V(ision) is a Generalist Web Agent, if Grounded

Paper • 2401.01614 • Published Jan 3, 2024 • 22

Dual-View Visual Contextualization for Web Navigation

Paper • 2402.04476 • Published Feb 6, 2024
authored 3 papers over 2 years ago

MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI

Paper • 2311.16502 • Published Nov 27, 2023 • 40

Mind2Web: Towards a Generalist Agent for the Web

Paper • 2306.06070 • Published Jun 9, 2023 • 21

Multilingual Coreference Resolution in Multiparty Dialogue

Paper • 2208.01307 • Published Aug 2, 2022