Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Wenqi Zhang's picture
23 65 24

Wenqi Zhang

zwq2018
meng-03's profile picture SteveSHEN's profile picture Leoyfan's profile picture
·
  • zwq2018

AI & ML interests

LLM, Multimodal, Robotics

Recent Activity

upvoted a paper about 16 hours ago
ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents
upvoted a paper 4 days ago
SkillClaw: Let Skills Evolve Collectively with Agentic Evolver
upvoted a paper 4 days ago
HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents
View all activity

Organizations

Zhejiang University's profile picture Language Technology Lab at Alibaba DAMO Academy's profile picture Zhejiang University's profile picture embodied_group's profile picture

authored a paper about 1 year ago

Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks

Paper • 2503.21696 • Published Mar 27, 2025 • 23
authored a paper over 1 year ago

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Paper • 2501.00958 • Published Jan 1, 2025 • 110
authored 3 papers almost 2 years ago

Data-Copilot: Bridging Billions of Data and Humans with Autonomous Workflow

Paper • 2306.07209 • Published Jun 12, 2023 • 2

Self-Contrast: Better Reflection Through Inconsistent Solving Perspectives

Paper • 2401.02009 • Published Jan 4, 2024 • 1

Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model

Paper • 2407.07053 • Published Jul 9, 2024 • 47
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs