
Qinghong (Kevin) Lin

KevinQHLin

http://qhlin.me/

AI & ML interests

Vision-Language Model, Video Understanding, Agent

Recent Activity

upvoted a paper 1 day ago

GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents

authored a paper 2 days ago

GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents

liked a dataset 3 days ago

markov-ai/computer-use-large


Organizations

upvoted a paper 1 day ago

GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents

Paper • 2604.07429 • Published 9 days ago • 106

upvoted a paper 4 days ago

FORGE: Fine-grained Multimodal Evaluation for Manufacturing Scenarios

Paper • 2604.07413 • Published 9 days ago • 91

upvoted a paper 7 days ago

MolmoWeb: Open Visual Web Agent and Open Data for the Open Web

Paper • 2604.08516 • Published 8 days ago • 41

upvoted a paper 8 days ago

RAGEN-2: Reasoning Collapse in Agentic RL

Paper • 2604.06268 • Published 10 days ago • 63

upvoted a paper 15 days ago

Terminal Agents Suffice for Enterprise Automation

Paper • 2604.00073 • Published 17 days ago • 95

upvoted a paper 22 days ago

CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents

Paper • 2603.24440 • Published 23 days ago • 96

upvoted a collection 2 months ago

OpenScholar_V1

Collection

The set of models, index, data associated with the paper "OpenScholar: Synthesizing Scientific Literature with Retrieval-Augmented LMs". • 7 items • Updated Mar 2 • 53

upvoted 4 papers 2 months ago

Improving Data and Reward Design for Scientific Reasoning in Large Language Models

Paper • 2602.08321 • Published Feb 9 • 43

upvoted 2 papers 3 months ago

FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection

Paper • 2601.03928 • Published Jan 7 • 16

ShowUI-π: Flow-based Generative Models as GUI Dexterous Hands

Paper • 2512.24965 • Published Dec 31, 2025 • 43

upvoted 5 papers 4 months ago

DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI

Paper • 2512.16676 • Published Dec 18, 2025 • 222

Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans?

Paper • 2512.13281 • Published Dec 15, 2025 • 65

EVOLVE-VLA: Test-Time Training from Environment Feedback for Vision-Language-Action Models

Paper • 2512.14666 • Published Dec 16, 2025 • 10

OmniPSD: Layered PSD Generation with Diffusion Transformer

Paper • 2512.09247 • Published Dec 10, 2025 • 51

PaperDebugger: A Plugin-Based Multi-Agent System for In-Editor Academic Writing, Review, and Editing

Paper • 2512.02589 • Published Dec 2, 2025 • 73

upvoted 2 papers 5 months ago

The Image as Its Own Reward: Reinforcement Learning with Adversarial Reward for Image Generation

Paper • 2511.20256 • Published Nov 25, 2025 • 28

Computer-Use Agents as Judges for Generative User Interface

Paper • 2511.15567 • Published Nov 19, 2025 • 54

