minghao's picture

minghao

Liam-Liu

·

liam-liu-1b262631a

AI & ML interests

LLM, AD

Recent Activity

authored a paper 10 days ago

ChartNet: A Million-Scale, High-Quality Multimodal Dataset for Robust Chart Understanding

authored a paper 10 days ago

Justified or Just Convincing? Error Verifiability as a Dimension of LLM Quality

upvoted a paper 10 days ago

Justified or Just Convincing? Error Verifiability as a Dimension of LLM Quality

View all activity

Organizations

upvoted 2 papers 10 days ago

Justified or Just Convincing? Error Verifiability as a Dimension of LLM Quality

Paper • 2604.04418 • Published 11 days ago • 1

ChartNet: A Million-Scale, High-Quality Multimodal Dataset for Robust Chart Understanding

Paper • 2603.27064 • Published 20 days ago • 26

upvoted a paper 21 days ago

The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning

Paper • 2601.06002 • Published Jan 9 • 59

upvoted a paper 23 days ago

NL2Repo-Bench: Towards Long-Horizon Repository Generation Evaluation of Coding Agents

Paper • 2512.12730 • Published Dec 14, 2025 • 52

upvoted 2 papers about 2 months ago

EcoGym: Evaluating LLMs for Long-Horizon Plan-and-Execute in Interactive Economies

Paper • 2602.09514 • Published Feb 10 • 11

Search More, Think Less: Rethinking Long-Horizon Agentic Search for Efficiency and Generalization

Paper • 2602.22675 • Published Feb 26 • 23

upvoted a paper 3 months ago

PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents

Paper • 2406.13923 • Published Jun 20, 2024 • 25

upvoted a paper 5 months ago

Reasoning with Sampling: Your Base Model is Smarter Than You Think

Paper • 2510.14901 • Published Oct 16, 2025 • 48

upvoted 3 papers 6 months ago

ACADREASON: Exploring the Limits of Reasoning Models with Academic Research Problems

Paper • 2510.11652 • Published Oct 13, 2025 • 30

COIG-Writer: A High-Quality Dataset for Chinese Creative Writing with Thought Processes

Paper • 2510.14763 • Published Oct 16, 2025 • 14

SimKO: Simple Pass@K Policy Optimization

Paper • 2510.14807 • Published Oct 16, 2025 • 11

upvoted 4 papers 7 months ago

EditReward: A Human-Aligned Reward Model for Instruction-Guided Image Editing

Paper • 2509.26346 • Published Sep 30, 2025 • 19

DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

Paper • 2509.25454 • Published Sep 29, 2025 • 148

Inverse IFEval: Can LLMs Unlearn Stubborn Training Conventions to Follow Real Instructions?

Paper • 2509.04292 • Published Sep 4, 2025 • 58

Reverse-Engineered Reasoning for Open-Ended Generation

Paper • 2509.06160 • Published Sep 7, 2025 • 151

upvoted 5 papers 8 months ago

O-DisCo-Edit: Object Distortion Control for Unified Realistic Video Editing

Paper • 2509.01596 • Published Sep 1, 2025 • 4

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

Paper • 2509.01055 • Published Sep 1, 2025 • 81

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

Paper • 2509.02544 • Published Sep 2, 2025 • 127

SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

Paper • 2509.02479 • Published Sep 2, 2025 • 84

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2, 2025 • 238