Haolin Liu's picture

19

Haolin Liu

lhl616

AI & ML interests

None yet

Recent Activity

upvoted a paper about 24 hours ago

Turing Test on Screen: A Benchmark for Mobile GUI Agent Humanization

upvoted a paper 2 months ago

Training Data Efficiency in Multimodal Process Reward Models

upvoted a paper 2 months ago

Parallel-Probe: Towards Efficient Parallel Thinking via 2D Probing

View all activity

Organizations

None yet

upvoted a paper about 24 hours ago

Turing Test on Screen: A Benchmark for Mobile GUI Agent Humanization

Paper • 2604.09574 • Published Feb 24 • 27

upvoted 2 papers 2 months ago

Training Data Efficiency in Multimodal Process Reward Models

Paper • 2602.04145 • Published Feb 4 • 79

Parallel-Probe: Towards Efficient Parallel Thinking via 2D Probing

Paper • 2602.03845 • Published Feb 3 • 27

upvoted 2 papers 3 months ago

RelayLLM: Efficient Reasoning via Collaborative Decoding

Paper • 2601.05167 • Published Jan 8 • 31

Benchmark^2: Systematic Evaluation of LLM Benchmarks

Paper • 2601.03986 • Published Jan 7 • 34

upvoted 2 papers 4 months ago

Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning

Paper • 2512.15687 • Published Dec 17, 2025 • 22

MotionEdit: Benchmarking and Learning Motion-Centric Image Editing

Paper • 2512.10284 • Published Dec 11, 2025 • 26

updated a model 5 months ago

lhl616/Qwen3-8B-axon-error-aware-128-8-ratio

8B • Updated Nov 29, 2025 • 4

published a model 5 months ago

lhl616/Qwen3-8B-axon-error-aware-128-8-ratio

8B • Updated Nov 29, 2025 • 4

updated a model 5 months ago

lhl616/Qwen3-8B-axon-error-aware-128-8-mixed

8B • Updated Nov 29, 2025 • 2

published a model 5 months ago

lhl616/Qwen3-8B-axon-error-aware-128-8-mixed

8B • Updated Nov 29, 2025 • 2

updated a model 5 months ago

lhl616/Qwen3-8B-Base-axon-ppo

8B • Updated Nov 29, 2025 • 1

published a model 5 months ago

lhl616/Qwen3-8B-Base-axon-ppo

8B • Updated Nov 29, 2025 • 1

updated a model 5 months ago

lhl616/Qwen3-8B-Base-axon-grpo-step-128-8

8B • Updated Nov 29, 2025 • 2

published a model 5 months ago

lhl616/Qwen3-8B-Base-axon-grpo-step-128-8

8B • Updated Nov 29, 2025 • 2

updated a model 5 months ago

lhl616/Qwen3-8B-Base-axon-error-aware-128-8-ratio-new

8B • Updated Nov 29, 2025 • 3

published a model 5 months ago

lhl616/Qwen3-8B-Base-axon-error-aware-128-8-ratio-new

8B • Updated Nov 29, 2025 • 3

updated a model 5 months ago

lhl616/Qwen3-8B-Base-axon-error-aware-128-8-dense-passk

8B • Updated Nov 29, 2025 • 2

published a model 5 months ago

lhl616/Qwen3-8B-Base-axon-error-aware-128-8-dense-passk

8B • Updated Nov 29, 2025 • 2

updated a model 5 months ago

lhl616/Qwen3-8B-Base-axon-error-aware-128-8-dense-nstd-0.5-0.8-step-2

8B • Updated Nov 29, 2025 • 2