Jingqing Ruan

Amanda2023

AI & ML interests

None yet

Recent Activity

authored a paper 29 days ago

TRUST-SQL: Tool-Integrated Multi-Turn Reinforcement Learning for Text-to-SQL over Unknown Schemas

Organizations

None yet

upvoted a paper 1 day ago

SCOPE: Signal-Calibrated On-Policy Distillation Enhancement with Dual-Path Adaptive Weighting

Paper • 2604.10688 • Published 4 days ago • 6

upvoted a collection 9 days ago

PaTaRM

PaTaRM is a Generative Reward Model (GRM) for RLHF alignment. • 4 items • Updated 14 days ago • 1

upvoted 2 papers 29 days ago

MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification

Paper • 2603.15726 • Published about 1 month ago • 185

TRUST-SQL: Tool-Integrated Multi-Turn Reinforcement Learning for Text-to-SQL over Unknown Schemas

Paper • 2603.16448 • Published 30 days ago • 58

upvoted a paper 4 months ago

Step-DeepResearch Technical Report

Paper • 2512.20491 • Published Dec 23, 2025 • 87

upvoted a paper 11 months ago

When to Continue Thinking: Adaptive Thinking Mode Switching for Efficient Reasoning

Paper • 2505.15400 • Published May 21, 2025 • 23