peterlee6706 's Collections WeekDaily
updated
rStar2-Agent: Agentic Reasoning Technical Report
Paper
• 2508.20722
• Published • 118
AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs
Paper
• 2508.16153
• Published • 162
Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains
RLVR
Paper
• 2508.14029
• Published • 119
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility,
Reasoning, and Efficiency
Paper
• 2508.18265
• Published • 217
AWorld: Orchestrating the Training Recipe for Agentic AI
Paper
• 2508.20404
• Published • 38
Beyond Memorization: Extending Reasoning Depth with Recurrence, Memory
and Test-Time Compute Scaling
Paper
• 2508.16745
• Published • 29
Persuasion Dynamics in LLMs: Investigating Robustness and Adaptability
in Knowledge and Safety with DuET-PD
Paper
• 2508.17450
• Published • 9
Jailbreaking Commercial Black-Box LLMs with Explicitly Harmful Prompts
Paper
• 2508.10390
• Published • 1
InMind: Evaluating LLMs in Capturing and Applying Individual Human
Reasoning Styles
Paper
• 2508.16072
• Published • 4