SWE-Dev: Building Software Engineering Agents with Training and Inference Scaling Paper • 2506.07636 • Published Jun 9, 2025 • 3
Is Imitation All You Need? Generalized Decision-Making with Dual-Phase Training Paper • 2307.07909 • Published Jul 16, 2023
TreeRL: LLM Reinforcement Learning with On-Policy Tree Search Paper • 2506.11902 • Published Jun 13, 2025
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models Paper • 2508.06471 • Published Aug 8, 2025 • 211
DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL Paper • 2509.10446 • Published Sep 12, 2025 • 3
AgentRL: Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework Paper • 2510.04206 • Published Oct 5, 2025 • 3
Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling Paper • 2501.11651 • Published Jan 20, 2025 • 1
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning Paper • 2507.01006 • Published Jul 1, 2025 • 254
ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline Paper • 2404.02893 • Published Apr 3, 2024 • 21
ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools Paper • 2406.12793 • Published Jun 18, 2024 • 34