- QuantAgent: Price-Driven Multi-Agent LLMs for High-Frequency Trading
  Paper • 2509.09995 • Published • 16
- Learning to Discover at Test Time
  Paper • 2601.16175 • Published • 44
- Latent Chain-of-Thought as Planning: Decoupling Reasoning from Verbalization
  Paper • 2601.21358 • Published • 7
- QTMRL: An Agent for Quantitative Trading Decision-Making Based on Multi-Indicator Guided Reinforcement Learning
  Paper • 2508.20467 • Published
Collections including paper arxiv:2601.16175
- EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience
  Paper • 2601.15876 • Published • 92
- LLM-in-Sandbox Elicits General Agentic Intelligence
  Paper • 2601.16206 • Published • 86
- Qwen3-TTS Technical Report
  Paper • 2601.15621 • Published • 74
- Learning to Discover at Test Time
  Paper • 2601.16175 • Published • 44

- Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward
  Paper • 2510.03222 • Published • 76
- In-the-Flow Agentic System Optimization for Effective Planning and Tool Use
  Paper • 2510.05592 • Published • 110
- Less is More: Recursive Reasoning with Tiny Networks
  Paper • 2510.04871 • Published • 513
- Multi-Agent Tool-Integrated Policy Optimization
  Paper • 2510.04678 • Published • 31

- StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?
  Paper • 2510.02209 • Published • 57
- MM-DREX: Multimodal-Driven Dynamic Routing of LLM Experts for Financial Trading
  Paper • 2509.05080 • Published
- TradingGroup: A Multi-Agent Trading System with Self-Reflection and Data-Synthesis
  Paper • 2508.17565 • Published • 1
- QTMRL: An Agent for Quantitative Trading Decision-Making Based on Multi-Indicator Guided Reinforcement Learning
  Paper • 2508.20467 • Published

- Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models
  Paper • 2602.12036 • Published • 93
- Reinforcement Learning for Self-Improving Agent with Skill Library
  Paper • 2512.17102 • Published • 42
- Diffusion Knows Transparency: Repurposing Video Diffusion for Transparent Object Depth and Normal Estimation
  Paper • 2512.23705 • Published • 45
- Schoenfeld's Anatomy of Mathematical Reasoning by Language Models
  Paper • 2512.19995 • Published • 16

- Guided Self-Evolving LLMs with Minimal Human Supervision
  Paper • 2512.02472 • Published • 55
- DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search
  Paper • 2509.25454 • Published • 148
- Video Reasoning without Training
  Paper • 2510.17045 • Published • 8
- Agent Learning via Early Experience
  Paper • 2510.08558 • Published • 277

- CALM Before the STORM: Unlocking Native Reasoning for Optimization Modeling
  Paper • 2510.04204 • Published • 21
- Visual Diffusion Models are Geometric Solvers
  Paper • 2510.21697 • Published • 20
- Optimize Any Topology: A Foundation Model for Shape- and Resolution-Free Structural Topology Optimization
  Paper • 2510.23667 • Published • 3
- Differentiable Evolutionary Reinforcement Learning
  Paper • 2512.13399 • Published • 22

- NovelSeek: When Agent Becomes the Scientist -- Building Closed-Loop System from Hypothesis to Verification
  Paper • 2505.16938 • Published • 121
- AlphaApollo: Orchestrating Foundation Models and Professional Tools into a Self-Evolving System for Deep Agentic Reasoning
  Paper • 2510.06261 • Published • 6
- Less is More: Recursive Reasoning with Tiny Networks
  Paper • 2510.04871 • Published • 513
- AlphaResearch: Accelerating New Algorithm Discovery with Language Models
  Paper • 2511.08522 • Published • 18