Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2601.16175

QuantAgent: Price-Driven Multi-Agent LLMs for High-Frequency Trading

Paper • 2509.09995 • Published Sep 12, 2025 • 16
Learning to Discover at Test Time

Paper • 2601.16175 • Published Jan 22 • 44
Latent Chain-of-Thought as Planning: Decoupling Reasoning from Verbalization

Paper • 2601.21358 • Published Jan 29 • 7
QTMRL: An Agent for Quantitative Trading Decision-Making Based on Multi-Indicator Guided Reinforcement Learning

Paper • 2508.20467 • Published Aug 28, 2025

Learning to Discover at Test Time

Paper • 2601.16175 • Published Jan 22 • 44

EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience

Paper • 2601.15876 • Published Jan 22 • 92
LLM-in-Sandbox Elicits General Agentic Intelligence

Paper • 2601.16206 • Published Jan 22 • 86
Qwen3-TTS Technical Report

Paper • 2601.15621 • Published Jan 22 • 74
Learning to Discover at Test Time

Paper • 2601.16175 • Published Jan 22 • 44

Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward

Paper • 2510.03222 • Published Oct 3, 2025 • 76
In-the-Flow Agentic System Optimization for Effective Planning and Tool Use

Paper • 2510.05592 • Published Oct 7, 2025 • 110
Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 513
Multi-Agent Tool-Integrated Policy Optimization

Paper • 2510.04678 • Published Oct 6, 2025 • 31

StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?

Paper • 2510.02209 • Published Oct 2, 2025 • 57
MM-DREX: Multimodal-Driven Dynamic Routing of LLM Experts for Financial Trading

Paper • 2509.05080 • Published Sep 5, 2025
TradingGroup: A Multi-Agent Trading System with Self-Reflection and Data-Synthesis

Paper • 2508.17565 • Published Aug 25, 2025 • 1
QTMRL: An Agent for Quantitative Trading Decision-Making Based on Multi-Indicator Guided Reinforcement Learning

Paper • 2508.20467 • Published Aug 28, 2025

Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models

Paper • 2602.12036 • Published Feb 12 • 93
Reinforcement Learning for Self-Improving Agent with Skill Library

Paper • 2512.17102 • Published Dec 18, 2025 • 42
Diffusion Knows Transparency: Repurposing Video Diffusion for Transparent Object Depth and Normal Estimation

Paper • 2512.23705 • Published Dec 29, 2025 • 45
Schoenfeld's Anatomy of Mathematical Reasoning by Language Models

Paper • 2512.19995 • Published Dec 23, 2025 • 16

Continual Learning

Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting

Paper • 2601.02151 • Published Jan 5 • 113
Learning to Discover at Test Time

Paper • 2601.16175 • Published Jan 22 • 44

Guided Self-Evolving LLMs with Minimal Human Supervision

Paper • 2512.02472 • Published Dec 2, 2025 • 55
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

Paper • 2509.25454 • Published Sep 29, 2025 • 148
Video Reasoning without Training

Paper • 2510.17045 • Published Oct 19, 2025 • 8
Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9, 2025 • 277

Optimization in some sense

CALM Before the STORM: Unlocking Native Reasoning for Optimization Modeling

Paper • 2510.04204 • Published Oct 5, 2025 • 21
Visual Diffusion Models are Geometric Solvers

Paper • 2510.21697 • Published Oct 24, 2025 • 20
Optimize Any Topology: A Foundation Model for Shape- and Resolution-Free Structural Topology Optimization

Paper • 2510.23667 • Published Oct 26, 2025 • 3
Differentiable Evolutionary Reinforcement Learning

Paper • 2512.13399 • Published Dec 15, 2025 • 22

NovelSeek: When Agent Becomes the Scientist -- Building Closed-Loop System from Hypothesis to Verification

Paper • 2505.16938 • Published May 22, 2025 • 121
AlphaApollo: Orchestrating Foundation Models and Professional Tools into a Self-Evolving System for Deep Agentic Reasoning

Paper • 2510.06261 • Published Oct 5, 2025 • 6
Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 513
AlphaResearch: Accelerating New Algorithm Discovery with Language Models

Paper • 2511.08522 • Published Nov 11, 2025 • 18

QuantAgent: Price-Driven Multi-Agent LLMs for High-Frequency Trading

Paper • 2509.09995 • Published Sep 12, 2025 • 16
Learning to Discover at Test Time

Paper • 2601.16175 • Published Jan 22 • 44
Latent Chain-of-Thought as Planning: Decoupling Reasoning from Verbalization

Paper • 2601.21358 • Published Jan 29 • 7
QTMRL: An Agent for Quantitative Trading Decision-Making Based on Multi-Indicator Guided Reinforcement Learning

Paper • 2508.20467 • Published Aug 28, 2025

Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models

Paper • 2602.12036 • Published Feb 12 • 93
Reinforcement Learning for Self-Improving Agent with Skill Library

Paper • 2512.17102 • Published Dec 18, 2025 • 42
Diffusion Knows Transparency: Repurposing Video Diffusion for Transparent Object Depth and Normal Estimation

Paper • 2512.23705 • Published Dec 29, 2025 • 45
Schoenfeld's Anatomy of Mathematical Reasoning by Language Models

Paper • 2512.19995 • Published Dec 23, 2025 • 16

Learning to Discover at Test Time

Paper • 2601.16175 • Published Jan 22 • 44

Continual Learning

Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting

Paper • 2601.02151 • Published Jan 5 • 113
Learning to Discover at Test Time

Paper • 2601.16175 • Published Jan 22 • 44

EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience

Paper • 2601.15876 • Published Jan 22 • 92
LLM-in-Sandbox Elicits General Agentic Intelligence

Paper • 2601.16206 • Published Jan 22 • 86
Qwen3-TTS Technical Report

Paper • 2601.15621 • Published Jan 22 • 74
Learning to Discover at Test Time

Paper • 2601.16175 • Published Jan 22 • 44

Guided Self-Evolving LLMs with Minimal Human Supervision

Paper • 2512.02472 • Published Dec 2, 2025 • 55
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

Paper • 2509.25454 • Published Sep 29, 2025 • 148
Video Reasoning without Training

Paper • 2510.17045 • Published Oct 19, 2025 • 8
Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9, 2025 • 277

Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward

Paper • 2510.03222 • Published Oct 3, 2025 • 76
In-the-Flow Agentic System Optimization for Effective Planning and Tool Use

Paper • 2510.05592 • Published Oct 7, 2025 • 110
Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 513
Multi-Agent Tool-Integrated Policy Optimization

Paper • 2510.04678 • Published Oct 6, 2025 • 31

Optimization in some sense

CALM Before the STORM: Unlocking Native Reasoning for Optimization Modeling

Paper • 2510.04204 • Published Oct 5, 2025 • 21
Visual Diffusion Models are Geometric Solvers

Paper • 2510.21697 • Published Oct 24, 2025 • 20
Optimize Any Topology: A Foundation Model for Shape- and Resolution-Free Structural Topology Optimization

Paper • 2510.23667 • Published Oct 26, 2025 • 3
Differentiable Evolutionary Reinforcement Learning

Paper • 2512.13399 • Published Dec 15, 2025 • 22

StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?

Paper • 2510.02209 • Published Oct 2, 2025 • 57
MM-DREX: Multimodal-Driven Dynamic Routing of LLM Experts for Financial Trading

Paper • 2509.05080 • Published Sep 5, 2025
TradingGroup: A Multi-Agent Trading System with Self-Reflection and Data-Synthesis

Paper • 2508.17565 • Published Aug 25, 2025 • 1
QTMRL: An Agent for Quantitative Trading Decision-Making Based on Multi-Indicator Guided Reinforcement Learning

Paper • 2508.20467 • Published Aug 28, 2025

NovelSeek: When Agent Becomes the Scientist -- Building Closed-Loop System from Hypothesis to Verification

Paper • 2505.16938 • Published May 22, 2025 • 121
AlphaApollo: Orchestrating Foundation Models and Professional Tools into a Self-Evolving System for Deep Agentic Reasoning

Paper • 2510.06261 • Published Oct 5, 2025 • 6
Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 513
AlphaResearch: Accelerating New Algorithm Discovery with Language Models

Paper • 2511.08522 • Published Nov 11, 2025 • 18

Previous
1
2
Next

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs