Collections
Discover the best community collections!
Collections including paper arxiv:2601.16725
-
LongCat-Flash-Thinking-2601 Technical Report
Paper • 2601.16725 • Published • 180 -
QuantaAlpha: An Evolutionary Framework for LLM-Driven Alpha Mining
Paper • 2602.07085 • Published • 190 -
A Very Big Video Reasoning Suite
Paper • 2602.20159 • Published • 519 -
AI Can Learn Scientific Taste
Paper • 2603.14473 • Published • 424
-
OpenVision 3: A Family of Unified Visual Encoder for Both Understanding and Generation
Paper • 2601.15369 • Published • 21 -
Stable-DiffCoder: Pushing the Frontier of Code Diffusion Large Language Model
Paper • 2601.15892 • Published • 53 -
Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders
Paper • 2601.16208 • Published • 55 -
NAACL: Noise-AwAre Verbal Confidence Calibration for LLMs in RAG Systems
Paper • 2601.11004 • Published • 30
-
A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Doubao 1.8, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5
Paper • 2601.10527 • Published • 26 -
PACEvolve: Enabling Long-Horizon Progress-Aware Consistent Evolution
Paper • 2601.10657 • Published • 20 -
TranslateGemma Technical Report
Paper • 2601.09012 • Published • 20 -
Recursive Language Models
Paper • 2512.24601 • Published • 94
-
Demystifying Reinforcement Learning in Agentic Reasoning
Paper • 2510.11701 • Published • 33 -
LoongRL:Reinforcement Learning for Advanced Reasoning over Long Contexts
Paper • 2510.19363 • Published • 63 -
Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning
Paper • 2510.25992 • Published • 48 -
Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence
Paper • 2511.07384 • Published • 19
-
LongCat-Flash-Thinking-2601 Technical Report
Paper • 2601.16725 • Published • 180 -
TopoCurate:Modeling Interaction Topology for Tool-Use Agent Training
Paper • 2603.01714 • Published -
SIGHT: Reinforcement Learning with Self-Evidence and Information-Gain Diverse Branching for Search Agent
Paper • 2602.11551 • Published -
Can Tool-Integrated Reinforcement Learning Generalize Across Diverse Domains?
Paper • 2510.11184 • Published • 1
-
LongCat-Flash-Thinking-2601 Technical Report
Paper • 2601.16725 • Published • 180 -
DeepSeek-OCR 2: Visual Causal Flow
Paper • 2601.20552 • Published • 68 -
Linear representations in language models can change dramatically over a conversation
Paper • 2601.20834 • Published • 21 -
BMAM: Brain-inspired Multi-Agent Memory Framework
Paper • 2601.20465 • Published • 5
-
Imagine-then-Plan: Agent Learning from Adaptive Lookahead with World Models
Paper • 2601.08955 • Published • 13 -
EvoFSM: Controllable Self-Evolution for Deep Research with Finite State Machines
Paper • 2601.09465 • Published • 42 -
MAXS: Meta-Adaptive Exploration with LLM Agents
Paper • 2601.09259 • Published • 96 -
Toward Efficient Agents: Memory, Tool learning, and Planning
Paper • 2601.14192 • Published • 57
-
TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times
Paper • 2512.16093 • Published • 97 -
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer
Paper • 2511.22699 • Published • 245 -
DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI
Paper • 2512.16676 • Published • 222 -
Sharp Monocular View Synthesis in Less Than a Second
Paper • 2512.10685 • Published • 29
-
StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?
Paper • 2510.02209 • Published • 57 -
MM-DREX: Multimodal-Driven Dynamic Routing of LLM Experts for Financial Trading
Paper • 2509.05080 • Published -
TradingGroup: A Multi-Agent Trading System with Self-Reflection and Data-Synthesis
Paper • 2508.17565 • Published • 1 -
QTMRL: An Agent for Quantitative Trading Decision-Making Based on Multi-Indicator Guided Reinforcement Learning
Paper • 2508.20467 • Published
-
LongCat-Flash-Thinking-2601 Technical Report
Paper • 2601.16725 • Published • 180 -
TopoCurate:Modeling Interaction Topology for Tool-Use Agent Training
Paper • 2603.01714 • Published -
SIGHT: Reinforcement Learning with Self-Evidence and Information-Gain Diverse Branching for Search Agent
Paper • 2602.11551 • Published -
Can Tool-Integrated Reinforcement Learning Generalize Across Diverse Domains?
Paper • 2510.11184 • Published • 1
-
LongCat-Flash-Thinking-2601 Technical Report
Paper • 2601.16725 • Published • 180 -
QuantaAlpha: An Evolutionary Framework for LLM-Driven Alpha Mining
Paper • 2602.07085 • Published • 190 -
A Very Big Video Reasoning Suite
Paper • 2602.20159 • Published • 519 -
AI Can Learn Scientific Taste
Paper • 2603.14473 • Published • 424
-
LongCat-Flash-Thinking-2601 Technical Report
Paper • 2601.16725 • Published • 180 -
DeepSeek-OCR 2: Visual Causal Flow
Paper • 2601.20552 • Published • 68 -
Linear representations in language models can change dramatically over a conversation
Paper • 2601.20834 • Published • 21 -
BMAM: Brain-inspired Multi-Agent Memory Framework
Paper • 2601.20465 • Published • 5
-
OpenVision 3: A Family of Unified Visual Encoder for Both Understanding and Generation
Paper • 2601.15369 • Published • 21 -
Stable-DiffCoder: Pushing the Frontier of Code Diffusion Large Language Model
Paper • 2601.15892 • Published • 53 -
Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders
Paper • 2601.16208 • Published • 55 -
NAACL: Noise-AwAre Verbal Confidence Calibration for LLMs in RAG Systems
Paper • 2601.11004 • Published • 30
-
Imagine-then-Plan: Agent Learning from Adaptive Lookahead with World Models
Paper • 2601.08955 • Published • 13 -
EvoFSM: Controllable Self-Evolution for Deep Research with Finite State Machines
Paper • 2601.09465 • Published • 42 -
MAXS: Meta-Adaptive Exploration with LLM Agents
Paper • 2601.09259 • Published • 96 -
Toward Efficient Agents: Memory, Tool learning, and Planning
Paper • 2601.14192 • Published • 57
-
A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Doubao 1.8, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5
Paper • 2601.10527 • Published • 26 -
PACEvolve: Enabling Long-Horizon Progress-Aware Consistent Evolution
Paper • 2601.10657 • Published • 20 -
TranslateGemma Technical Report
Paper • 2601.09012 • Published • 20 -
Recursive Language Models
Paper • 2512.24601 • Published • 94
-
TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times
Paper • 2512.16093 • Published • 97 -
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer
Paper • 2511.22699 • Published • 245 -
DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI
Paper • 2512.16676 • Published • 222 -
Sharp Monocular View Synthesis in Less Than a Second
Paper • 2512.10685 • Published • 29
-
Demystifying Reinforcement Learning in Agentic Reasoning
Paper • 2510.11701 • Published • 33 -
LoongRL:Reinforcement Learning for Advanced Reasoning over Long Contexts
Paper • 2510.19363 • Published • 63 -
Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning
Paper • 2510.25992 • Published • 48 -
Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence
Paper • 2511.07384 • Published • 19
-
StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?
Paper • 2510.02209 • Published • 57 -
MM-DREX: Multimodal-Driven Dynamic Routing of LLM Experts for Financial Trading
Paper • 2509.05080 • Published -
TradingGroup: A Multi-Agent Trading System with Self-Reflection and Data-Synthesis
Paper • 2508.17565 • Published • 1 -
QTMRL: An Agent for Quantitative Trading Decision-Making Based on Multi-Indicator Guided Reinforcement Learning
Paper • 2508.20467 • Published