-
SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models
Paper • 2506.04180 • Published • 34 -
AniMaker: Automated Multi-Agent Animated Storytelling with MCTS-Driven Clip Generation
Paper • 2506.10540 • Published • 37 -
AutoMind: Adaptive Knowledgeable Agent for Automated Data Science
Paper • 2506.10974 • Published • 19 -
SPAR: Scholar Paper Retrieval with LLM-based Agents for Enhanced Academic Search
Paper • 2507.15245 • Published • 11
Collections
Discover the best community collections!
Collections including paper arxiv:2510.24701
-
Fathom-DeepResearch: Unlocking Long Horizon Information Retrieval and Synthesis for SLMs
Paper • 2509.24107 • Published • 80 -
Beyond Turn Limits: Training Deep Search Agents with Dynamic Context Window
Paper • 2510.08276 • Published • 10 -
DeepWideSearch: Benchmarking Depth and Width in Agentic Information Seeking
Paper • 2510.20168 • Published • 28 -
Enterprise Deep Research: Steerable Multi-Agent Deep Research for Enterprise Analytics
Paper • 2510.17797 • Published • 11
-
Tongyi DeepResearch Technical Report
Paper • 2510.24701 • Published • 103 -
Kimi Linear: An Expressive, Efficient Attention Architecture
Paper • 2510.26692 • Published • 132 -
Natural-Language Agent Harnesses
Paper • 2603.25723 • Published • 25 -
CORAL: Towards Autonomous Multi-Agent Evolution for Open-Ended Discovery
Paper • 2604.01658 • Published • 54
-
RL makes MLLMs see better than SFT
Paper • 2510.16333 • Published • 49 -
Uniworld-V2: Reinforce Image Editing with Diffusion Negative-aware Finetuning and MLLM Implicit Feedback
Paper • 2510.16888 • Published • 22 -
Reasoning with Sampling: Your Base Model is Smarter Than You Think
Paper • 2510.14901 • Published • 48 -
Sample By Step, Optimize By Chunk: Chunk-Level GRPO For Text-to-Image Generation
Paper • 2510.21583 • Published • 31
-
AGENTIF: Benchmarking Instruction Following of Large Language Models in Agentic Scenarios
Paper • 2505.16944 • Published • 8 -
DeepResearchGym: A Free, Transparent, and Reproducible Evaluation Sandbox for Deep Research
Paper • 2505.19253 • Published • 34 -
The Era of Agentic Organization: Learning to Organize with Language Models
Paper • 2510.26658 • Published • 29 -
Tongyi DeepResearch Technical Report
Paper • 2510.24701 • Published • 103
-
DeepAgent: A General Reasoning Agent with Scalable Toolsets
Paper • 2510.21618 • Published • 103 -
A Survey of Data Agents: Emerging Paradigm or Overstated Hype?
Paper • 2510.23587 • Published • 67 -
Tongyi DeepResearch Technical Report
Paper • 2510.24701 • Published • 103 -
DeepAnalyze: Agentic Large Language Models for Autonomous Data Science
Paper • 2510.16872 • Published • 112
-
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
Paper • 2509.02547 • Published • 238 -
Tongyi DeepResearch Technical Report
Paper • 2510.24701 • Published • 103 -
PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-3b-em-ppo-v0.3
3B • Updated • 956 -
PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-3b-em-grpo-v0.3
3B • Updated • 321 • 1
-
SFR-DeepResearch: Towards Effective Reinforcement Learning for Autonomously Reasoning Single Agents
Paper • 2509.06283 • Published • 17 -
Alibaba-NLP/Tongyi-DeepResearch-30B-A3B
Text Generation • 31B • Updated • 18k • 809 -
DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents
Paper • 2506.11763 • Published • 74 -
Open Data Synthesis For Deep Research
Paper • 2509.00375 • Published • 72
-
AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning
Paper • 2402.15506 • Published • 18 -
AutoWebGLM: Bootstrap And Reinforce A Large Language Model-based Web Navigating Agent
Paper • 2404.03648 • Published • 29 -
Similarity is Not All You Need: Endowing Retrieval Augmented Generation with Multi Layered Thoughts
Paper • 2405.19893 • Published • 34 -
Parrot: Efficient Serving of LLM-based Applications with Semantic Variable
Paper • 2405.19888 • Published • 7
-
SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models
Paper • 2506.04180 • Published • 34 -
AniMaker: Automated Multi-Agent Animated Storytelling with MCTS-Driven Clip Generation
Paper • 2506.10540 • Published • 37 -
AutoMind: Adaptive Knowledgeable Agent for Automated Data Science
Paper • 2506.10974 • Published • 19 -
SPAR: Scholar Paper Retrieval with LLM-based Agents for Enhanced Academic Search
Paper • 2507.15245 • Published • 11
-
DeepAgent: A General Reasoning Agent with Scalable Toolsets
Paper • 2510.21618 • Published • 103 -
A Survey of Data Agents: Emerging Paradigm or Overstated Hype?
Paper • 2510.23587 • Published • 67 -
Tongyi DeepResearch Technical Report
Paper • 2510.24701 • Published • 103 -
DeepAnalyze: Agentic Large Language Models for Autonomous Data Science
Paper • 2510.16872 • Published • 112
-
Fathom-DeepResearch: Unlocking Long Horizon Information Retrieval and Synthesis for SLMs
Paper • 2509.24107 • Published • 80 -
Beyond Turn Limits: Training Deep Search Agents with Dynamic Context Window
Paper • 2510.08276 • Published • 10 -
DeepWideSearch: Benchmarking Depth and Width in Agentic Information Seeking
Paper • 2510.20168 • Published • 28 -
Enterprise Deep Research: Steerable Multi-Agent Deep Research for Enterprise Analytics
Paper • 2510.17797 • Published • 11
-
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
Paper • 2509.02547 • Published • 238 -
Tongyi DeepResearch Technical Report
Paper • 2510.24701 • Published • 103 -
PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-3b-em-ppo-v0.3
3B • Updated • 956 -
PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-3b-em-grpo-v0.3
3B • Updated • 321 • 1
-
Tongyi DeepResearch Technical Report
Paper • 2510.24701 • Published • 103 -
Kimi Linear: An Expressive, Efficient Attention Architecture
Paper • 2510.26692 • Published • 132 -
Natural-Language Agent Harnesses
Paper • 2603.25723 • Published • 25 -
CORAL: Towards Autonomous Multi-Agent Evolution for Open-Ended Discovery
Paper • 2604.01658 • Published • 54
-
RL makes MLLMs see better than SFT
Paper • 2510.16333 • Published • 49 -
Uniworld-V2: Reinforce Image Editing with Diffusion Negative-aware Finetuning and MLLM Implicit Feedback
Paper • 2510.16888 • Published • 22 -
Reasoning with Sampling: Your Base Model is Smarter Than You Think
Paper • 2510.14901 • Published • 48 -
Sample By Step, Optimize By Chunk: Chunk-Level GRPO For Text-to-Image Generation
Paper • 2510.21583 • Published • 31
-
SFR-DeepResearch: Towards Effective Reinforcement Learning for Autonomously Reasoning Single Agents
Paper • 2509.06283 • Published • 17 -
Alibaba-NLP/Tongyi-DeepResearch-30B-A3B
Text Generation • 31B • Updated • 18k • 809 -
DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents
Paper • 2506.11763 • Published • 74 -
Open Data Synthesis For Deep Research
Paper • 2509.00375 • Published • 72
-
AGENTIF: Benchmarking Instruction Following of Large Language Models in Agentic Scenarios
Paper • 2505.16944 • Published • 8 -
DeepResearchGym: A Free, Transparent, and Reproducible Evaluation Sandbox for Deep Research
Paper • 2505.19253 • Published • 34 -
The Era of Agentic Organization: Learning to Organize with Language Models
Paper • 2510.26658 • Published • 29 -
Tongyi DeepResearch Technical Report
Paper • 2510.24701 • Published • 103
-
AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning
Paper • 2402.15506 • Published • 18 -
AutoWebGLM: Bootstrap And Reinforce A Large Language Model-based Web Navigating Agent
Paper • 2404.03648 • Published • 29 -
Similarity is Not All You Need: Endowing Retrieval Augmented Generation with Multi Layered Thoughts
Paper • 2405.19893 • Published • 34 -
Parrot: Efficient Serving of LLM-based Applications with Semantic Variable
Paper • 2405.19888 • Published • 7