Collections
Discover the best community collections!
Collections including paper arxiv:2604.27351
-
ClawEnvKit: Automatic Environment Generation for Claw-Like Agents
Paper • 2604.18543 • Published • 27 -
Near-Future Policy Optimization
Paper • 2604.20733 • Published • 74 -
Co-Evolving LLM Decision and Skill Bank Agents for Long-Horizon Tasks
Paper • 2604.20987 • Published • 21 -
PATRA: Pattern-Aware Alignment and Balanced Reasoning for Time Series Question Answering
Paper • 2602.23161 • Published
-
Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward
Paper • 2510.03222 • Published • 76 -
In-the-Flow Agentic System Optimization for Effective Planning and Tool Use
Paper • 2510.05592 • Published • 111 -
Less is More: Recursive Reasoning with Tiny Networks
Paper • 2510.04871 • Published • 514 -
Multi-Agent Tool-Integrated Policy Optimization
Paper • 2510.04678 • Published • 31
-
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
Paper • 2508.06471 • Published • 211 -
Cognitive Kernel-Pro: A Framework for Deep Research Agents and Agent Foundation Models Training
Paper • 2508.00414 • Published • 96 -
Continuous Autoregressive Language Models
Paper • 2510.27688 • Published • 74 -
MiMo-Embodied: X-Embodied Foundation Model Technical Report
Paper • 2511.16518 • Published • 26
-
UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors
Paper • 2605.00658 • Published • 79 -
Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling
Paper • 2604.28185 • Published • 86 -
Representation Fréchet Loss for Visual Generation
Paper • 2604.28190 • Published • 28 -
Co-Evolving Policy Distillation
Paper • 2604.27083 • Published • 61
-
ApsaraStackMaaS/EvoQwen2.5-VL-Retriever-7B-v1
Visual Document Retrieval • 8B • Updated • 67 • 17 -
ApsaraStackMaaS/EvoQwen2.5-VL-Retriever-3B-v1
Visual Document Retrieval • 4B • Updated • 204 • 12 -
LiquidAI/LFM2-8B-A1B
Text Generation • 8B • Updated • 114k • 352 -
PaddlePaddle/PaddleOCR-VL
Image-Text-to-Text • 1.0B • Updated • 10k • 1.6k
-
Agent Learning via Early Experience
Paper • 2510.08558 • Published • 276 -
Learning on the Job: An Experience-Driven Self-Evolving Agent for Long-Horizon Tasks
Paper • 2510.08002 • Published • 24 -
Self-Improving LLM Agents at Test-Time
Paper • 2510.07841 • Published • 10 -
The Denario project: Deep knowledge AI agents for scientific discovery
Paper • 2510.26887 • Published • 8
-
SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models
Paper • 2506.04180 • Published • 34 -
AniMaker: Automated Multi-Agent Animated Storytelling with MCTS-Driven Clip Generation
Paper • 2506.10540 • Published • 37 -
AutoMind: Adaptive Knowledgeable Agent for Automated Data Science
Paper • 2506.10974 • Published • 19 -
SPAR: Scholar Paper Retrieval with LLM-based Agents for Enhanced Academic Search
Paper • 2507.15245 • Published • 11
-
UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors
Paper • 2605.00658 • Published • 79 -
Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling
Paper • 2604.28185 • Published • 86 -
Representation Fréchet Loss for Visual Generation
Paper • 2604.28190 • Published • 28 -
Co-Evolving Policy Distillation
Paper • 2604.27083 • Published • 61
-
ClawEnvKit: Automatic Environment Generation for Claw-Like Agents
Paper • 2604.18543 • Published • 27 -
Near-Future Policy Optimization
Paper • 2604.20733 • Published • 74 -
Co-Evolving LLM Decision and Skill Bank Agents for Long-Horizon Tasks
Paper • 2604.20987 • Published • 21 -
PATRA: Pattern-Aware Alignment and Balanced Reasoning for Time Series Question Answering
Paper • 2602.23161 • Published
-
ApsaraStackMaaS/EvoQwen2.5-VL-Retriever-7B-v1
Visual Document Retrieval • 8B • Updated • 67 • 17 -
ApsaraStackMaaS/EvoQwen2.5-VL-Retriever-3B-v1
Visual Document Retrieval • 4B • Updated • 204 • 12 -
LiquidAI/LFM2-8B-A1B
Text Generation • 8B • Updated • 114k • 352 -
PaddlePaddle/PaddleOCR-VL
Image-Text-to-Text • 1.0B • Updated • 10k • 1.6k
-
Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward
Paper • 2510.03222 • Published • 76 -
In-the-Flow Agentic System Optimization for Effective Planning and Tool Use
Paper • 2510.05592 • Published • 111 -
Less is More: Recursive Reasoning with Tiny Networks
Paper • 2510.04871 • Published • 514 -
Multi-Agent Tool-Integrated Policy Optimization
Paper • 2510.04678 • Published • 31
-
Agent Learning via Early Experience
Paper • 2510.08558 • Published • 276 -
Learning on the Job: An Experience-Driven Self-Evolving Agent for Long-Horizon Tasks
Paper • 2510.08002 • Published • 24 -
Self-Improving LLM Agents at Test-Time
Paper • 2510.07841 • Published • 10 -
The Denario project: Deep knowledge AI agents for scientific discovery
Paper • 2510.26887 • Published • 8
-
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
Paper • 2508.06471 • Published • 211 -
Cognitive Kernel-Pro: A Framework for Deep Research Agents and Agent Foundation Models Training
Paper • 2508.00414 • Published • 96 -
Continuous Autoregressive Language Models
Paper • 2510.27688 • Published • 74 -
MiMo-Embodied: X-Embodied Foundation Model Technical Report
Paper • 2511.16518 • Published • 26
-
SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models
Paper • 2506.04180 • Published • 34 -
AniMaker: Automated Multi-Agent Animated Storytelling with MCTS-Driven Clip Generation
Paper • 2506.10540 • Published • 37 -
AutoMind: Adaptive Knowledgeable Agent for Automated Data Science
Paper • 2506.10974 • Published • 19 -
SPAR: Scholar Paper Retrieval with LLM-based Agents for Enhanced Academic Search
Paper • 2507.15245 • Published • 11