- SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models
  Paper • 2506.04180 • Published • 34
- AniMaker: Automated Multi-Agent Animated Storytelling with MCTS-Driven Clip Generation
  Paper • 2506.10540 • Published • 37
- AutoMind: Adaptive Knowledgeable Agent for Automated Data Science
  Paper • 2506.10974 • Published • 19
- SPAR: Scholar Paper Retrieval with LLM-based Agents for Enhanced Academic Search
  Paper • 2507.15245 • Published • 11

Collections
Collections including paper arxiv:2604.06392
- ExGRPO: Learning to Reason from Experience
  Paper • 2510.02245 • Published • 83
- A Practitioner's Guide to Multi-turn Agentic Reinforcement Learning
  Paper • 2510.01132 • Published • 6
- Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models
  Paper • 2510.04618 • Published • 131
- MixReasoning: Switching Modes to Think
  Paper • 2510.06052 • Published • 23

- unsloth/NVIDIA-Nemotron-3-Nano-4B-GGUF
  Text Generation • 4B • Updated • 21.7k • 54
- On Token's Dilemma: Dynamic MoE with Drift-Aware Token Assignment for Continual Learning of Large Vision Language Models
  Paper • 2603.27481 • Published • 35
- Emergent Social Intelligence Risks in Generative Multi-Agent Systems
  Paper • 2603.27771 • Published • 52
- Qualixar OS: A Universal Operating System for AI Agent Orchestration
  Paper • 2604.06392 • Published • 16

- Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward
  Paper • 2510.03222 • Published • 76
- In-the-Flow Agentic System Optimization for Effective Planning and Tool Use
  Paper • 2510.05592 • Published • 110
- Less is More: Recursive Reasoning with Tiny Networks
  Paper • 2510.04871 • Published • 513
- Multi-Agent Tool-Integrated Policy Optimization
  Paper • 2510.04678 • Published • 31