Collections
Collections including paper arxiv:2603.18743
-
FinToolBench: Evaluating LLM Agents for Real-World Financial Tool Use
Paper • 2603.08262 • Published • 42 -
On-Policy Context Distillation for Language Models
Paper • 2602.12275 • Published • 3 -
Online Experiential Learning for Language Models
Paper • 2603.16856 • Published • 58 -
Mixture-of-Depths Attention
Paper • 2603.15619 • Published • 80
-
GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning
Paper • 2602.12099 • Published • 61 -
When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning
Paper • 2602.10560 • Published • 31 -
G-LNS: Generative Large Neighborhood Search for LLM-Based Automatic Heuristic Design
Paper • 2602.08253 • Published • 26 -
ROCKET: Rapid Optimization via Calibration-guided Knapsack Enhanced Truncation for Efficient Model Compression
Paper • 2602.11008 • Published • 18
-
Self-Supervised Prompt Optimization
Paper • 2502.06855 • Published • 18 -
Context Learning for Multi-Agent Discussion
Paper • 2602.02350 • Published • 4 -
XSkill: Continual Learning from Experience and Skills in Multimodal Agents
Paper • 2603.12056 • Published • 33 -
Online Experiential Learning for Language Models
Paper • 2603.16856 • Published • 58
-
BitNet: Scaling 1-bit Transformers for Large Language Models
Paper • 2310.11453 • Published • 107 -
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection
Paper • 2310.11511 • Published • 80 -
In-Context Learning Creates Task Vectors
Paper • 2310.15916 • Published • 43 -
Matryoshka Diffusion Models
Paper • 2310.15111 • Published • 45
-
Hyperagents
Paper • 2603.19461 • Published • 50 -
Internal Safety Collapse in Frontier Large Language Models
Paper • 2603.23509 • Published • 31 -
Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory
Paper • 2504.19413 • Published • 52 -
Memento-Skills: Let Agents Design Agents
Paper • 2603.18743 • Published • 57
-
dLLM: Simple Diffusion Language Modeling
Paper • 2602.22661 • Published • 152 -
OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training Data
Paper • 2603.15594 • Published • 149 -
Qianfan-OCR: A Unified End-to-End Model for Document Intelligence
Paper • 2603.13398 • Published • 153 -
Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders
Paper • 2603.06569 • Published • 119
-
XSkill: Continual Learning from Experience and Skills in Multimodal Agents
Paper • 2603.12056 • Published • 33 -
Memento-Skills: Let Agents Design Agents
Paper • 2603.18743 • Published • 57 -
SWE-Skills-Bench: Do Agent Skills Actually Help in Real-World Software Engineering?
Paper • 2603.15401 • Published • 19 -
Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills
Paper • 2603.25158 • Published • 50
-
More Agents Is All You Need
Paper • 2402.05120 • Published • 57 -
UFO: A UI-Focused Agent for Windows OS Interaction
Paper • 2402.07939 • Published • 17 -
AgentGym: Evolving Large Language Model-based Agents across Diverse Environments
Paper • 2406.04151 • Published • 24 -
AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM Agents
Paper • 2407.04363 • Published • 34