Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2603.17187

Good agents related space, model, dataset

Good agents related space, model, dataset collection

zai-org/GLM-4.5

Text Generation • 358B • Updated Aug 11, 2025 • 70.2k • • 1.4k
Running

Agents

31

GLM 4.5V Demo App

🏃

31

Demo App of dmg file
nvidia/Cosmos-Reason1-7B

Image-Text-to-Text • Updated Dec 10, 2025 • 73.6k • 240
Running

MCP

Featured

162

Web Search MCP

🔎

162

Search and extract web content for LLM ingestion

Exploratory Memory-Augmented LLM Agent via Hybrid On- and Off-Policy Optimization

Paper • 2602.23008 • Published Feb 26 • 37
SELAUR: Self Evolving LLM Agent via Uncertainty-aware Rewards

Paper • 2602.21158 • Published Feb 24 • 1
MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Paper • 2603.17187 • Published Mar 17 • 138
SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

Paper • 2602.08234 • Published Feb 9 • 74

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Paper • 2603.17187 • Published Mar 17 • 138
Your Agent, Their Asset: A Real-World Safety Analysis of OpenClaw

Paper • 2604.04759 • Published 14 days ago • 22

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Paper • 2603.17187 • Published Mar 17 • 138

OpenClaw-RL: Train Any Agent Simply by Talking

Paper • 2603.10165 • Published Mar 10 • 151
MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Paper • 2603.17187 • Published Mar 17 • 138

Read Later 📚

Interesting papers on AI, LLMs, etc. to add to reading list

Monitored Markov Decision Processes

Paper • 2402.06819 • Published Feb 9, 2024
Generalization in Monitored Markov Decision Processes (Mon-MDPs)

Paper • 2505.08988 • Published May 13, 2025
Bayesian Risk Markov Decision Processes

Paper • 2106.02558 • Published Jun 4, 2021
Sotopia-RL: Reward Design for Social Intelligence

Paper • 2508.03905 • Published Aug 5, 2025 • 23

Interesting Papers

ReMix: Reinforcement routing for mixtures of LoRAs in LLM finetuning

Paper • 2603.10160 • Published Mar 10 • 26
Video Streaming Thinking: VideoLLMs Can Watch and Think Simultaneously

Paper • 2603.12262 • Published Mar 12 • 31
EnterpriseOps-Gym: Environments and Evaluations for Stateful Agentic Planning and Tool Use in Enterprise Settings

Paper • 2603.13594 • Published Mar 13 • 148
MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Paper • 2603.17187 • Published Mar 17 • 138

about 1 month ago

FinToolBench: Evaluating LLM Agents for Real-World Financial Tool Use

Paper • 2603.08262 • Published Mar 9 • 42
On-Policy Context Distillation for Language Models

Paper • 2602.12275 • Published Feb 12 • 3
Online Experiential Learning for Language Models

Paper • 2603.16856 • Published Mar 17 • 58
Mixture-of-Depths Attention

Paper • 2603.15619 • Published Mar 16 • 80

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Paper • 2603.17187 • Published Mar 17 • 138
Attention Residuals

Paper • 2603.15031 • Published Mar 16 • 180
MOSS-TTS Technical Report

Paper • 2603.18090 • Published Mar 18 • 12
MSA: Memory Sparse Attention for Efficient End-to-End Memory Model Scaling to 100M Tokens

Paper • 2603.23516 • Published Mar 6 • 48

GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning

Paper • 2602.12099 • Published Feb 12 • 61
When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning

Paper • 2602.10560 • Published Feb 11 • 31
G-LNS: Generative Large Neighborhood Search for LLM-Based Automatic Heuristic Design

Paper • 2602.08253 • Published Feb 9 • 26
ROCKET: Rapid Optimization via Calibration-guided Knapsack Enhanced Truncation for Efficient Model Compression

Paper • 2602.11008 • Published Feb 11 • 18

Good agents related space, model, dataset

Good agents related space, model, dataset collection

zai-org/GLM-4.5

Text Generation • 358B • Updated Aug 11, 2025 • 70.2k • • 1.4k
Running

Agents

31

GLM 4.5V Demo App

🏃

31

Demo App of dmg file
nvidia/Cosmos-Reason1-7B

Image-Text-to-Text • Updated Dec 10, 2025 • 73.6k • 240
Running

MCP

Featured

162

Web Search MCP

🔎

162

Search and extract web content for LLM ingestion

Read Later 📚

Interesting papers on AI, LLMs, etc. to add to reading list

Monitored Markov Decision Processes

Paper • 2402.06819 • Published Feb 9, 2024
Generalization in Monitored Markov Decision Processes (Mon-MDPs)

Paper • 2505.08988 • Published May 13, 2025
Bayesian Risk Markov Decision Processes

Paper • 2106.02558 • Published Jun 4, 2021
Sotopia-RL: Reward Design for Social Intelligence

Paper • 2508.03905 • Published Aug 5, 2025 • 23

Exploratory Memory-Augmented LLM Agent via Hybrid On- and Off-Policy Optimization

Paper • 2602.23008 • Published Feb 26 • 37
SELAUR: Self Evolving LLM Agent via Uncertainty-aware Rewards

Paper • 2602.21158 • Published Feb 24 • 1
MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Paper • 2603.17187 • Published Mar 17 • 138
SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

Paper • 2602.08234 • Published Feb 9 • 74

Interesting Papers

ReMix: Reinforcement routing for mixtures of LoRAs in LLM finetuning

Paper • 2603.10160 • Published Mar 10 • 26
Video Streaming Thinking: VideoLLMs Can Watch and Think Simultaneously

Paper • 2603.12262 • Published Mar 12 • 31
EnterpriseOps-Gym: Environments and Evaluations for Stateful Agentic Planning and Tool Use in Enterprise Settings

Paper • 2603.13594 • Published Mar 13 • 148
MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Paper • 2603.17187 • Published Mar 17 • 138

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Paper • 2603.17187 • Published Mar 17 • 138
Your Agent, Their Asset: A Real-World Safety Analysis of OpenClaw

Paper • 2604.04759 • Published 14 days ago • 22

about 1 month ago

FinToolBench: Evaluating LLM Agents for Real-World Financial Tool Use

Paper • 2603.08262 • Published Mar 9 • 42
On-Policy Context Distillation for Language Models

Paper • 2602.12275 • Published Feb 12 • 3
Online Experiential Learning for Language Models

Paper • 2603.16856 • Published Mar 17 • 58
Mixture-of-Depths Attention

Paper • 2603.15619 • Published Mar 16 • 80

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Paper • 2603.17187 • Published Mar 17 • 138

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Paper • 2603.17187 • Published Mar 17 • 138
Attention Residuals

Paper • 2603.15031 • Published Mar 16 • 180
MOSS-TTS Technical Report

Paper • 2603.18090 • Published Mar 18 • 12
MSA: Memory Sparse Attention for Efficient End-to-End Memory Model Scaling to 100M Tokens

Paper • 2603.23516 • Published Mar 6 • 48

OpenClaw-RL: Train Any Agent Simply by Talking

Paper • 2603.10165 • Published Mar 10 • 151
MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Paper • 2603.17187 • Published Mar 17 • 138

GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning

Paper • 2602.12099 • Published Feb 12 • 61
When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning

Paper • 2602.10560 • Published Feb 11 • 31
G-LNS: Generative Large Neighborhood Search for LLM-Based Automatic Heuristic Design

Paper • 2602.08253 • Published Feb 9 • 26
ROCKET: Rapid Optimization via Calibration-guided Knapsack Enhanced Truncation for Efficient Model Compression

Paper • 2602.11008 • Published Feb 11 • 18

Previous
1
2
Next

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs