Collections

Collections including paper arxiv:2604.27660

about 18 hours ago

Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

Paper • 2411.03562 • Published Nov 5, 2024 • 69
Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning

Paper • 2502.06060 • Published Feb 9, 2025 • 38
MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published Feb 20, 2025 • 195
SurveyX: Academic Survey Automation via Large Language Models

Paper • 2502.14776 • Published Feb 20, 2025 • 100

about 19 hours ago

MultiWorld: Scalable Multi-Agent Multi-View Video World Models

Paper • 2604.18564 • Published 17 days ago • 45
From Context to Skills: Can Language Models Learn from Context Skillfully?

Paper • 2604.27660 • Published 4 days ago • 136
MolmoAct2: Action Reasoning Models for Real-world Deployment

Paper • 2605.02881 • Published 3 days ago • 198

about 19 hours ago

Natural-Language Agent Harnesses

Paper • 2603.25723 • Published Mar 26 • 25
From Reasoning to Agentic: Credit Assignment in Reinforcement Learning for Large Language Models

Paper • 2604.09459 • Published 24 days ago • 13
From Context to Skills: Can Language Models Learn from Context Skillfully?

Paper • 2604.27660 • Published 4 days ago • 136

Self Supervision

about 10 hours ago

Self-Supervised Prompt Optimization

Paper • 2502.06855 • Published Feb 7, 2025 • 18
Context Learning for Multi-Agent Discussion

Paper • 2602.02350 • Published Feb 2 • 4
XSkill: Continual Learning from Experience and Skills in Multimodal Agents

Paper • 2603.12056 • Published Mar 12 • 33
Online Experiential Learning for Language Models

Paper • 2603.16856 • Published Mar 17 • 59

about 20 hours ago

From Context to Skills: Can Language Models Learn from Context Skillfully?

Paper • 2604.27660 • Published 4 days ago • 136

about 5 hours ago

ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling

Paper • 2603.25746 • Published Mar 26 • 155
TAPS: Task Aware Proposal Distributions for Speculative Sampling

Paper • 2603.27027 • Published Mar 27 • 143
Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models

Paper • 2603.25716 • Published Mar 26 • 156
LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Paper • 2603.27538 • Published Mar 29 • 145

XSkill: Continual Learning from Experience and Skills in Multimodal Agents

Paper • 2603.12056 • Published Mar 12 • 33
Memento-Skills: Let Agents Design Agents

Paper • 2603.18743 • Published Mar 19 • 58
SWE-Skills-Bench: Do Agent Skills Actually Help in Real-World Software Engineering?

Paper • 2603.15401 • Published Mar 16 • 19
Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills

Paper • 2603.25158 • Published Mar 26 • 52

about 8 hours ago

lusxvr/nanoVLM-222M

Image-Text-to-Text • 0.2B • Updated May 8, 2025 • 403 • 99
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning

Paper • 2503.09516 • Published Mar 12, 2025 • 39
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time

Paper • 2505.24863 • Published May 30, 2025 • 97
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

Paper • 2505.17667 • Published May 23, 2025 • 88
