-
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
Paper • 2411.03562 • Published • 69 -
Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning
Paper • 2502.06060 • Published • 38 -
MLGym: A New Framework and Benchmark for Advancing AI Research Agents
Paper • 2502.14499 • Published • 195 -
SurveyX: Academic Survey Automation via Large Language Models
Paper • 2502.14776 • Published • 100
Collections
Discover the best community collections!
Collections including paper arxiv:2604.24026
-
From Skill Text to Skill Structure: The Scheduling-Structural-Logical Representation for Agent Skills
Paper • 2604.24026 • Published • 18 -
From Skills to Talent: Organising Heterogeneous Agents as a Real-World Company
Paper • 2604.22446 • Published • 120 -
The Last Harness You'll Ever Build
Paper • 2604.21003 • Published • 3
-
SemShareKV: Efficient KVCache Sharing for Semantically Similar Prompts via Token-Level LSH Matching
Paper • 2509.24832 • Published -
UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors
Paper • 2605.00658 • Published • 79 -
Map2World: Segment Map Conditioned Text to 3D World Generation
Paper • 2605.00781 • Published • 24 -
From Skill Text to Skill Structure: The Scheduling-Structural-Logical Representation for Agent Skills
Paper • 2604.24026 • Published • 18
-
Contrastive Decoding Improves Reasoning in Large Language Models
Paper • 2309.09117 • Published • 40 -
Prometheus: Inducing Fine-grained Evaluation Capability in Language Models
Paper • 2310.08491 • Published • 57 -
Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding
Paper • 2411.04282 • Published • 37 -
Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models
Paper • 2411.14432 • Published • 25
-
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
Paper • 2411.03562 • Published • 69 -
Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning
Paper • 2502.06060 • Published • 38 -
MLGym: A New Framework and Benchmark for Advancing AI Research Agents
Paper • 2502.14499 • Published • 195 -
SurveyX: Academic Survey Automation via Large Language Models
Paper • 2502.14776 • Published • 100
-
From Skill Text to Skill Structure: The Scheduling-Structural-Logical Representation for Agent Skills
Paper • 2604.24026 • Published • 18 -
From Skills to Talent: Organising Heterogeneous Agents as a Real-World Company
Paper • 2604.22446 • Published • 120 -
The Last Harness You'll Ever Build
Paper • 2604.21003 • Published • 3
-
SemShareKV: Efficient KVCache Sharing for Semantically Similar Prompts via Token-Level LSH Matching
Paper • 2509.24832 • Published -
UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors
Paper • 2605.00658 • Published • 79 -
Map2World: Segment Map Conditioned Text to 3D World Generation
Paper • 2605.00781 • Published • 24 -
From Skill Text to Skill Structure: The Scheduling-Structural-Logical Representation for Agent Skills
Paper • 2604.24026 • Published • 18
-
Contrastive Decoding Improves Reasoning in Large Language Models
Paper • 2309.09117 • Published • 40 -
Prometheus: Inducing Fine-grained Evaluation Capability in Language Models
Paper • 2310.08491 • Published • 57 -
Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding
Paper • 2411.04282 • Published • 37 -
Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models
Paper • 2411.14432 • Published • 25