Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2509.25911

AgentGym: Evolving Large Language Model-based Agents across Diverse Environments

Paper • 2406.04151 • Published Jun 6, 2024 • 24
DeepAnalyze: Agentic Large Language Models for Autonomous Data Science

Paper • 2510.16872 • Published Oct 19, 2025 • 112
Scaling Generalist Data-Analytic Agents

Paper • 2509.25084 • Published Sep 29, 2025 • 20
Scaling Agents via Continual Pre-training

Paper • 2509.13310 • Published Sep 16, 2025 • 117

Open Data Synthesis For Deep Research

Paper • 2509.00375 • Published Aug 30, 2025 • 72
Beyond Correctness: Harmonizing Process and Outcome Rewards through RL Training

Paper • 2509.03403 • Published Sep 3, 2025 • 23
LMEnt: A Suite for Analyzing Knowledge in Language Models from Pretraining Data to Representations

Paper • 2509.03405 • Published Sep 3, 2025 • 24
SATQuest: A Verifier for Logical Reasoning Evaluation and Reinforcement Fine-Tuning of LLMs

Paper • 2509.00930 • Published Aug 31, 2025 • 5

Beyond Chain-of-Thought: A Survey of Chain-of-X Paradigms for LLMs

Paper • 2404.15676 • Published Apr 24, 2024
How faithful are RAG models? Quantifying the tug-of-war between RAG and LLMs' internal prior

Paper • 2404.10198 • Published Apr 16, 2024 • 8
RAFT: Adapting Language Model to Domain Specific RAG

Paper • 2403.10131 • Published Mar 15, 2024 • 72
FaaF: Facts as a Function for the evaluation of RAG systems

Paper • 2403.03888 • Published Mar 6, 2024

Mem-α: Learning Memory Construction via Reinforcement Learning

Paper • 2509.25911 • Published Sep 30, 2025 • 15
Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions

Paper • 2507.05257 • Published Jul 7, 2025 • 15
M+: Extending MemoryLLM with Scalable Long-Term Memory

Paper • 2502.00592 • Published Feb 1, 2025 • 2

Representation & Optimization

Understanding about representation sheds light on optimization

Nuclear Norm Regularization for Deep Learning

Paper • 2405.14544 • Published May 23, 2024 • 1
Token embeddings violate the manifold hypothesis

Paper • 2504.01002 • Published Apr 1, 2025 • 1
Approximate Nullspace Augmented Finetuning for Robust Vision Transformers

Paper • 2403.10476 • Published Mar 15, 2024 • 1
ElaLoRA: Elastic & Learnable Low-Rank Adaptation for Efficient Model Fine-Tuning

Paper • 2504.00254 • Published Mar 31, 2025 • 1

AgentGym: Evolving Large Language Model-based Agents across Diverse Environments

Paper • 2406.04151 • Published Jun 6, 2024 • 24
DeepAnalyze: Agentic Large Language Models for Autonomous Data Science

Paper • 2510.16872 • Published Oct 19, 2025 • 112
Scaling Generalist Data-Analytic Agents

Paper • 2509.25084 • Published Sep 29, 2025 • 20
Scaling Agents via Continual Pre-training

Paper • 2509.13310 • Published Sep 16, 2025 • 117

Mem-α: Learning Memory Construction via Reinforcement Learning

Paper • 2509.25911 • Published Sep 30, 2025 • 15
Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions

Paper • 2507.05257 • Published Jul 7, 2025 • 15
M+: Extending MemoryLLM with Scalable Long-Term Memory

Paper • 2502.00592 • Published Feb 1, 2025 • 2

Open Data Synthesis For Deep Research

Paper • 2509.00375 • Published Aug 30, 2025 • 72
Beyond Correctness: Harmonizing Process and Outcome Rewards through RL Training

Paper • 2509.03403 • Published Sep 3, 2025 • 23
LMEnt: A Suite for Analyzing Knowledge in Language Models from Pretraining Data to Representations

Paper • 2509.03405 • Published Sep 3, 2025 • 24
SATQuest: A Verifier for Logical Reasoning Evaluation and Reinforcement Fine-Tuning of LLMs

Paper • 2509.00930 • Published Aug 31, 2025 • 5

Representation & Optimization

Understanding about representation sheds light on optimization

Nuclear Norm Regularization for Deep Learning

Paper • 2405.14544 • Published May 23, 2024 • 1
Token embeddings violate the manifold hypothesis

Paper • 2504.01002 • Published Apr 1, 2025 • 1
Approximate Nullspace Augmented Finetuning for Robust Vision Transformers

Paper • 2403.10476 • Published Mar 15, 2024 • 1
ElaLoRA: Elastic & Learnable Low-Rank Adaptation for Efficient Model Fine-Tuning

Paper • 2504.00254 • Published Mar 31, 2025 • 1

Beyond Chain-of-Thought: A Survey of Chain-of-X Paradigms for LLMs

Paper • 2404.15676 • Published Apr 24, 2024
How faithful are RAG models? Quantifying the tug-of-war between RAG and LLMs' internal prior

Paper • 2404.10198 • Published Apr 16, 2024 • 8
RAFT: Adapting Language Model to Domain Specific RAG

Paper • 2403.10131 • Published Mar 15, 2024 • 72
FaaF: Facts as a Function for the evaluation of RAG systems

Paper • 2403.03888 • Published Mar 6, 2024

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs