Stanford University

university

Verified

https://www.stanford.edu/

Recent Activity

gist-sparse-attention authored a paper 3 days ago

Mem-α: Learning Memory Construction via Reinforcement Learning

gist-sparse-attention authored a paper 3 days ago

IceCache: Memory-efficient KV-cache Management for Long-Sequence LLMs

gist-sparse-attention submitted a paper 4 days ago

IceCache: Memory-efficient KV-cache Management for Long-Sequence LLMs

Papers

Zero-shot World Models Are Developmentally Efficient Learners

TRACE: Capability-Targeted Agentic Training

authored a paper 3 days ago

Zero-shot World Models Are Developmentally Efficient Learners

Paper • 2604.10333 • Published 8 days ago • 7

submitted a paper to Daily Papers 4 days ago

TRACE: Capability-Targeted Agentic Training

Paper • 2604.05336 • Published 12 days ago • 13

submitted a paper to Daily Papers 5 days ago

Multi-User Large Language Model Agents

Paper • 2604.08567 • Published about 1 month ago • 26

in StanfordUniversity/AbdCTBench 16 days ago

[bot] Conversion to Parquet

#1 opened 18 days ago by parquet-converter

updated a dataset 18 days ago

StanfordUniversity/AbdCTBench

Viewer • Updated 18 days ago • 23.5k • 28

published a dataset 27 days ago

StanfordUniversity/AbdCTBench

Viewer • Updated 18 days ago • 23.5k • 28

authored 3 papers about 1 month ago

Dynamics as Prompts: In-Context Learning for Sim-to-Real System Identifications

Paper • 2410.20357 • Published Oct 27, 2024

Creative Robot Tool Use with Large Language Models

Paper • 2310.13065 • Published Oct 19, 2023 • 10

Scaling Verification Can Be More Effective than Scaling Policy Learning for Vision-Language-Action Alignment

Paper • 2602.12281 • Published Feb 12 • 1

submitted a paper to Daily Papers about 2 months ago

SimToolReal: An Object-Centric Policy for Zero-Shot Dexterous Tool Manipulation

Paper • 2602.16863 • Published Feb 18 • 18

submitted a paper to Daily Papers 2 months ago

Large Language Model Reasoning Failures

Paper • 2602.06176 • Published Feb 5 • 12

submitted a paper to Daily Papers 3 months ago

Latent Adversarial Regularization for Offline Preference Optimization

Paper • 2601.22083 • Published Jan 29 • 14

authored a paper 3 months ago

Shaping capabilities with token-level data filtering

Paper • 2601.21571 • Published Jan 29 • 27

submitted a paper to Daily Papers 3 months ago

Learning to Discover at Test Time

Paper • 2601.16175 • Published Jan 22 • 44

zhaoyue-zephyrus submitted a paper to Daily Papers 4 months ago

Spherical Leech Quantization for Visual Tokenization and Generation

Paper • 2512.14697 • Published Dec 16, 2025 • 8

authored a paper 6 months ago

Guided Query Refinement: Multimodal Hybrid Retrieval with Test-Time Optimization

Paper • 2510.05038 • Published Oct 6, 2025 • 1

authored 3 papers 6 months ago

Personalized Preference Fine-tuning of Diffusion Models

Paper • 2501.06655 • Published Jan 11, 2025 • 1

Big-Math: A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models

Paper • 2502.17387 • Published Feb 24, 2025 • 7

RLAD: Training LLMs to Discover Abstractions for Solving Reasoning Problems

Paper • 2510.02263 • Published Oct 2, 2025 • 9

authored a paper 7 months ago

World Modeling with Probabilistic Structure Integration

Paper • 2509.09737 • Published Sep 10, 2025 • 14