allthingsdisaggregated's picture

allthingsdisaggregated

lastweek

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 23 days ago

Attention Residuals

upvoted a paper 24 days ago

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

upvoted a paper 24 days ago

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

View all activity

Organizations

None yet

upvoted a paper 23 days ago

Attention Residuals

Paper • 2603.15031 • Published 29 days ago • 179

upvoted 3 papers 24 days ago

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Paper • 2603.17187 • Published 27 days ago • 138

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Paper • 2603.19220 • Published 25 days ago • 66

Memento-Skills: Let Agents Design Agents

Paper • 2603.18743 • Published 26 days ago • 56

upvoted 3 papers about 1 month ago

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 519

Dynamic Long Context Reasoning over Compressed Memory via End-to-End Reinforcement Learning

Paper • 2602.08382 • Published Feb 9 • 11

The Trinity of Consistency as a Defining Principle for General World Models

Paper • 2602.23152 • Published Feb 26 • 201

upvoted a collection 4 months ago

Olmo 3

Artifacts for the Olmo 3 release. • 7 items • Updated Mar 2 • 168

upvoted a paper 7 months ago

Qwen3-Omni Technical Report

Paper • 2509.17765 • Published Sep 22, 2025 • 153

upvoted a paper 8 months ago

Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference

Paper • 2508.02193 • Published Aug 4, 2025 • 138

upvoted 10 papers 10 months ago

Inference-Time Hyper-Scaling with KV Cache Compression

Paper • 2506.05345 • Published Jun 5, 2025 • 30

Cosmos World Foundation Model Platform for Physical AI

Paper • 2501.03575 • Published Jan 7, 2025 • 82

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published Feb 16, 2025 • 170

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19, 2025 • 217

Transformers without Normalization

Paper • 2503.10622 • Published Mar 13, 2025 • 172

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published Jan 22, 2025 • 128

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22, 2025 • 447

Token-Efficient Long Video Understanding for Multimodal LLMs

Paper • 2503.04130 • Published Mar 6, 2025 • 96

Kimi-VL Technical Report

Paper • 2504.07491 • Published Apr 10, 2025 • 139

Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models

Paper • 2505.04921 • Published May 8, 2025 • 187