-
Attention Is All You Need
Paper • 1706.03762 • Published • 120 -
Scaling Laws for Neural Language Models
Paper • 2001.08361 • Published • 10 -
Training Compute-Optimal Large Language Models
Paper • 2203.15556 • Published • 11 -
Analogy Generation by Prompting Large Language Models: A Case Study of InstructGPT
Paper • 2210.04186 • Published
Collections
Discover the best community collections!
Collections including paper arxiv:2601.06943
-
PR Puppet Sora
👁661Generate AI videos from text prompts
-
An Atlas of Color-selected Quiescent Galaxies at z>3 in Public JWST Fields
Paper • 2302.10936 • Published -
The Assistant Axis: Situating and Stabilizing the Default Persona of Language Models
Paper • 2601.10387 • Published • 15 -
mHC: Manifold-Constrained Hyper-Connections
Paper • 2512.24880 • Published • 322
-
Gemini Robotics: Bringing AI into the Physical World
Paper • 2503.20020 • Published • 31 -
Magma: A Foundation Model for Multimodal AI Agents
Paper • 2502.13130 • Published • 58 -
LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents
Paper • 2311.05437 • Published • 51 -
OS-ATLAS: A Foundation Action Model for Generalist GUI Agents
Paper • 2410.23218 • Published • 49
-
Attention Is All You Need
Paper • 1706.03762 • Published • 120 -
Scaling Laws for Neural Language Models
Paper • 2001.08361 • Published • 10 -
Training Compute-Optimal Large Language Models
Paper • 2203.15556 • Published • 11 -
Analogy Generation by Prompting Large Language Models: A Case Study of InstructGPT
Paper • 2210.04186 • Published
-
Gemini Robotics: Bringing AI into the Physical World
Paper • 2503.20020 • Published • 31 -
Magma: A Foundation Model for Multimodal AI Agents
Paper • 2502.13130 • Published • 58 -
LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents
Paper • 2311.05437 • Published • 51 -
OS-ATLAS: A Foundation Action Model for Generalist GUI Agents
Paper • 2410.23218 • Published • 49
-
PR Puppet Sora
👁661Generate AI videos from text prompts
-
An Atlas of Color-selected Quiescent Galaxies at z>3 in Public JWST Fields
Paper • 2302.10936 • Published -
The Assistant Axis: Situating and Stabilizing the Default Persona of Language Models
Paper • 2601.10387 • Published • 15 -
mHC: Manifold-Constrained Hyper-Connections
Paper • 2512.24880 • Published • 322