-
OpenClaw-RL: Train Any Agent Simply by Talking
Paper • 2603.10165 • Published • 151 -
Neural Thickets: Diverse Task Experts Are Dense Around Pretrained Weights
Paper • 2603.12228 • Published • 12 -
Efficient Memory Management for Large Language Model Serving with PagedAttention
Paper • 2309.06180 • Published • 53 -
1-bit AI Infra: Part 1.1, Fast and Lossless BitNet b1.58 Inference on CPUs
Paper • 2410.16144 • Published • 5
Collections
Collections including paper arxiv:2010.11929
-
Transporter Networks: Rearranging the Visual World for Robotic Manipulation
Paper • 2010.14406 • Published -
Learning Transferable Visual Models From Natural Language Supervision
Paper • 2103.00020 • Published • 21 -
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Paper • 2010.11929 • Published • 15
-
MIO: A Foundation Model on Multimodal Tokens
Paper • 2409.17692 • Published • 53 -
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Paper • 2010.11929 • Published • 15 -
Going deeper with Image Transformers
Paper • 2103.17239 • Published -
Training data-efficient image transformers & distillation through attention
Paper • 2012.12877 • Published • 2
-
VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training
Paper • 2602.10693 • Published • 220 -
Reinforced Attention Learning
Paper • 2602.04884 • Published • 29 -
Learning to Reason in 13 Parameters
Paper • 2602.04118 • Published • 6 -
LoRA-XS: Low-Rank Adaptation with Extremely Small Number of Parameters
Paper • 2405.17604 • Published • 3
-
Neural Machine Translation by Jointly Learning to Align and Translate
Paper • 1409.0473 • Published • 7 -
Attention Is All You Need
Paper • 1706.03762 • Published • 120 -
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Paper • 1810.04805 • Published • 26 -
Hierarchical Reasoning Model
Paper • 2506.21734 • Published • 50
-
MemOS: A Memory OS for AI System
Paper • 2507.03724 • Published • 166 -
How to Train Your LLM Web Agent: A Statistical Diagnosis
Paper • 2507.04103 • Published • 52 -
NeuralOS: Towards Simulating Operating Systems via Neural Generative Models
Paper • 2507.08800 • Published • 81 -
AnyCoder
🏆 3.21k • Generate code snippets with AI
-
Multispectral Vineyard Segmentation: A Deep Learning approach
Paper • 2108.01200 • Published -
PLLaMa: An Open-source Large Language Model for Plant Science
Paper • 2401.01600 • Published • 3 -
yusuf802/Leaf-Disease-Predictor
Image Classification • Updated • 1 -
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Paper • 2010.11929 • Published • 15