nitishpandey04 's Collections Reading List
updated
DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning
Paper
• 2504.07128
• Published • 87
Byte Latent Transformer: Patches Scale Better Than Tokens
Paper
• 2412.09871
• Published • 108
BitNet b1.58 2B4T Technical Report
Paper
• 2504.12285
• Published • 85
FAST: Efficient Action Tokenization for Vision-Language-Action Models
Paper
• 2501.09747
• Published • 29
Towards Generalist Robot Policies: What Matters in Building
Vision-Language-Action Models
Paper
• 2412.14058
• Published • 1
π_0: A Vision-Language-Action Flow Model for General Robot Control
Paper
• 2410.24164
• Published • 31
Reflective Planning: Vision-Language Models for Multi-Stage Long-Horizon
Robotic Manipulation
Paper
• 2502.16707
• Published • 14
OpenVLA: An Open-Source Vision-Language-Action Model
Paper
• 2406.09246
• Published • 47
RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic
Control
Paper
• 2307.15818
• Published • 32
A Dual Process VLA: Efficient Robotic Manipulation Leveraging VLM
Paper
• 2410.15549
• Published
Open X-Embodiment: Robotic Learning Datasets and RT-X Models
Paper
• 2310.08864
• Published • 2
Scaling LLM Test-Time Compute Optimally can be More Effective than
Scaling Model Parameters
Paper
• 2408.03314
• Published • 63
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient
Robotics
Paper
• 2506.01844
• Published • 158
DistilBERT, a distilled version of BERT: smaller, faster, cheaper and
lighter
Paper
• 1910.01108
• Published • 22
Block Pruning For Faster Transformers
Paper
• 2109.04838
• Published • 2
The case for 4-bit precision: k-bit Inference Scaling Laws
Paper
• 2212.09720
• Published • 3
Matryoshka Representation Learning
Paper
• 2205.13147
• Published • 25
Language Models are Few-Shot Learners
Paper
• 2005.14165
• Published • 20
Scaling Vision Transformers to 22 Billion Parameters
Paper
• 2302.05442
• Published • 2
Robust Speech Recognition via Large-Scale Weak Supervision
Paper
• 2212.04356
• Published • 53
Emu3: Next-Token Prediction is All You Need
Paper
• 2409.18869
• Published • 98
Neural Architecture Search with Reinforcement Learning
Paper
• 1611.01578
• Published • 2
Regularized Evolution for Image Classifier Architecture Search
Paper
• 1802.01548
• Published • 2
High-Resolution Image Synthesis with Latent Diffusion Models
Paper
• 2112.10752
• Published • 17
Denoising Diffusion Probabilistic Models
Paper
• 2006.11239
• Published • 9
Scalable Diffusion Models with Transformers
Paper
• 2212.09748
• Published • 17
GLIDE: Towards Photorealistic Image Generation and Editing with
Text-Guided Diffusion Models
Paper
• 2112.10741
• Published • 4
Diffusion Models Beat GANs on Image Synthesis
Paper
• 2105.05233
• Published • 2
BitNet: Scaling 1-bit Transformers for Large Language Models
Paper
• 2310.11453
• Published • 107
On the Generalization of SFT: A Reinforcement Learning Perspective with
Reward Rectification
Paper
• 2508.05629
• Published • 191
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
Paper
• 2509.02547
• Published • 238
A Survey of Reinforcement Learning for Large Reasoning Models
Paper
• 2509.08827
• Published • 193
A Survey of Scientific Large Language Models: From Data Foundations to
Agent Frontiers
Paper
• 2508.21148
• Published • 142
Why Language Models Hallucinate
Paper
• 2509.04664
• Published • 199