Reading List - a nitishpandey04 Collection

Reading List

updated Sep 29, 2025

DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning
Paper • 2504.07128 • Published Apr 2, 2025 • 87

Byte Latent Transformer: Patches Scale Better Than Tokens
Paper • 2412.09871 • Published Dec 13, 2024 • 108

BitNet b1.58 2B4T Technical Report
Paper • 2504.12285 • Published Apr 16, 2025 • 85

FAST: Efficient Action Tokenization for Vision-Language-Action Models
Paper • 2501.09747 • Published Jan 16, 2025 • 29

Towards Generalist Robot Policies: What Matters in Building Vision-Language-Action Models
Paper • 2412.14058 • Published Dec 18, 2024 • 1

π_0: A Vision-Language-Action Flow Model for General Robot Control
Paper • 2410.24164 • Published Oct 31, 2024 • 31

Reflective Planning: Vision-Language Models for Multi-Stage Long-Horizon Robotic Manipulation
Paper • 2502.16707 • Published Feb 23, 2025 • 14

OpenVLA: An Open-Source Vision-Language-Action Model
Paper • 2406.09246 • Published Jun 13, 2024 • 47

RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control
Paper • 2307.15818 • Published Jul 28, 2023 • 32

A Dual Process VLA: Efficient Robotic Manipulation Leveraging VLM
Paper • 2410.15549 • Published Oct 21, 2024

Open X-Embodiment: Robotic Learning Datasets and RT-X Models
Paper • 2310.08864 • Published Oct 13, 2023 • 2

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Paper • 2408.03314 • Published Aug 6, 2024 • 63

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics
Paper • 2506.01844 • Published Jun 2, 2025 • 158

DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter
Paper • 1910.01108 • Published Oct 2, 2019 • 22

Block Pruning For Faster Transformers
Paper • 2109.04838 • Published Sep 10, 2021 • 2

The case for 4-bit precision: k-bit Inference Scaling Laws
Paper • 2212.09720 • Published Dec 19, 2022 • 3

Matryoshka Representation Learning
Paper • 2205.13147 • Published May 26, 2022 • 25

Language Models are Few-Shot Learners
Paper • 2005.14165 • Published May 28, 2020 • 20

Scaling Vision Transformers to 22 Billion Parameters
Paper • 2302.05442 • Published Feb 10, 2023 • 2

Robust Speech Recognition via Large-Scale Weak Supervision
Paper • 2212.04356 • Published Dec 6, 2022 • 53

Emu3: Next-Token Prediction is All You Need
Paper • 2409.18869 • Published Sep 27, 2024 • 98

Neural Architecture Search with Reinforcement Learning
Paper • 1611.01578 • Published Nov 5, 2016 • 2

Regularized Evolution for Image Classifier Architecture Search
Paper • 1802.01548 • Published Feb 5, 2018 • 2

High-Resolution Image Synthesis with Latent Diffusion Models
Paper • 2112.10752 • Published Dec 20, 2021 • 17

Denoising Diffusion Probabilistic Models
Paper • 2006.11239 • Published Jun 19, 2020 • 9

Scalable Diffusion Models with Transformers
Paper • 2212.09748 • Published Dec 19, 2022 • 17

GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models
Paper • 2112.10741 • Published Dec 20, 2021 • 4

Diffusion Models Beat GANs on Image Synthesis
Paper • 2105.05233 • Published May 11, 2021 • 2

BitNet: Scaling 1-bit Transformers for Large Language Models
Paper • 2310.11453 • Published Oct 17, 2023 • 107

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification
Paper • 2508.05629 • Published Aug 7, 2025 • 191

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
Paper • 2509.02547 • Published Sep 2, 2025 • 238

A Survey of Reinforcement Learning for Large Reasoning Models
Paper • 2509.08827 • Published Sep 10, 2025 • 193

A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers
Paper • 2508.21148 • Published Aug 28, 2025 • 142

Why Language Models Hallucinate
Paper • 2509.04664 • Published Sep 4, 2025 • 199