7 48 53

Manan Shah

cs-mshah

https://cs-mshah.github.io/

AI & ML interests

Computer Vision

Recent Activity

liked a Space 1 day ago

HuggingFaceTB/trl-distillation-trainer

upvoted an article 15 days ago

2. Attention Optimizations: From Standard Attention to FlashAttention

upvoted an article 15 days ago

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

View all activity

Organizations

liked a Space 1 day ago

Distilling 100B+ Models 40x Faster with TRL

📝

TRL distillation for 100B+ teachers, 40x faster

liked a dataset 20 days ago

allenai/MolmoWeb-SyntheticTrajs

Viewer • Updated 6 days ago • 108k • 1.56k • 9

liked a model 22 days ago

nvidia/nvpanoptix-3d-v1.1-matterport3d

Updated 24 days ago • 131 • 18

liked a dataset about 1 month ago

bones-studio/seed

Updated 14 days ago • 7.26k • 101

liked a dataset about 2 months ago

flashinfer-ai/mlsys26-contest

Updated 10 days ago • 517 • 11

liked a model about 2 months ago

nvidia/Qwen3.5-397B-A17B-NVFP4

Text Generation • Updated 17 days ago • 482k • 89

liked 2 datasets about 2 months ago

tencent/HY3D-Bench

Updated 7 days ago • 113k • 86

cindyxl/ObjaversePlusPlus

Viewer • Updated Dec 4, 2025 • 789k • 204 • 16

liked a Space 3 months ago

Ai Paper Finder

👀

Find AI research papers with a simple query

liked 2 models 3 months ago

nvidia/personaplex-7b-v1

Audio-to-Audio • Updated Mar 2 • 496k • 2.44k

rednote-hilab/dots.ocr

Image-Text-to-Text • 3B • Updated Oct 31, 2025 • 175k • 1.29k

liked a Space 3 months ago

The Smol Training Playbook

📚

3.11k

The secrets to building world-class LLMs

liked 2 datasets 3 months ago

facebook/action100m-preview

Viewer • Updated Jan 29 • 120k • 1.14k • 140

genrobot2025/10Kh-RealOmin-OpenData

Updated less than a minute ago • 69.7k • 198

liked a dataset 4 months ago

Daniellesry/TransPhy3D

Preview • Updated Dec 31, 2025 • 6.16k • 76

liked 2 Spaces 4 months ago

Evaluation Guidebook

📝

302

Explore LLM benchmark trends over time

The Ultra-Scale Playbook

🌌

3.79k

The ultimate guide to training LLM on large GPU Clusters

liked 3 models 4 months ago