Ran Zilberstein

RanZilberstein-Nvidia

AI & ML interests

None yet

Recent Activity

published an article 29 days ago

Introducing SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding

liked a model 2 months ago

nvidia/DeepSeek-R1-NVFP4

liked a model 9 months ago

nvidia/Llama-3_3-Nemotron-Super-49B-v1_5-FP8

Organizations

published an article 29 days ago

Article

Introducing SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding

liked a model 2 months ago

nvidia/DeepSeek-R1-NVFP4

Text Generation • 397B • Updated Jun 6, 2025 • 4.4k • 277

liked 2 models 9 months ago

nvidia/Llama-3_3-Nemotron-Super-49B-v1_5-FP8

Text Generation • 50B • Updated Oct 15, 2025 • 28.9k • 27

nvidia/Llama-3_3-Nemotron-Super-49B-v1_5

Text Generation • 50B • Updated Oct 15, 2025 • 184k • 231

liked a model about 1 year ago

nvidia/Llama-3_1-Nemotron-Ultra-253B-v1

Text Generation • Updated Oct 15, 2025 • 6.89k • 345

authored a paper about 1 year ago

FFN Fusion: Rethinking Sequential Computation in Large Language Models

Paper • 2503.18908 • Published Mar 24, 2025 • 19

liked 2 models about 1 year ago

nvidia/Llama-3.1-Nemotron-Nano-8B-v1

Text Generation • 8B • Updated Oct 15, 2025 • 57.6k • 222

nvidia/Llama-3_3-Nemotron-Super-49B-v1

Text Generation • 50B • Updated Oct 15, 2025 • 33.8k • 322

authored a paper over 1 year ago

Puzzle: Distillation-Based NAS for Inference-Optimized LLMs

Paper • 2411.19146 • Published Nov 28, 2024 • 20

liked a model over 1 year ago

nvidia/Llama-3_1-Nemotron-51B-Instruct

Text Generation • Updated Jul 6, 2025 • 3.66k • 210

updated a Space over 1 year ago

README

👀
