Nima Nooshiri's picture

Open to Collab

Nima Nooshiri

nimanzik

·

AI & ML interests

None yet

Recent Activity

updated a collection about 18 hours ago

Hugging Face Playbooks & Guidebooks

liked a Space about 18 hours ago

HuggingFaceTB/trl-distillation-trainer

upvoted an article about 18 hours ago

Multimodal Embedding & Reranker Models with Sentence Transformers

View all activity

Organizations

updated a collection about 18 hours ago

Hugging Face Playbooks & Guidebooks

5 items • Updated about 18 hours ago

liked a Space about 18 hours ago

Distilling 100B+ Models 40x Faster with TRL

TRL distillation for 100B+ teachers, 40x faster

upvoted an article about 18 hours ago

Article

Multimodal Embedding & Reranker Models with Sentence Transformers

5 days ago

•

38

upvoted 2 articles 6 days ago

Article

AssetOpsBench: Bridging the Gap Between AI Agent Benchmarks and Industrial Reality

Jan 21

•

32

Article

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

+7

Mar 10

•

124

upvoted an article 9 days ago

Article

Liberate your OpenClaw

+6

18 days ago

•

42

updated a collection 22 days ago

Hugging Face Playbooks & Guidebooks

5 items • Updated about 18 hours ago

liked 3 Spaces 22 days ago

The Synthetic Data Playbook: Generating Trillions of the Finest Tokens

Explore synthetic data experiments on a virtual bookshelf

The Smol Training Playbook

The secrets to building world-class LLMs

Evaluation Guidebook

Explore LLM benchmark trends over time

upvoted an article about 1 month ago

Article

Ulysses Sequence Parallelism: Training with Million-Token Contexts

Mar 9

•

26

upvoted 3 articles about 2 months ago

Article

Mixture of Experts Explained

+4

Dec 11, 2023

•

1.11k

Article

Mixture of Experts (MoEs) in Transformers

+5

Feb 26

•

153

Article

Train AI models with Unsloth and Hugging Face Jobs for FREE

+4

Feb 20

•

96

liked a Space about 2 months ago

QED-Nano: Teaching a Tiny Model to Prove Hard Theorems

Who needs 1T parameters? Olympiad proofs with a 4B model

upvoted a paper about 2 months ago

PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model

Paper • 2510.14528 • Published Oct 16, 2025 • 124

upvoted an article 3 months ago

Article

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

+3

May 24, 2023

•

176