Juan CM's picture

Juan CM PRO

jucamohedano

·

AI & ML interests

AI Systems MSc at Trento 🚀🤖

Recent Activity

upvoted an article 1 day ago

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

upvoted an article 21 days ago

The Engineering Handbook for GRPO + LoRA with Verl: Training Qwen2.5 on Multi-GPU

upvoted a changelog 28 days ago

Hugging Face Papers for AI Agents

View all activity

Organizations

upvoted an article 1 day ago

Article

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

Aug 18, 2025

•

97

upvoted an article 21 days ago

Article

The Engineering Handbook for GRPO + LoRA with Verl: Training Qwen2.5 on Multi-GPU

Jan 2

•

19

upvoted a changelog 28 days ago

Hugging Face Changelog

Hugging Face Papers for AI Agents

30 days ago

• 137

updated a dataset about 2 months ago

jucamohedano/Qwen3-30B-A3B-Instruct-2507_custom_60_predict

Viewer • Updated Feb 16 • 60 • 10

published a dataset about 2 months ago

jucamohedano/Qwen3-30B-A3B-Instruct-2507_custom_60_predict

Viewer • Updated Feb 16 • 60 • 10

updated a dataset about 2 months ago

jucamohedano/Qwen3-30B-A3B-Instruct-2507_custom_60_cot

Viewer • Updated Feb 16 • 60 • 13

published a dataset about 2 months ago

jucamohedano/Qwen3-30B-A3B-Instruct-2507_custom_60_cot

Viewer • Updated Feb 16 • 60 • 13

updated 2 collections 6 months ago

Model merging

2 items • Updated Nov 1, 2025

Model search via model weights

2 items • Updated Nov 1, 2025

liked a Space 6 months ago

The Smol Training Playbook

The secrets to building world-class LLMs

updated a collection 7 months ago

Model merging

2 items • Updated Nov 1, 2025

upvoted 4 articles 7 months ago

Article

Vision Language Model Alignment in TRL ⚡️

+3

Aug 7, 2025

•

109

Article

KV Cache from scratch in nanoVLM

+3

Jun 4, 2025

•

115

Article

Vision Language Models (Better, faster, stronger)

+3

May 12, 2025

•

606

Article

nanoVLM: The simplest repository to train your VLM in pure PyTorch

+5

May 21, 2025

•

255

liked a Space 9 months ago

The Ultra-Scale Playbook

The ultimate guide to training LLM on large GPU Clusters

upvoted a collection 10 months ago

Model Merging

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 253

updated a collection 11 months ago

Model search via model weights

2 items • Updated Nov 1, 2025

upvoted a paper 11 months ago

Can this Model Also Recognize Dogs? Zero-Shot Model Search from Weights

Paper • 2502.09619 • Published Feb 13, 2025 • 36